Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

4.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.0K
Regression Toward the Mean01:52

Regression Toward the Mean

7.2K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
7.2K
Correlation and Regression00:53

Correlation and Regression

3.5K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
3.5K
Regression Analysis01:11

Regression Analysis

8.5K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.5K
Microsoft Excel: Regression Analysis01:18

Microsoft Excel: Regression Analysis

1.6K
Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...
1.6K
Multiple Allele Traits01:49

Multiple Allele Traits

38.2K
The Concept of Multiple Allelism
38.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Causal Effect Estimation With TMLE: Handling Missing Data and Near Violations of Positivity.

Biometrical journal. Biometrische Zeitschrift·2026
Same author

In Vitro Study to Evaluate the Antibacterial Effect of an Oxidising Agent on Ex Vivo Biofilm.

Oral health & preventive dentistry·2026
Same author

Local and global mortality experience: A novel hierarchical model for regional mortality risk.

PloS one·2026
Same author

Glucose-6-Phosphatase-Dehydrogenase activity as modulative association between Parkinson's disease and periodontitis.

Frontiers in cellular and infection microbiology·2024
Same author

The Impact of Implant Abutment Angle and Height on Peri-implant Tissue Health: Retrospective Analyses from a Randomized Controlled Clinical Trial.

The International journal of prosthodontics·2024
Same author

Association between Average Vitamin D Levels and COVID-19 Mortality in 19 European Countries-A Population-Based Study.

Nutrients·2023
Same journal

Regression analysis of misclassified current status data with potentially unknown test accuracy.

Statistical methods in medical research·2026
Same journal

Bayesian multivariate linear mixed-effects models with varied association structures.

Statistical methods in medical research·2026
Same journal

Inference about the ratio of age-standardized rates between two overlapping populations.

Statistical methods in medical research·2026
Same journal

A robust neural network with random effects for subject-specific prediction of clustered count data.

Statistical methods in medical research·2026
Same journal

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints.

Statistical methods in medical research·2026
Same journal

Joint analysis of longitudinal and recurrent event data: A functional regression approach with autoregressive frailty.

Statistical methods in medical research·2026
See all related articles

Related Experiment Video

Updated: Feb 14, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.9K

Multiple imputation with sequential penalized regression.

Faisal M Zahid1,2, Christian Heumann2

  • 11 Department of Statistics, Government College University, Faisalabad, Pakistan.

Statistical Methods in Medical Research
|February 17, 2018
PubMed
Summary
This summary is machine-generated.

A new multiple imputation algorithm, mispr, effectively handles missing data in large datasets with many covariates. It outperforms existing methods, offering better imputation accuracy and regression estimates, especially with limited sample sizes.

Keywords:
Conditional distributionhigh-dimensional datamissing datamultiple imputationregularization

More Related Videos

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos
09:20

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Published on: May 14, 2019

14.9K
Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures
09:04

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Published on: October 29, 2016

12.1K

Related Experiment Videos

Last Updated: Feb 14, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.9K
Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos
09:20

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Published on: May 14, 2019

14.9K
Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures
09:04

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Published on: October 29, 2016

12.1K

Area of Science:

  • Statistics
  • Data Science
  • Biostatistics

Background:

  • Missing data is a pervasive challenge in research, impacting estimation and inference.
  • Existing multiple imputation software struggles with datasets featuring numerous covariates with missing values.
  • Maximum Likelihood Estimation (MLE) can be suboptimal with many predictors relative to sample size.

Purpose of the Study:

  • To introduce mispr, a novel multiple imputation algorithm designed for complex datasets.
  • To address limitations of current software in handling high-dimensional missing data.
  • To improve imputation accuracy and regression analysis performance.

Main Methods:

  • Developed mispr using sequential penalized regression models with ridge penalty.
  • Each variable's imputation model allows for different distributional forms.
  • Employed a quadratic penalty for unique parameter estimates and improved predictions, especially when predictors exceed sample size.

Main Results:

  • mispr demonstrated superior performance compared to mice, VIM, and Amelia in simulation studies.
  • Achieved lower mean squared imputation error (MSE), mean absolute imputation error (MAIE), and regression MSE.
  • Performance gains were particularly pronounced with an increasing number of covariates, especially in small sample scenarios.

Conclusions:

  • mispr offers a robust and accurate solution for multiple imputation, particularly in high-dimensional settings.
  • The algorithm provides a favorable bias-variance trade-off, enhancing predictive accuracy.
  • mispr is a competitive alternative to existing imputation methods, showing significant advantages in challenging data conditions.