Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

Regression Toward the Mean

Regression Toward the Mean

Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...

Correlation and Regression

Correlation and Regression

In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

Microsoft Excel: Regression Analysis

Microsoft Excel: Regression Analysis

Regression analysis in Microsoft Excel is a powerful statistical method for examining the relationship between a dependent variable and one or more independent variables. It's used extensively in fields such as economics, biology, and business to predict outcomes, understand relationships, and make data-driven decisions. The most common type is linear regression, which attempts to fit a straight line through the data points to model the relationship between variables.
To perform regression...

Multiple Allele Traits

Multiple Allele Traits

The Concept of Multiple Allelism

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Causal Effect Estimation With TMLE: Handling Missing Data and Near Violations of Positivity.

Biometrical journal. Biometrische Zeitschrift·2026

Same author

In Vitro Study to Evaluate the Antibacterial Effect of an Oxidising Agent on Ex Vivo Biofilm.

Oral health & preventive dentistry·2026

Same author

Local and global mortality experience: A novel hierarchical model for regional mortality risk.

PloS one·2026

Same author

Glucose-6-Phosphatase-Dehydrogenase activity as modulative association between Parkinson's disease and periodontitis.

Frontiers in cellular and infection microbiology·2024

Same author

The Impact of Implant Abutment Angle and Height on Peri-implant Tissue Health: Retrospective Analyses from a Randomized Controlled Clinical Trial.

The International journal of prosthodontics·2024

Same author

Association between Average Vitamin D Levels and COVID-19 Mortality in 19 European Countries-A Population-Based Study.

Nutrients·2023

Same journal

Regression analysis of misclassified current status data with potentially unknown test accuracy.

Statistical methods in medical research·2026

Same journal

Bayesian multivariate linear mixed-effects models with varied association structures.

Statistical methods in medical research·2026

Same journal

Inference about the ratio of age-standardized rates between two overlapping populations.

Statistical methods in medical research·2026

Same journal

A robust neural network with random effects for subject-specific prediction of clustered count data.

Statistical methods in medical research·2026

Same journal

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints.

Statistical methods in medical research·2026

Same journal

Joint analysis of longitudinal and recurrent event data: A functional regression approach with autoregressive frailty.

Statistical methods in medical research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 14, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Multiple imputation with sequential penalized regression.

Faisal M Zahid^1,2, Christian Heumann²

¹1 Department of Statistics, Government College University, Faisalabad, Pakistan.

Statistical Methods in Medical Research

|February 17, 2018

Summary

This summary is machine-generated.

A new multiple imputation algorithm, mispr, effectively handles missing data in large datasets with many covariates. It outperforms existing methods, offering better imputation accuracy and regression estimates, especially with limited sample sizes.

Keywords:

Conditional distribution high-dimensional data missing data multiple imputation regularization

More Related Videos

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Published on: May 14, 2019

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Published on: October 29, 2016

Related Experiment Videos

Last Updated: Feb 14, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Sequential Immunofluorescence and Immunohistochemistry on Cryosectioned Zebrafish Embryos

Published on: May 14, 2019

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Assessment of Labile Organic Carbon in Soil Using Sequential Fumigation Incubation Procedures

Published on: October 29, 2016

Area of Science:

Statistics
Data Science
Biostatistics

Background:

Missing data is a pervasive challenge in research, impacting estimation and inference.
Existing multiple imputation software struggles with datasets featuring numerous covariates with missing values.
Maximum Likelihood Estimation (MLE) can be suboptimal with many predictors relative to sample size.

Purpose of the Study:

To introduce mispr, a novel multiple imputation algorithm designed for complex datasets.
To address limitations of current software in handling high-dimensional missing data.
To improve imputation accuracy and regression analysis performance.

Main Methods:

Developed mispr using sequential penalized regression models with ridge penalty.
Each variable's imputation model allows for different distributional forms.
Employed a quadratic penalty for unique parameter estimates and improved predictions, especially when predictors exceed sample size.

Main Results:

mispr demonstrated superior performance compared to mice, VIM, and Amelia in simulation studies.
Achieved lower mean squared imputation error (MSE), mean absolute imputation error (MAIE), and regression MSE.
Performance gains were particularly pronounced with an increasing number of covariates, especially in small sample scenarios.

Conclusions:

mispr offers a robust and accurate solution for multiple imputation, particularly in high-dimensional settings.
The algorithm provides a favorable bias-variance trade-off, enhancing predictive accuracy.
mispr is a competitive alternative to existing imputation methods, showing significant advantages in challenging data conditions.