Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

270
Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
270
Variables Affecting Phosphorescence and Fluorescence01:26

Variables Affecting Phosphorescence and Fluorescence

1.5K
Fluorescence and phosphorescence are essential phenomena in fields like analytical chemistry, biological imaging, and materials science, where they detect molecular properties and visualize cellular structures. Understanding the variables that influence these luminescent behaviors is crucial for maximizing accuracy and efficiency in their applications. These variables can broadly be grouped into chemical structure, solvent properties, and external conditions, each playing a distinct role in...
1.5K
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

936
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
936
Variability: Analysis01:11

Variability: Analysis

500
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
500
Random Variables01:09

Random Variables

17.8K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.8K
Statistical Methods to Analyze Parametric Data: ANOVA01:12

Statistical Methods to Analyze Parametric Data: ANOVA

1.6K
Analysis of Variance, or ANOVA, is a powerful statistical technique used to analyze parametric data, primarily in research and experimental studies. It's designed to compare the means of two or more groups, assisting researchers in identifying any significant differences between these group means. There are two main types of ANOVA based on the complexity of the analysis: one-way and two-way.
One-way ANOVA is applied when a single independent variable or factor is scrutinized. It compares...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Is the whole more than the sum of its parts? Considering global and local features of the connectome improves prediction of individuals and phenotypes.

Imaging neuroscience (Cambridge, Mass.)·2026
Same author

Dynamic interplay between food addiction, psychological and behavioral factors, and weight-related measures: A longitudinal network analysis in developing youth.

Journal of behavioral addictions·2026
Same author

A NOVEL BAYESIAN FRAMEWORK UNCOVERING BRAIN CONNECTIVITY-TO-SHAPE RELATIONSHIP IN PRECLINICAL ALZHEIMER'S DISEASE.

The annals of applied statistics·2026
Same author

Correction: Association of intestinal mucosal barrier function with intestinal microbiota in Spleen-Kidney Yang Deficiency IBS-D mice.

Frontiers in microbiology·2026
Same author

Adaptive Integration of Incomplete Multimodal 3D Neuroimaging for Alzheimer's Prediction and Biomarker Discovery.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Validation of Smartwatches Integrated With Photoplethysmography for Continuous Evaluation of Atrial Fibrillation Burden.

JACC. Clinical electrophysiology·2026
Same journal

Contrastive Dimension Reduction: A Systematic Review.

Wiley interdisciplinary reviews. Computational statistics·2026
Same journal

Linear Dimensionality Reduction Methods for Analyzing Structured Biomedical Data: Existing Research and Future Opportunities.

Wiley interdisciplinary reviews. Computational statistics·2025
Same journal

Vector AutoRegressive Moving Average Models: A Review.

Wiley interdisciplinary reviews. Computational statistics·2025
Same journal

The discrete empirical interpolation method in class identification and data summarization.

Wiley interdisciplinary reviews. Computational statistics·2025
Same journal

Neuroimaging statistical approaches for determining neural correlates of Alzheimer's disease via positron emission tomography imaging.

Wiley interdisciplinary reviews. Computational statistics·2024
Same journal

Genome-wide Prediction of Chromatin Accessibility Based on Gene Expression.

Wiley interdisciplinary reviews. Computational statistics·2024
See all related articles

Related Experiment Video

Updated: Jan 28, 2026

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump
09:04

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump

Published on: June 1, 2022

3.6K

Variable Selection in the Presence of Missing Data: Imputation-based Methods.

Yize Zhao1, Qi Long2

  • 1Department of Healthcare Policy and Research, Weill Cornell Medical College, Cornell University.

Wiley Interdisciplinary Reviews. Computational Statistics
|November 1, 2017
PubMed
Summary
This summary is machine-generated.

Variable selection in regression analysis is crucial for accurate models. New methods are needed to effectively handle missing data using imputation techniques for better predictive performance.

Keywords:
MARMCARMNARbootstrapimputationmissing dataresamplingvariable selection

More Related Videos

A Rapid Method for Modeling a Variable Cycle Engine
04:58

A Rapid Method for Modeling a Variable Cycle Engine

Published on: August 13, 2019

8.0K
Calculating Heart Rate Variability from ECG Data from Youth with Cerebral Palsy During Active Video Game Sessions
08:12

Calculating Heart Rate Variability from ECG Data from Youth with Cerebral Palsy During Active Video Game Sessions

Published on: June 5, 2019

20.5K

Related Experiment Videos

Last Updated: Jan 28, 2026

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump
09:04

A Modeling and Simulation Method for Preliminary Design of an Electro-Variable Displacement Pump

Published on: June 1, 2022

3.6K
A Rapid Method for Modeling a Variable Cycle Engine
04:58

A Rapid Method for Modeling a Variable Cycle Engine

Published on: August 13, 2019

8.0K
Calculating Heart Rate Variability from ECG Data from Youth with Cerebral Palsy During Active Video Game Sessions
08:12

Calculating Heart Rate Variability from ECG Data from Youth with Cerebral Palsy During Active Video Game Sessions

Published on: June 5, 2019

20.5K

Area of Science:

  • Statistics
  • Data Science
  • Machine Learning

Background:

  • Variable selection is vital for regression analysis and predictive accuracy.
  • Existing methods are well-developed for complete data but require adaptation for missing data.
  • Handling missing data is essential for reliable statistical modeling.

Purpose of the Study:

  • To explore statistical methods for variable selection in the presence of missing data.
  • To investigate the integration of imputation techniques with variable selection strategies.
  • To highlight the need for robust methods under missing at random (MAR) and missing completely at random (MCAR) assumptions.

Main Methods:

  • Review of three main strategies for variable selection with imputation.
  • Strategy 1: Applying selection methods to imputed datasets and combining results.
  • Strategy 2: Applying selection methods to stacked imputed datasets.
  • Strategy 3: Combining imputation with resampling techniques like bootstrap.

Main Results:

  • Imputation-based variable selection methods are of particular interest.
  • Current methods are valid under MAR and MCAR assumptions.
  • The field of variable selection with missing data is still developing.

Conclusions:

  • Variable selection methods must account for missing data mechanisms.
  • Imputation offers a practical approach for handling missing data in variable selection.
  • Further research is needed to advance variable selection techniques for incomplete datasets.