Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Multiple imputation using an iterative hot-deck with distance-based donor selection.

Juned Siddique1, Thomas R Belin

  • 1Department of Health Studies, University of Chicago, Chicago, IL 60637, U.S.A. siddique@uchicago.edu

Statistics in Medicine
|July 20, 2007
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Effects of the Fit2Thrive technology-supported physical activity promotion intervention components on sedentary behavior and light physical activity in breast cancer survivors.

Journal of cancer survivorship : research and practice·2026
Same author

The co-creation of Foot Selfie, a patient-centered mobile intervention to prevent diabetic foot ulcers.

BMC health services research·2026
Same author

Letter to the Editor Regarding "How Choice of Effect Measure Influences Minimally Sufficient Adjustment Sets for External Validity" by Webster-Clark and Keil (2023): The Covariate Class Taxonomy Should Be Strictly Defined With Respect To The Full Set Of Potential Outcome-Generating Covariates.

American journal of epidemiology·2026
Same author

Imputation of Missing Continuous Glucose Monitor Data.

Journal of diabetes science and technology·2026
Same author

Modeling a mixture of ordinal and continuous repeated measures.

Journal of statistical computation and simulation·2026
Same author

A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values.

Journal of multivariate analysis·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
See all related articles

This study introduces iterative hot-deck multiple imputation, a novel method for handling missing data that combines ease of use with Bayesian statistical features. The new approach effectively balances bias and variance in statistical estimates.

Area of Science:

  • Statistics
  • Computational Statistics
  • Data Science

Background:

  • Missing data present challenges in statistical analysis, potentially biasing results.
  • Traditional hot-deck imputation methods lack modern statistical computing features.
  • Bayesian approaches offer robust methods but can be computationally intensive.

Purpose of the Study:

  • To develop an iterative hot-deck multiple imputation method with distance-based donor selection.
  • To integrate the ease of hot-deck imputation with the strengths of Bayesian methods.
  • To evaluate the impact of various parameters on imputation accuracy and statistical inference.

Main Methods:

  • Implemented iterative hot-deck multiple imputation using distance-based donor selection.
  • Defined distance as a monotonic function of predictive mean differences.

Related Experiment Videos

  • Donors were selected with probability inversely proportional to distance.
  • Explored variations including nearest-neighbor and random hot-deck imputation.
  • Iterated the procedure, drawing parallels to Markov Chain Monte Carlo (MCMC) methods.
  • Main Results:

    • The proposed distance measure effectively balances bias and variance in estimates.
    • Inferences from a depression treatment trial were largely insensitive to the choice of distance metric.
    • While 10 iterations showed some differences from 1 iteration, 500 iterations yielded no meaningful changes from 10.
    • Starting values had no impact on inferences, but the order of variable imputation was significant.

    Conclusions:

    • Iterative hot-deck multiple imputation offers a practical yet statistically sound approach to missing data.
    • The method provides a valuable alternative for researchers seeking to leverage Bayesian principles without excessive computational complexity.
    • Careful consideration of variable imputation order is recommended for robust statistical inference.