Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Steps in Outbreak Investigation01:18

Steps in Outbreak Investigation

114
In the ever-evolving field of public health, statistical analysis serves as a cornerstone for understanding and managing disease outbreaks. By leveraging various statistical tools, health professionals can predict potential outbreaks, analyze ongoing situations, and devise effective responses to mitigate impact. For that to happen, there are a few possible stages of the analysis:
114
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

33
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
33
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

111
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
111
Statistical Software for Data Analysis and Clinical Trials01:12

Statistical Software for Data Analysis and Clinical Trials

517
Statistical software is pivotal in data analysis and clinical trials by providing tools to analyze data, draw conclusions, and make predictions. These software packages range from simple data management applications to complex analytical platforms, supporting various statistical tests, models, and simulation techniques. Their significance lies in their ability to handle vast amounts of data with precision and efficiency, enabling researchers to validate hypotheses, identify trends, and make...
517
Censoring Survival Data01:09

Censoring Survival Data

72
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
72

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dynamic interplay between food addiction, psychological and behavioral factors, and weight-related measures: A longitudinal network analysis in developing youth.

Journal of behavioral addictions·2026
Same author

Correction: Association of intestinal mucosal barrier function with intestinal microbiota in Spleen-Kidney Yang Deficiency IBS-D mice.

Frontiers in microbiology·2026
Same author

Adaptive Integration of Incomplete Multimodal 3D Neuroimaging for Alzheimer's Prediction and Biomarker Discovery.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

A powerful representation learning method for enhanced analysis of incomplete multi-omics data.

NPJ systems biology and applications·2026
Same author

Melanoma-patient-derived xenograft multi-omics resource melPDomiX maps gain- and loss-of-function alterations.

Cell reports·2026
Same author

Reversing the Hydrogenation Pathways of Nitrogen-Containing Intermediates for the Kinetics-Matched Urea Electrosynthesis.

Angewandte Chemie (International ed. in English)·2026
Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026
Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026
Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026
Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026
Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026
Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026
See all related articles

Related Experiment Video

Updated: Jun 13, 2025

A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

149

SADI: Similarity-Aware Diffusion Model-Based Imputation for Incomplete Temporal EHR Data.

Zongyu Dai1, Emily Getzen1, Qi Long1

  • 1University of Pennsylvania.

Proceedings of Machine Learning Research
|September 13, 2024
PubMed
Summary
This summary is machine-generated.

A new method called Similarity-Aware Diffusion Model-Based Imputation (SADI) improves electronic health record (EHR) data imputation. SADI effectively handles missing data in sparse temporal EHRs by considering similar patients, outperforming existing models.

More Related Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K
A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.7K

Related Experiment Videos

Last Updated: Jun 13, 2025

A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

149
Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K
A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data
10:46

A Method of Trigonometric Modelling of Seasonal Variation Demonstrated with Multiple Sclerosis Relapse Data

Published on: December 9, 2015

10.7K

Area of Science:

  • Health Informatics
  • Machine Learning
  • Biostatistics

Background:

  • Missing values in temporal electronic health records (EHR) complicate analysis and bias results.
  • Current imputation methods struggle with sparse EHR data common in non-ICU settings.
  • Existing models rely on temporal and feature correlations, which are weak in sparse data.

Purpose of the Study:

  • To introduce a novel imputation method, Similarity-Aware Diffusion Model-Based Imputation (SADI), for temporal EHR data.
  • To address the limitations of current state-of-the-art (SOTA) models in handling sparse EHR data.
  • To leverage diffusion models and patient similarity for improved imputation.

Main Methods:

  • Developed SADI, a diffusion model-based imputation technique.
  • Incorporated a similarity-aware denoising function with a self-attention mechanism.
  • Modeled correlations across time points, features, and similar patients.

Main Results:

  • SADI demonstrated superior performance over SOTA imputation methods.
  • Experiments were conducted on Critical Path For Alzheimer's Disease (CPAD) and PhysioNet Challenge 2012 datasets.
  • SADI proved effective under various missing data mechanisms (MCAR, MAR, MNAR).

Conclusions:

  • SADI offers a significant advancement in imputing missing temporal EHR data.
  • The method's ability to utilize information from similar patients is a key innovation.
  • SADI provides a robust solution for EHR data imputation, especially in non-ICU settings.