Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
Prediction Intervals01:03

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
The...
Survival Tree01:19

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a survival tree begins...
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are observed.
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...
Introduction To Survival Analysis01:18

Introduction To Survival Analysis

Survival analysis is a statistical method used to study time-to-event data, where the "event" might represent outcomes like death, disease relapse, system failure, or recovery. A unique feature of survival data is censoring, which occurs when the event of interest has not been observed for some individuals during the study period. This requires specialized techniques to handle incomplete data effectively.
The primary goal of survival analysis is to estimate survival time—the time until a...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

MIND: multimodal integration with neighbourhood-aware distributions.

Nature communications·2026
Same author

Investigating the effect of climate and air pollution on prescription uptake in the England.

BMC public health·2026
Same author

Intimate partner violence, multiple mental health conditions and risk of small vulnerable newborn births: a maternity population-based data linkage study.

EClinicalMedicine·2026
Same author

Understanding the spatial determinants of the Oxford Classic prognostic signature for high-grade serous ovarian cancer.

Communications medicine·2026
Same author

The effect of intracranial pressure monitoring on severe traumatic brain injury patients who undergo subdural hematoma evacuation.

Neurosurgical review·2026
Same author

Autoimmune diseases and risk of adverse pregnancy outcomes: a population-based cohort study of five million pregnancies in the UK.

BMC medicine·2026
Same journal

Designing an mHealth App for Defecatory Function Rehabilitation of Post-Surgical Colorectal Cancer Survivors.

Journal of healthcare informatics research·2026
Same journal

A Literature Review on Example-Based Explanations in Medical Image Analysis.

Journal of healthcare informatics research·2026
Same journal

A Scoping Review of Synthetic Data Generation by Language Models in Biomedical Research and Application: Data Utility and Quality Perspectives.

Journal of healthcare informatics research·2026
Same journal

Towards Accurate and Reliable ICU Outcome Prediction: A Multimodal Learning Framework Based on Belief Function Theory using Structured EHRs and Free-Text Notes.

Journal of healthcare informatics research·2026
Same journal

Multimodal Federated Learning in Healthcare: A Review.

Journal of healthcare informatics research·2026
Same journal

A Hybrid Language Framework for Ontology-Based Clinical Concept Extraction.

Journal of healthcare informatics research·2026
See all related articles

Related Experiment Video

Updated: May 12, 2026

DeepOmicsAE: Representing Signaling Modules in Alzheimer's Disease with Deep Learning Analysis of Proteomics, Metabolomics, and Clinical Data
09:47

DeepOmicsAE: Representing Signaling Modules in Alzheimer's Disease with Deep Learning Analysis of Proteomics, Metabolomics, and Clinical Data

Published on: December 15, 2023

Imputation Free Deep Survival Prediction with Conditional Variational Autoencoders.

Natalia Hong1,2,3, Aditya Acharya4, Krishna Gokhale4

  • 1Nuffield Department of Women's & Reproductive Health, University of Oxford, Oxford, United Kingdom.

Journal of Healthcare Informatics Research
|May 11, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel imputation-free framework for risk prediction using incomplete Electronic Health Records (EHRs). The method enhances model reliability and robustness in clinical decision-making.

Keywords:
Deep learningElectronic health recordsMissing dataSurvival predictionVariational autoencoder

Related Experiment Videos

Last Updated: May 12, 2026

DeepOmicsAE: Representing Signaling Modules in Alzheimer's Disease with Deep Learning Analysis of Proteomics, Metabolomics, and Clinical Data
09:47

DeepOmicsAE: Representing Signaling Modules in Alzheimer's Disease with Deep Learning Analysis of Proteomics, Metabolomics, and Clinical Data

Published on: December 15, 2023

Area of Science:

  • Healthcare Informatics
  • Machine Learning in Medicine
  • Biostatistics

Background:

  • Electronic Health Records (EHRs) are valuable for clinical decision-making but often contain incomplete data due to selective recording.
  • Missing data patterns in EHRs can be informative but pose challenges for predictive model reliability and transportability.
  • Existing methods struggle with missingness that varies between model development and real-world deployment.

Purpose of the Study:

  • To develop an imputation-free framework for risk prediction directly from incomplete EHR data.
  • To improve the reliability and transportability of predictive models in healthcare settings.
  • To address challenges posed by informative and shifting missing data patterns.

Main Methods:

  • Jointly training Conditional Variational Autoencoders with deep survival models.
  • Utilizing the deep survival model DeSurv for risk prediction.
  • Evaluating performance through simulation studies and retrospective cohorts from the Clinical Practice Research Datalink database.

Main Results:

  • The proposed framework consistently outperformed conventional missing data methods.
  • Achieved superior performance on ground-truth metrics in simulations.
  • Demonstrated improved calibration-based survival metrics and robustness to unseen missingness patterns in real-world cohorts.

Conclusions:

  • The imputation-free framework offers a unified strategy for handling missing EHR data across the model lifecycle.
  • Advances methodological robustness in healthcare informatics.
  • Supports more reliable and robust clinical risk prediction in practice.