Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Truncation in Survival Analysis01:09

Truncation in Survival Analysis

685
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
685
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

483
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
483
Contingency Table01:29

Contingency Table

4.8K
A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...
4.8K
Censoring Survival Data01:09

Censoring Survival Data

631
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
631
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

697
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
697
Survival Tree01:19

Survival Tree

464
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
464

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Assessing the quality of electronic health record data and the claims linked data for target trial emulation studies.

JAMIA open·2026
Same author

PEPRVision: a visualization framework for Pharmacoepidemiology research.

American journal of epidemiology·2026
Same author

Effectiveness of THC-containing cannabis for inflammatory bowel disease: a systematic review.

Journal of cannabis research·2026
Same author

Joint Position Statement from the Society for Birth Defects Research and Prevention, the Organization of Teratology Information Specialists, and the Developmental Neurotoxicology Society: A Call for Implementation of Risk Evaluation and Mitigation Strategies to Reduce Prenatal Exposure to Valproate.

Birth defects research·2026
Same author

Risk of Fetal Exposure to Teratogenic Medications: Development of Evidence for the Teratogenic Risk Impact and Mitigation (TRIM) Tool.

Drug safety·2026
Same author

Pain and treatment outcomes after initiating methadone vs buprenorphine among medicare patients with opioid use disorder and comorbid chronic pain: A target trial emulation.

PLoS medicine·2026
Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026
Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026
Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026
Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026
Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026
Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Mar 24, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.5K

Sequential BART for imputation of missing covariates.

Dandan Xu1, Michael J Daniels2, Almut G Winterstein3

  • 1Department of Statistics, University of Florida, Gainesville, FL 32601, USA.

Biostatistics (Oxford, England)
|March 17, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces a flexible Bayesian nonparametric approach to accurately impute missing covariates in electronic health records (EHR) data. This method improves efficiency and reduces bias in comparative effectiveness research.

Keywords:
Bayesian additive regression treesCongenial modelsMultiple imputation

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Related Experiment Videos

Last Updated: Mar 24, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.5K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Area of Science:

  • Biostatistics
  • Health Informatics
  • Data Science

Background:

  • Comparative effectiveness research using electronic health records (EHR) requires numerous covariates to address biases.
  • Missing data in these covariates is common and leads to efficiency losses and potential bias.
  • Standard multiple imputation methods may fail to capture complex relationships or face compatibility issues.

Purpose of the Study:

  • To develop and evaluate a flexible Bayesian nonparametric approach for imputing missing covariates in EHR data.
  • To address limitations of standard imputation techniques in handling nonlinear relationships and uncongeniality.
  • To improve the accuracy and efficiency of comparative effectiveness research with incomplete covariate data.

Main Methods:

  • A Bayesian nonparametric approach modeling sequential conditional distributions of covariates.
  • Utilizing Bayesian additive regression trees (BART) for univariate conditional modeling.
  • Employing data augmentation for simultaneous posterior sampling of conditionals.

Main Results:

  • The proposed method effectively imputes missing covariates, handling missingness at random mechanisms.
  • Demonstrated superior performance compared to parametric sequential imputation and multiple imputation by chained equations.
  • Successfully applied to EHR data to identify factors associated with hyperglycemia.

Conclusions:

  • The Bayesian nonparametric approach offers a flexible and robust solution for missing covariate data in EHR.
  • This method enhances the reliability of comparative effectiveness research by mitigating bias and improving efficiency.
  • The approach is suitable for complex datasets and can be applied to various health research questions.