Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a survival tree begins...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are observed.
Regression Analysis01:11

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Outcome-Assisted Multiple Imputation of Missing Treatments.

Observational studies·2026
Same author

Optimal <i>F</i>-score Matching for Bipartite Record Linkage.

Statistics and computing·2026
Same author

Dyadic association between support persons' attitudes towards medication for opioid use disorder and patients' mental health outcomes.

Drug and alcohol dependence·2026
Same author

Fully Synthetic Data for Complex Surveys.

Survey methodology·2025
Same author

Studying Chinese immigrants' spatial distribution in the Raleigh-Durham area by linking survey and commercial data using romanized names.

Journal of the Royal Statistical Society. Series A, (Statistics in Society)·2025
Same author

Evaluating Binary Outcome Classifiers Estimated from Survey Data.

Epidemiology (Cambridge, Mass.)·2024
Same journal

Correction to: Developing a novel algorithm to identify incident and prevalent dementia in Medicare claims-the ARIC Study.

American journal of epidemiology·2026
Same journal

RE: advancing observational research on arts and health: theory-informed approaches using the RADIANCE framework.

American journal of epidemiology·2026
Same journal

Maternal Cesarean Section and Offspring ASD or ADHD Risk: A Nurses' Health Study II Analysis.

American journal of epidemiology·2026
Same journal

Immigration and epigenetic age acceleration in the health and retirement study: differences Between Hispanics and Non-Hispanics.

American journal of epidemiology·2026
Same journal

Age-period-cohort analysis of low birth weight, early term delivery, induction and csection, US births 1990-2019.

American journal of epidemiology·2026
Same journal

A measurement comparability study to support instrument harmonization in the Canadian longitudinal study on aging.

American journal of epidemiology·2026
See all related articles

Related Experiment Video

Updated: Jun 8, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Multiple imputation for missing data via sequential regression trees.

Lane F Burgette1, Jerome P Reiter

  • 1Department of Statistical Science, Duke University, Durham, North Carolina 27708, USA. lb131@stat.duke.edu

American Journal of Epidemiology
|September 16, 2010
PubMed
Summary
This summary is machine-generated.

This study introduces a new nonparametric method using sequential regression trees for multiple imputation in large epidemiologic studies. This approach handles complex data relationships more effectively, leading to more reliable research findings.

Related Experiment Videos

Last Updated: Jun 8, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Area of Science:

  • Epidemiology
  • Biostatistics
  • Data Science

Background:

  • Missing data is a common challenge in large epidemiologic studies, impacting analysis validity.
  • Standard multiple imputation methods struggle with complex relationships (interactions, nonlinearities) among numerous variables.
  • Accurate imputation is crucial for reliable inferences in studies with many data users and diverse analyses.

Purpose of the Study:

  • To present a novel nonparametric approach for multiple imputation using sequential regression trees.
  • To demonstrate the method's ability to capture complex variable relationships in large datasets.
  • To improve the accuracy and reliability of imputations in complex epidemiologic settings.

Main Methods:

  • Multiple imputation via chained equations (MICE) framework.
  • Utilized sequential regression trees as nonparametric conditional models within MICE.
  • Simulations were conducted to compare the new method against standard techniques.
  • Applied the approach to a real-world dataset on adverse birth outcomes with over 100 variables.

Main Results:

  • The nonparametric regression tree approach demonstrated superior performance in simulations for complex data structures.
  • Imputations generated by the new method were more plausible, leading to more reliable inferences.
  • Posterior predictive checks confirmed the validity of imputations in subsequent epidemiologic analyses.

Conclusions:

  • Sequential regression trees offer a powerful, minimally tuned nonparametric alternative for multiple imputation in complex epidemiologic studies.
  • This method enhances the handling of intricate variable interactions and nonlinearities, improving data integrity.
  • The approach provides a more robust solution for missing data challenges in large-scale observational research.