Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

291
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
291
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

26
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
26
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

80
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
80
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

127
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
127
Censoring Survival Data01:09

Censoring Survival Data

56
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
56
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

38
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
38

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Stable individual differences dominate adult brain volume variation until later life.

Imaging neuroscience (Cambridge, Mass.)·2026
Same author

The methodological foundations of lesion network mapping remain sound.

bioRxiv : the preprint server for biology·2026
Same author

Author Correction: UK Biobank at 20 - a growing, global resource for dementia research.

Nature reviews. Neurology·2026
Same author

UK Biobank at 20 - a growing, global resource for dementia research.

Nature reviews. Neurology·2026
Same author

Profiling of 5-hydroxymethylcytosine in blood reveals preferential enrichment at exon-intron junctions and predictive value for Parkinson's disease.

NPJ Parkinson's disease·2026
Same author

Disease Site-Specific Outcomes in p16-Positive Non-Oropharyngeal Mucosal Head and Neck Cancer.

Head & neck·2026
Same journal

Methods for incorporating test result information within the high-dimensional propensity score framework: application in UK electronic health record data.

BMC medical research methodology·2026
Same journal

Sparse multi-way DMDC for longitudinal classification in high dimension low sample size data.

BMC medical research methodology·2026
Same journal

Tree-based exploratory identification of predictive biomarkers in non-randomized data.

BMC medical research methodology·2026
Same journal

Comparative evaluation of interrupted time series analytical methods for healthcare quality improvement research: a Monte Carlo simulation study.

BMC medical research methodology·2026
Same journal

Methodological advances in claims-based dementia algorithms: integrating medication and clinical data for medicare populations.

BMC medical research methodology·2026
Same journal

An interpretable XGboost algorithm for predicting 30-day mortality in acute pancreatitis using routine biomarkers.

BMC medical research methodology·2026
See all related articles

Related Experiment Video

Updated: May 28, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K

A generative model for evaluating missing data methods in large epidemiological cohorts.

Lav Radosavljević1, Stephen M Smith2, Thomas E Nichols3

  • 1Nuffield Department of Population Health, University of Oxford, Oxford, UK.

BMC Medical Research Methodology
|February 8, 2025
PubMed
Summary
This summary is machine-generated.

Researchers developed a new tool to simulate complex missing data patterns in large datasets, crucial for accurately evaluating data imputation methods. This simulation framework reveals challenges in handling missingness and suggests iterative imputation as a promising approach.

Keywords:
ImputationMissing dataMultivariate modellingNeuroimagingStructured missingnessUK Biobank

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Related Experiment Videos

Last Updated: May 28, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.4K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Area of Science:

  • Epidemiology
  • Data Science
  • Bioinformatics

Background:

  • Large-scale datasets are valuable but often suffer from missing data, hindering their utility.
  • Current evaluation methods for imputation lack realism, using simplified missing data mechanisms.
  • Real-world data, like the UK Biobank, exhibit structured missingness (e.g., block-wise) due to study design.

Purpose of the Study:

  • To develop a novel tool for generating synthetic large-scale epidemiological data with realistic mixed-type missingness.
  • To account for structured, unstructured, and informative missingness patterns.
  • To provide a robust framework for evaluating data imputation methods.

Main Methods:

  • Proposed a tool to mimic key properties of real large-scale epidemiological data.
  • Utilized hierarchical clustering to identify sub-studies based on missingness patterns.
  • Modeled inter-variable correlation and co-missingness to capture data dependencies.

Main Results:

  • Identified significant block-wise missing data in the UK Biobank brain imaging cohort.
  • Evaluated multiple imputation methods, finding iterative imputation performed best.
  • Compared synthetic data evaluations to real data analysis, noting minor differences in variable selection outcomes.

Conclusions:

  • A framework was created to simulate large-scale data with complex, realistic missingness patterns.
  • Evaluations highlight the significant challenges in data imputation for such complex datasets.
  • The study underscores the need for advanced methods to address missing data in large-scale studies.