A generative model for evaluating missing data methods in large epidemiological cohorts.

Lav Radosavljević¹, Stephen M Smith², Thomas E Nichols³

¹Nuffield Department of Population Health, University of Oxford, Oxford, UK.

BMC Medical Research Methodology

February 8, 2025

Summary

This summary is machine-generated.

Researchers developed a new tool to simulate complex missing data patterns in large datasets, crucial for accurately evaluating data imputation methods. This simulation framework reveals challenges in handling missingness and suggests iterative imputation as a promising approach.

Keywords:

Imputation; Missing data; Multivariate modelling; Neuroimaging; Structured missingness; UK Biobank

Area of Science:

Epidemiology
Data Science
Bioinformatics

Background:

Large-scale datasets are valuable but often suffer from missing data, hindering their utility.
Current evaluation methods for imputation lack realism, using simplified missing data mechanisms.
Real-world data, like the UK Biobank, exhibit structured missingness (e.g., block-wise) due to study design.

Purpose of the Study:

To develop a novel tool for generating synthetic large-scale epidemiological data with realistic mixed-type missingness.
To account for structured, unstructured, and informative missingness patterns.
To provide a robust framework for evaluating data imputation methods.
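
To make the three kinds of missingness named above concrete, the following is a minimal sketch and not the tool described in the study. The column assignments, probabilities, and the names `make_masks` and `missing_rate` are assumptions chosen only for illustration: two columns are missing as a block for participants outside a hypothetical sub-study (structured), two lose cells independently at a fixed rate (unstructured), and one is more likely to be missing when its own value is high (informative).

```python
import random

def make_masks(n_rows=200, n_cols=6, seed=0):
    """Return (data, mask); mask[i][j] is True when cell (i, j) is missing."""
    rng = random.Random(seed)
    data = [[rng.gauss(0.0, 1.0) for _ in range(n_cols)] for _ in range(n_rows)]
    mask = [[False] * n_cols for _ in range(n_rows)]

    # Structured (block-wise): the second half of the rows did not take part
    # in a hypothetical sub-study that measured columns 0 and 1.
    for i in range(n_rows // 2, n_rows):
        mask[i][0] = True
        mask[i][1] = True

    # Unstructured: each cell in columns 2 and 3 is dropped independently
    # with a fixed probability (an assumed 10%).
    for i in range(n_rows):
        for j in (2, 3):
            if rng.random() < 0.1:
                mask[i][j] = True

    # Informative: column 4 is more likely to be missing when its own value
    # is high (assumed probabilities 0.6 versus 0.05).
    for i in range(n_rows):
        p = 0.6 if data[i][4] > 0.5 else 0.05
        if rng.random() < p:
            mask[i][4] = True

    # Column 5 is left fully observed.
    return data, mask

def missing_rate(mask, col):
    """Fraction of rows in which the given column is missing."""
    return sum(row[col] for row in mask) / len(mask)

data, mask = make_masks()
for j in range(6):
    print(f"column {j}: {missing_rate(mask, j):.2f} missing")
```

In the informative case, the values that remain observed are systematically lower than the full set, which is why a method evaluated only under random deletion can look better than it would on real data.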

Main Methods:

Proposed a tool to mimic key properties of real large-scale epidemiological data.
Utilized hierarchical clustering to identify sub-studies based on missingness patterns.
Modeled inter-variable correlation and co-missingness to capture data dependencies.
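
The clustering step in the list above can be sketched as follows. This is an illustrative stand-in, not the authors' implementation: the Jaccard distance between missingness indicators, single linkage, the 0.5 merge threshold, and the toy data are all assumptions. Variables whose values tend to be missing for the same participants end up in the same group, which is one way candidate sub-studies could be recovered from a missingness matrix.

```python
from itertools import combinations

def jaccard_distance(a, b):
    """1 - (rows missing in both) / (rows missing in either).

    Two columns with no missing values at all are treated as distance 0.
    """
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    return 0.0 if either == 0 else 1.0 - both / either

def cluster_columns(mask_columns, threshold=0.5):
    """Single-linkage agglomerative clustering of variables.

    Merging stops once the closest pair of clusters is farther apart
    than `threshold`.
    """
    n = len(mask_columns)
    clusters = [[j] for j in range(n)]
    dist = {
        (i, j): jaccard_distance(mask_columns[i], mask_columns[j])
        for i, j in combinations(range(n), 2)
    }

    def linkage(c1, c2):
        # Single linkage: distance between the two closest members.
        return min(dist[(min(i, j), max(i, j))] for i in c1 for j in c2)

    while len(clusters) > 1:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: linkage(clusters[p[0]], clusters[p[1]]),
        )
        if linkage(clusters[a], clusters[b]) > threshold:
            break
        merged = sorted(clusters[a] + clusters[b])
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return sorted(clusters)

# Toy indicator columns (True = missing) for 8 participants and 5 variables.
# Variables 0-1 and 2-3 mimic two hypothetical sub-studies; variable 4 is
# missing sporadically.
columns = [
    [True, True, True, True, False, False, False, False],
    [True, True, True, True, False, False, False, False],
    [False, False, False, False, False, True, True, True],
    [False, False, False, False, True, True, True, True],
    [False, True, False, False, False, False, True, False],
]
print(cluster_columns(columns))  # → [[0, 1], [2, 3], [4]]
```

The pairwise distances computed here also serve as a simple co-missingness summary: a small distance means two variables are usually missing together.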

Main Results:

Identified significant block-wise missing data in the UK Biobank brain imaging cohort.
Evaluated multiple imputation methods, finding iterative imputation performed best.
Compared synthetic data evaluations to real data analysis, noting minor differences in variable selection outcomes.
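
The general idea behind evaluating imputation on synthetic data is to hide values whose truth is known, impute them, and score the error on the hidden cells. The sketch below illustrates that idea only; it is not the study's evaluation pipeline. The two-variable data, the 15% deletion rates, the chained simple-regression imputer, and all function names are assumptions made for the example.

```python
import math
import random

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b * x; returns (a, b)."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def mean_impute(rows, mask):
    """Replace each missing cell with the mean of its column's observed cells."""
    filled = [row[:] for row in rows]
    for j in range(2):
        seen = [rows[i][j] for i in range(len(rows)) if not mask[i][j]]
        mean = sum(seen) / len(seen)
        for i in range(len(rows)):
            if mask[i][j]:
                filled[i][j] = mean
    return filled

def iterative_impute(rows, mask, n_rounds=5):
    """Chained regression: start from mean imputation, then repeatedly
    re-predict each column's missing cells from the other column."""
    filled = mean_impute(rows, mask)
    for _ in range(n_rounds):
        for target in range(2):
            other = 1 - target
            train = [i for i in range(len(rows)) if not mask[i][target]]
            a, b = fit_line([filled[i][other] for i in train],
                            [filled[i][target] for i in train])
            for i in range(len(rows)):
                if mask[i][target]:
                    filled[i][target] = a + b * filled[i][other]
    return filled

def rmse(truth, filled, mask):
    """Root-mean-square error over the cells that were hidden."""
    errs = [(truth[i][j] - filled[i][j]) ** 2
            for i in range(len(truth)) for j in range(2) if mask[i][j]]
    return math.sqrt(sum(errs) / len(errs))

rng = random.Random(1)

# Synthetic ground truth: two strongly correlated variables.
truth = []
for _ in range(300):
    x = rng.gauss(0.0, 1.0)
    truth.append([x, 2.0 * x + rng.gauss(0.0, 0.3)])

# Hide about 15% of each column at random, never both cells in the same row.
mask = []
for _ in range(300):
    u = rng.random()
    mask.append([u < 0.15, 0.15 <= u < 0.30])

# The imputers only ever see this version, with hidden cells set to None.
observed = [[None if mask[i][j] else truth[i][j] for j in range(2)]
            for i in range(300)]

rmse_mean = rmse(truth, mean_impute(observed, mask), mask)
rmse_iter = rmse(truth, iterative_impute(observed, mask), mask)
print(f"mean imputation RMSE:      {rmse_mean:.3f}")
print(f"iterative imputation RMSE: {rmse_iter:.3f}")
```

Because the two variables are strongly correlated, the regression-based imputer recovers hidden values far more closely than the column mean. Values here are deleted completely at random, so the comparison says nothing about performance under structured or informative missingness.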

Conclusions:

A framework was created to simulate large-scale data with complex, realistic missingness patterns.
Evaluations highlight the significant challenges in data imputation for such complex datasets.
The study underscores the need for advanced methods to address missing data in large-scale studies.