Addressing data privacy in matched studies via virtual pooling.

P Saha-Chaudhuri¹, C R Weinberg²

¹Department of Epidemiology, Biostatistics and Occupational Health, McGill University, 1020 Pine Avenue West, Montreal, QC, Canada. paramita.sahachaudhuri.work@gmail.com.

BMC Medical Research Methodology

September 9, 2017

Summary

This summary is machine-generated.

Virtual data pooling enables secure multi-center study analysis by aggregating covariate data within nodes, preserving confidentiality while yielding accurate results comparable to individual data analysis.

Keywords:

Conditional logistic regression; Data privacy; Distributed data network; Matched case-control design; Specimen pooling

Area of Science:

Epidemiology
Biostatistics
Data Science

Background:

Balancing data confidentiality and shared use is challenging in multi-center studies.
Confidentiality restrictions often prevent the sites in a distributed data network from combining their records into a single dataset.
Existing methods like aggregate data sharing have limitations.

Purpose of the Study:

To propose a novel method for confidentiality-preserving analysis in multi-center studies.
To adapt specimen pooling methodology for virtual data aggregation.
To enable accurate estimation of individual-level effects from distributed data.

Main Methods:

Virtual pooling of covariates within nodes, analogous to specimen pooling.
Application to matched case-control, multi-center study designs.
Using aggregated covariate data in a conditional logistic regression model.
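
The steps above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes 1:1 matched case-control pairs, a single continuous covariate, and a hypothetical pool size of 2. It also assumes that, by analogy with specimen pooling, the summed covariates of the cases in a pool and of their matched controls can be treated as one matched pair in the conditional logistic likelihood. The function names `virtual_pool` and `fit_conditional_logistic`, the simulated data, and all parameter values are illustrative assumptions.

```python
import numpy as np


def virtual_pool(case_x, control_x, pool_size):
    """Sum covariates over consecutive groups of `pool_size` matched pairs.

    Only these sums would leave a node; individual rows stay local.
    """
    n_pools = len(case_x) // pool_size
    keep = n_pools * pool_size  # drop leftover pairs that do not fill a pool
    case_sums = case_x[:keep].reshape(n_pools, pool_size, -1).sum(axis=1)
    control_sums = control_x[:keep].reshape(n_pools, pool_size, -1).sum(axis=1)
    return case_sums, control_sums


def fit_conditional_logistic(case_x, control_x, n_iter=25):
    """Fit 1:1 conditional logistic regression by Newton-Raphson.

    Each set contributes exp(b'x_case) / (exp(b'x_case) + exp(b'x_control)),
    which equals sigmoid(b'd) with d = x_case - x_control.
    """
    d = case_x - control_x
    beta = np.zeros(d.shape[1])
    for _ in range(n_iter):
        p = 1.0 / (1.0 + np.exp(-d @ beta))
        grad = d.T @ (1.0 - p)                      # score vector
        info = (d * (p * (1.0 - p))[:, None]).T @ d  # observed information
        beta = beta + np.linalg.solve(info, grad)
    return beta


# Simulated matched pairs (illustrative data only).
rng = np.random.default_rng(0)
n_pairs, beta_true = 4000, 0.5
stratum = rng.normal(size=(n_pairs, 1))          # shared within each pair
control_x = stratum + rng.normal(size=(n_pairs, 1))
# For a unit-variance normal covariate, the logistic model shifts the
# case distribution by beta_true relative to the controls.
case_x = stratum + beta_true + rng.normal(size=(n_pairs, 1))

beta_individual = fit_conditional_logistic(case_x, control_x)
case_sums, control_sums = virtual_pool(case_x, control_x, pool_size=2)
beta_pooled = fit_conditional_logistic(case_sums, control_sums)

print("individual-level estimate:", beta_individual)
print("virtually pooled estimate:", beta_pooled)
```

Under these assumptions both fits target the same log odds ratio, so the two printed estimates should be close to each other. The pooled fit uses half as many rows and never sees an individual covariate value.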

Main Results:

Virtual pooling retains much of the information available from individual-level data analysis.
Parameter estimates from virtual pooling are similar to standard methods.
Aggregated data analysis shows comparable standard errors and confidence interval coverage.

Conclusions:

Virtual data pooling effectively maintains data confidentiality in multi-center research.
This method is particularly valuable for large-scale distributed data analysis.
It offers a practical solution for balancing data sharing and privacy.