Addressing data privacy in matched studies via virtual pooling.

P Saha-Chaudhuri1, C R Weinberg2

  • 1Department of Epidemiology, Biostatistics and Occupational Health, McGill University, 1020 Pine Avenue West, Montreal, QC, Canada. paramita.sahachaudhuri.work@gmail.com.

BMC Medical Research Methodology | September 9, 2017
Summary
This summary is machine-generated.

Virtual data pooling enables secure multi-center study analysis by aggregating covariate data within nodes, preserving confidentiality while yielding accurate results comparable to individual data analysis.

Keywords:
Conditional logistic regression; Data privacy; Distributed data network; Matched case-control design; Specimen pooling

Area of Science:

  • Epidemiology
  • Biostatistics
  • Data Science

Background:

  • Balancing data confidentiality and shared use is challenging in multi-center studies.
  • Confidentiality restrictions often prevent distributed data from being combined into a single dataset for analysis.
  • Existing methods like aggregate data sharing have limitations.

Purpose of the Study:

  • To propose a novel method for confidentiality-preserving analysis in multi-center studies.
  • To adapt specimen pooling methodology for virtual data aggregation.
  • To enable accurate estimation of individual-level effects from distributed data.

Main Methods:

  • Virtual pooling of covariates within nodes, analogous to specimen pooling.
  • Application to matched case-control, multi-center study designs.
  • Using aggregated covariate data in a conditional logistic regression model.
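
The pooling idea above can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes 1:1 matched case-control pairs held at a single node, forms each virtual pool by summing covariates over a fixed number of consecutive pairs, and fits the paired conditional logistic likelihood with a plain Newton iteration. The function names (`virtual_pool`, `paired_clogit_loglik`, `fit_paired_clogit`) and the `pool_size` parameter are hypothetical and chosen only for illustration.

```python
import numpy as np


def virtual_pool(x_case, x_ctrl, pool_size):
    """Sum covariates over consecutive groups of `pool_size` matched pairs.

    x_case, x_ctrl: arrays of shape (n_pairs, n_covariates), row i of each
    belonging to matched pair i. Leftover pairs that do not fill a complete
    pool are dropped (an assumption of this sketch).
    Returns pooled case sums and pooled control sums, one row per pool.
    """
    x_case = np.asarray(x_case, dtype=float)
    x_ctrl = np.asarray(x_ctrl, dtype=float)
    n_pools = x_case.shape[0] // pool_size
    keep = n_pools * pool_size
    p = x_case.shape[1]
    pooled_case = x_case[:keep].reshape(n_pools, pool_size, p).sum(axis=1)
    pooled_ctrl = x_ctrl[:keep].reshape(n_pools, pool_size, p).sum(axis=1)
    return pooled_case, pooled_ctrl


def paired_clogit_loglik(beta, x_case, x_ctrl):
    """Conditional logistic log-likelihood for 1:1 matched sets.

    Each set contributes log(exp(b'xc) / (exp(b'xc) + exp(b'xk))),
    which equals -log(1 + exp(-b'(xc - xk))).
    """
    d = (np.asarray(x_case, dtype=float) - np.asarray(x_ctrl, dtype=float)) @ beta
    return -np.sum(np.logaddexp(0.0, -d))


def fit_paired_clogit(x_case, x_ctrl, n_iter=25):
    """Maximize the paired conditional likelihood by Newton's method.

    Works the same way on individual pairs or on pooled sums, since only
    the case-minus-control differences enter the likelihood.
    """
    z = np.asarray(x_case, dtype=float) - np.asarray(x_ctrl, dtype=float)
    beta = np.zeros(z.shape[1])
    for _ in range(n_iter):
        prob = 1.0 / (1.0 + np.exp(-(z @ beta)))
        grad = z.T @ (1.0 - prob)
        info = (z * (prob * (1.0 - prob))[:, None]).T @ z  # negative Hessian
        beta = beta + np.linalg.solve(info, grad)
    return beta
```

Under these assumptions, a node would share only the output of `virtual_pool`, and the analyst would call `fit_paired_clogit` on the pooled sums exactly as on individual pairs. The Newton loop has no safeguards for separated or degenerate data, so a maintained conditional logistic regression routine is the better choice for real analyses.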

Main Results:

  • Virtual pooling retains substantial information relative to analysis of individual-level data.
  • Parameter estimates from virtual pooling are similar to those from standard individual-level analysis.
  • Analysis of aggregated data yields standard errors and confidence interval coverage comparable to individual-level analysis.

Conclusions:

  • Virtual data pooling effectively maintains data confidentiality in multi-center research.
  • This method is particularly valuable for large-scale distributed data analysis.
  • It offers a practical solution for balancing data sharing and privacy.