Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Analysis: Overview01:11

Statistical Analysis: Overview

13.9K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
13.9K
Censoring Survival Data01:09

Censoring Survival Data

468
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
468
What are Estimates?01:06

What are Estimates?

7.8K
It isn't easy to measure a parameter such as the mean height or the mean weight of a population. So, we draw samples from the population and calculate the mean height or mean weight of the individuals in the sample. This sample data acts as a representative measure of the population parameter. These sample statistics are known as estimates. 
The estimate for the mean of a sample is denoted by ͞x, whereas the mean of the population is designated as μ. Further, parameters such...
7.8K
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

838
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
838
Biostatistics: Overview01:20

Biostatistics: Overview

658
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
658
Study Design in Statistics01:15

Study Design in Statistics

9.9K
A study design is a set of techniques that allow a researcher to collect and analyze data from different variables defined for a specific research problem. Statistics is commonly for effective study design and more robust experiments,
Does aspirin reduce the risk of heart attacks? Is one brand of fertilizer more effective at growing roses than another? Is fatigue as dangerous to a driver as the influence of alcohol? Questions like these are answered using randomized experiments with proper...
9.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Data Resource Profile: Cheeloo Lifespan Electronic-health reseArch Data-library (Cheeloo LEAD).

International journal of epidemiology·2026
Same author

Global complications among patients with mpox: a systematic review and meta-analysis.

EClinicalMedicine·2026
Same author

Integrative omics analysis incorporating cardiovascular magnetic resonance imaging pinpoints potentially druggable plasma proteins for cardiovascular diseases.

Life metabolism·2026
Same author

Exploring schizophrenia-associated image-derived digital phenotypes in dual-energy X-ray densitometer scans of non-dominant distal forearm: An explainable machine-learning framework.

Psychiatry research·2026
Same author

From EHR chaos to clinical clarity: enhancing foundational encoders with task-specific attention for domain-oriented representation learning in downstream clinical applications.

Health information science and systems·2026
Same author

Evaluating the clinical efficacy of combinatorial pharmacogenomic-guided treatment in schizophrenia patients.

British journal of clinical pharmacology·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026
Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026
Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Dec 30, 2025

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore
06:01

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore

Published on: December 12, 2019

8.9K

Causal data fusion methods using summary-level statistics for a continuous outcome.

Hongkai Li1, Wang Miao2, Zheng Cai1

  • 1School of Mathematical Sciences, Peking University, Beijing, People's Republic of China.

Statistics in Medicine
|January 21, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a new data fusion method to accurately estimate causal effects using summary statistics from multiple datasets with varying confounders. The approach improves causal inference by integrating incomplete confounding information for reliable treatment effect estimation.

Keywords:
causal inferencedata fusionidentificationincomplete confounders

More Related Videos

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing
09:49

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing

Published on: July 5, 2019

9.9K
A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

435

Related Experiment Videos

Last Updated: Dec 30, 2025

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore
06:01

Visualization and Quantification of High-Dimensional Cytometry Data using Cytofast and the Upstream Clustering Methods FlowSOM and Cytosplore

Published on: December 12, 2019

8.9K
Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing
09:49

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing

Published on: July 5, 2019

9.9K
A Data-Driven Approach to Quantifying Immune States in Sepsis
07:42

A Data-Driven Approach to Quantifying Immune States in Sepsis

Published on: February 7, 2025

435

Area of Science:

  • Biostatistics
  • Epidemiology
  • Data Science

Background:

  • Individual studies often estimate causal effects with incomplete confounder adjustment.
  • Integrating summary-level statistics from multiple datasets with diverse confounders presents a significant challenge in data fusion for causal inference.

Purpose of the Study:

  • To propose a novel method for identifying and estimating the causal effect of a treatment or exposure on a continuous outcome.
  • To address the challenge of incomplete confounder adjustment in individual studies by leveraging multiple datasets.

Main Methods:

  • Developed a novel data fusion technique combining summary-level statistics.
  • Utilized multiple datasets containing subsets of confounders.
  • Incorporated an external dataset with complete confounding information to identify the causal effect.

Main Results:

  • Demonstrated that the causal effect is identifiable using the proposed method.
  • Simulation studies confirmed the unbiasedness of the causal effect estimates.
  • Applied the method to investigate the effect of body mass index on fasting blood glucose.

Conclusions:

  • The proposed method effectively integrates summary statistics from multiple datasets with varying confounders.
  • This approach enables unbiased causal effect estimation, enhancing causal inference.
  • The method shows practical utility in real-world applications, such as analyzing health-related exposures and outcomes.