Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Bootstrapping01:24

Bootstrapping

603
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
603
Causality in Epidemiology01:21

Causality in Epidemiology

395
Causality or causation is a fundamental concept in epidemiology, vital for understanding the relationships between various factors and health outcomes. Despite its importance, there's no single, universally accepted definition of causality within the discipline. Drawing from a systematic review, causality in epidemiology encompasses several definitions, including production, necessary and sufficient, sufficient-component, counterfactual, and probabilistic models. Each has its strengths and...
395
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

124
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
124
Survival Tree01:19

Survival Tree

80
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
80
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

1.9K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
1.9K
Assumptions of Survival Analysis01:15

Assumptions of Survival Analysis

124
Survival models analyze the time until one or more events occur, such as death in biological organisms or failure in mechanical systems. These models are widely used across fields like medicine, biology, engineering, and public health to study time-to-event phenomena. To ensure accurate results, survival analysis relies on key assumptions and careful study design.
124
  1. Home
  2. A Fast Bootstrap Algorithm For Causal Inference With Large Data.
  1. Home
  2. A Fast Bootstrap Algorithm For Causal Inference With Large Data.

Related Experiment Video

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K

A fast bootstrap algorithm for causal inference with large data.

Matthew Kosko1, Lin Wang2, Michele Santacatterina3

  • 1Department of Statistics, George Washington University, Washington, DC.

Statistics in Medicine
|May 13, 2024

View abstract on PubMed

Summary
This summary is machine-generated.

A new causal bag of little bootstraps method offers efficient causal effect estimation for large datasets. This computational improvement provides reliable confidence intervals, aiding causal inference in research and industry.

Keywords:
causal bootstrapcovariate balancemachine learningpropensity scorereal‐world data

More Related Videos

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.5K

Related Experiment Videos

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.5K

Area of Science:

  • Statistics
  • Biostatistics
  • Epidemiology

Background:

  • Estimating causal effects from large datasets is crucial in research and industry.
  • The traditional bootstrap method for standard errors and confidence intervals is computationally intensive for large data.
  • Modern causal inference techniques further increase the computational burden of the bootstrap.

Purpose of the Study:

  • Introduce a novel bootstrap algorithm, the causal bag of little bootstraps (CB দাব)
  • Enhance computational efficiency for causal inference with large datasets.
  • Ensure consistent estimates and reliable confidence interval coverage.

Main Methods:

  • Developed the causal bag of little bootstraps (CB দাব) algorithm.
  • Evaluated algorithm performance using simulation studies.
  • Assessed bias, confidence interval coverage, and computational time.
  • Applied the method to a large observational dataset (Women's Health Initiative).
  • Main Results:

    • The CB দাব algorithm significantly improves computational efficiency compared to the traditional bootstrap.
    • The proposed method yields consistent estimates and desirable confidence interval coverage.
    • Simulation studies demonstrate the algorithm's effectiveness in bias and coverage.

    Conclusions:

    • The causal bag of little bootstraps is a computationally efficient and statistically sound method for causal inference with large datasets.
    • This algorithm facilitates the evaluation of causal effects in complex, large-scale studies.
    • The method was successfully applied to analyze the effect of hormone therapy on coronary heart disease.