Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

193
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
193
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

325
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
325
Multiple Regression01:25

Multiple Regression

3.3K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.3K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.6K
Variability: Analysis01:11

Variability: Analysis

230
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
230
Randomized Experiments01:13

Randomized Experiments

8.1K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Molecular arms races at the virus-host splicing interface and their pathogenic implications.

Frontiers in microbiology·2026
Same author

Determining the half-life and turnover rate of EPA, n-3 docosapentaenoic acid, and DHA in humans using the natural abundance of carbon-13: a secondary analysis of a randomized clinical trial.

The American journal of clinical nutrition·2026
Same author

LLOT: application of Laplacian Linear Optimal Transport in spatial transcriptome reconstruction.

Biometrics·2026
Same author

Finer Particulate Matter Exposure Disparities Exist but Vary across Pollution Concentrations.

Environmental science & technology·2026
Same author

ADAM9 supports regulatory T cell stability through TGF-β-SMAD signaling and is reduced in systemic lupus erythematosus.

Clinical immunology (Orlando, Fla.)·2026
Same author

Dual Adaptive Disentangled Representation Learning With Multimodal Data for Disease Diagnosis.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Oct 6, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K

Ultra-high dimensional variable selection for doubly robust causal inference.

Dingke Tang1, Dehan Kong1, Wenliang Pan2

  • 1Department of Statistical Sciences, University of Toronto, Toronto, Ontario, Canada.

Biometrics
|January 19, 2022
PubMed
Summary
This summary is machine-generated.

We introduce causal ball screening to select important confounders from ultra-high dimensional data for causal inference. This method improves the efficiency of causal effect estimates from observational studies.

Keywords:
Alzheimer's diseaseaverage causal effectball covarianceconfounder selectionvariable screening

More Related Videos

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.0K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K

Related Experiment Videos

Last Updated: Oct 6, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.0K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K

Area of Science:

  • Causal inference
  • Statistical modeling
  • High-dimensional data analysis

Background:

  • Observational studies increasingly rely on rich covariate data for causal inference.
  • Extracting key features from high-dimensional data is crucial for developing causal procedures like doubly robust estimators.
  • Existing methods may not optimally handle ultra-high dimensional confounder selection.

Purpose of the Study:

  • Propose a novel confounder selection method, causal ball screening, for ultra-high dimensional data.
  • Enhance the efficiency of causal effect estimates by controlling for confounding.
  • Maintain double robustness in causal effect estimation.

Main Methods:

  • Develop causal ball screening for confounder selection in ultra-high dimensional datasets.
  • Utilize an outcome model-free procedure for propensity score model selection.
  • Incorporate predictors of the outcome into both propensity score and outcome regression models.

Main Results:

  • Theoretical analysis demonstrates model selection consistency and pointwise normality.
  • Synthetic and real-world data analyses show favorable performance compared to existing methods.
  • The method effectively controls for confounding while improving causal effect estimate efficiency.

Conclusions:

  • Causal ball screening is an effective method for confounder selection in ultra-high dimensional settings.
  • The proposed approach maintains the double robustness of causal effect estimators.
  • This technique offers advantages in efficiency and robustness for causal inference using observational data.