Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Study Design in Statistics01:15

Study Design in Statistics

9.9K
A study design is a set of techniques that allow a researcher to collect and analyze data from different variables defined for a specific research problem. Statistics is commonly for effective study design and more robust experiments,
Does aspirin reduce the risk of heart attacks? Is one brand of fertilizer more effective at growing roses than another? Is fatigue as dangerous to a driver as the influence of alcohol? Questions like these are answered using randomized experiments with proper...
9.9K
Randomized Experiments01:13

Randomized Experiments

8.8K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.8K
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

244
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
244
Regression Toward the Mean01:52

Regression Toward the Mean

6.8K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.8K
Study Designs in Epidemiology01:20

Study Designs in Epidemiology

878
Epidemiological study designs are fundamental tools for investigating the distribution, determinants, and control of health conditions in populations. They help researchers understand the relationships between exposures and outcomes, and they broadly fall into two categories: "observational" and "experimental" studies.
Observational studies are those where the researcher does not intervene but rather observes natural variations. They include cross-sectional, cohort, and...
878
Bias in Epidemiological Studies01:29

Bias in Epidemiological Studies

1.3K
Biases can arise at various stages of research, from study design and data collection to analysis and interpretation. Recognizing and addressing these biases is essential to ensure the validity and reliability of epidemiological findings.Broadly speaking, biases in epidemiology fall into three main categories: selection bias, information bias, and confounding. A more detailed description of possible biases is:  
1.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Design, Synthesis, and Performance Characterization of BODIPY-Based NIR Probes for Aβ<sub>42</sub> Aggregate Detection.

ChemPlusChem·2026
Same author

Clinicopathological features of primary pulmonary choriocarcinoma in males: A report of two cases and literature review.

Respiratory medicine case reports·2026
Same author

[Predictive value of neutrophil-to-lymphocyte ratio combined with glycosylated haemoglobin for bloodstream infection in pyogenic liver abscesses].

Zhonghua wei zhong bing ji jiu yi xue·2026
Same author

Publisher Correction: Enhancing propensity score analysis with data missing not at random: Introducing dual-forest proximity imputation.

Behavior research methods·2026
Same author

Anti-human T lymphocyte porcine immunoglobulin attenuates experimental autoimmune encephalomyelitis by inhibiting T cell activation.

Scientific reports·2026
Same author

Enhancing propensity score analysis with data missing not at random: Introducing dual-forest proximity imputation.

Behavior research methods·2026
Same journal

Bayesian evaluation for latent variable models: A tutorial on computing information criteria and bayes factors with the r package bleval.

Psychological methods·2026
Same journal

A stochastic block prior for clustering in graphical models.

Psychological methods·2026
Same journal

Three-level vector autoregressive models.

Psychological methods·2026
Same journal

Scaling cognitive modeling to big data: A deep learning approach to studying individual differences in evidence accumulation model parameters.

Psychological methods·2026
Same journal

Best practices in multilevel modeling for within-cluster group comparisons: An evaluation of coding strategies reflecting group composition and heterogeneity.

Psychological methods·2026
Same journal

A unified framework for psychometrics in experimental psychology: The standardized generalized hierarchical factor model.

Psychological methods·2026
See all related articles

Related Experiment Video

Updated: Jan 15, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K

Machine learning for propensity score estimation: A systematic review and reporting guidelines.

Walter Leite1, Huibin Zhang2, Zachary Collier3

  • 1School of Human Development and Organizational Studies in Education, University of Florida.

Psychological Methods
|October 16, 2025
PubMed
Summary
This summary is machine-generated.

Machine learning (ML) is widely used for propensity score (PS) estimation in research. However, this review found critical reporting gaps, including underreported covariate balance and sensitivity analyses, necessitating improved guidelines.

More Related Videos

Influence of Emotional Factors on the Efficacy of Acupuncture Treatment for Overweight Complicated with Hyperlipidemia: A Retrospective Cohort Study
03:05

Influence of Emotional Factors on the Efficacy of Acupuncture Treatment for Overweight Complicated with Hyperlipidemia: A Retrospective Cohort Study

Published on: November 21, 2025

475
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.9K

Related Experiment Videos

Last Updated: Jan 15, 2026

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

15.0K
Influence of Emotional Factors on the Efficacy of Acupuncture Treatment for Overweight Complicated with Hyperlipidemia: A Retrospective Cohort Study
03:05

Influence of Emotional Factors on the Efficacy of Acupuncture Treatment for Overweight Complicated with Hyperlipidemia: A Retrospective Cohort Study

Published on: November 21, 2025

475
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.9K

Area of Science:

  • Statistics
  • Computer Science
  • Social Sciences

Background:

  • Machine learning (ML) methods are increasingly utilized for propensity score (PS) estimation in quasi-experimental research.
  • This systematic review synthesizes 179 applications of ML for PS estimation across diverse fields over four decades.

Purpose of the Study:

  • To systematically review the application of ML for PS estimation.
  • To identify frequently used ML methods, software packages, and reporting practices.
  • To highlight deficiencies in reporting critical analysis steps and propose guidelines.

Main Methods:

  • Systematic literature review of 179 studies using ML for PS estimation.
  • Analysis of ML methods (e.g., gradient boosting machine, random forest), software packages (e.g., R), and reporting of key analysis components.
  • Examination of reporting completeness for covariate balance, sensitivity analysis, and hyperparameter configuration.

Main Results:

  • Gradient Boosting Machine (GBM) and Random Forest were the most common ML methods for PS estimation.
  • Significant underreporting was observed: 48.04% omitted covariate balance evaluation, and 13.97% misused p-values for balance assessment.
  • Only 22.8% conducted sensitivity analyses, and 46.9% reported hyperparameters, indicating critical gaps in methodological transparency.

Conclusions:

  • While ML methods like GBM are prevalent for PS estimation, current reporting practices are often inadequate.
  • Underreporting of covariate balance, sensitivity analyses, and hyperparameter details hinders reproducibility and methodological rigor.
  • Guidelines are proposed to enhance the transparent and accurate reporting of ML-based propensity score analyses in research.