Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Bias in Epidemiological Studies01:29

Bias in Epidemiological Studies

683
Biases can arise at various stages of research, from study design and data collection to analysis and interpretation. Recognizing and addressing these biases is essential to ensure the validity and reliability of epidemiological findings.Broadly speaking, biases in epidemiology fall into three main categories: selection bias, information bias, and confounding. A more detailed description of possible biases is:  
683
Confounding in Epidemiological Studies01:27

Confounding in Epidemiological Studies

262
Confounding in statistical epidemiology represents a pivotal challenge, referring to the distortion in the perceived relationship between an exposure and an outcome due to the presence of a third variable, known as a confounder. This variable is associated with both the exposure and the outcome but is not a direct link in their causal chain. Its presence can lead to erroneous interpretations of the exposure's effect, either exaggerating or underestimating the true association. This...
262
Strategies for Assessing and Addressing Confounding01:25

Strategies for Assessing and Addressing Confounding

155
Confounding is a critical issue in epidemiological studies, often leading to misleading conclusions about associations between exposures and outcomes. It occurs when the relationship between the exposure and the outcome is mixed with the effects of other factors that influence the outcome. Given that, addressing confounding is of high importance for drawing accurate inferences in research.
Confounding can be addressed at both the design phase of a study and through analytical methods after data...
155
Bias01:22

Bias

4.9K
Bias refers to any tendency that prevents a question from being considered unprejudiced. In research, bias occurs when one outcome or answer is selected or encouraged over others in sampling or testing. Bias can occur during any research phase, including study design, data collection, analysis, and publication.
In statistics, a sampling bias is created when a sample is collected from a population, and some members of the population are not as likely to be chosen as others (remember, each member...
4.9K
Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches01:23

Types of Biopharmaceutical Studies: Controlled and Non-Controlled Approaches

174
Biopharmaceutical studies constitute a vital field aiming to enhance drug delivery methods and refine therapeutic approaches, drawing upon diverse interdisciplinary knowledge. In research methodologies, the choice between controlled and non-controlled studies significantly influences the study's reliability and accuracy.
Non-controlled studies, commonly employed for initial exploration, lack a control group, rendering them susceptible to biases and external influences. In contrast,...
174
Classification of Illness01:17

Classification of Illness

7.9K
The meaning of illness is individualized to each person who experiences an alteration in health. In contrast, disease is a medical term indicating a pathological change in the structure and function of the body or mind. It is a condition that has specific symptoms and boundaries.
An illness is a response to a disease in which the person's level of functioning is changed compared with a previous level. The general classification of illness includes acute and chronic.
Acute illness is severe...
7.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Screening for hypertension in a public dental clinic: A single-centre cross-sectional study in Australia.

PloS one·2026
Same author

Dietary Approaches to Stop Hypertension (DASH) Diet, Incident Heart Failure and Its Associated Risk Factors in Australian Women.

Medicina (Kaunas, Lithuania)·2026
Same author

Text Messaging for Cardiovascular Risk Prevention in Psoriasis: A Randomized Clinical Trial.

JAMA dermatology·2026
Same author

Crosstalk between integrin signaling and NAD⁺ biosynthetic pathways promotes glycolysis, proliferation, survival, and tumor growth in triple-negative breast cancer.

Oncogene·2026
Same author

Burden of Glomerular Diseases in Australia: A Data Linkage Study.

Kidney diseases (Basel, Switzerland)·2026
Same author

Sex differences in risk factor control and medication adherence post-ACS: insights from the TEXTMEDS randomised clinical trial.

Open heart·2026
Same journal

A GenAI Pipeline for Violinist Kinematic Data Management.

Studies in health technology and informatics·2026
Same journal

AMAL-For-Qatar: A Comprehensive AI Ecosystem for Fetal Ultrasound Analysis - Project Overview and Achievements.

Studies in health technology and informatics·2026
Same journal

Longitudinal Treatment-Aware Multimodal AI for Dermatology: A Scoping Review.

Studies in health technology and informatics·2026
Same journal

Predicting Postpartum Depression Using Imbalance-Aware Machine Learning.

Studies in health technology and informatics·2026
Same journal

Validation of Deep-Learning Models for Autosegmentation of Brain Metastases.

Studies in health technology and informatics·2026
Same journal

Delay-Dependent Gating in Modular RNNs.

Studies in health technology and informatics·2026
See all related articles

Related Experiment Video

Updated: Sep 12, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

AI Bias and Confounding Risk in Health Feature Engineering for Machine Learning Classification Task.

Ruihua Guo1, Angus Ritchie2, Ross Smith1

  • 1School of Computer Science, The University of Sydney, NSW, Australia 2008.

Studies in Health Technology and Informatics
|August 8, 2025
PubMed
Summary
This summary is machine-generated.

Machine learning in healthcare faces bias challenges. This study found pregnancy status significantly impacted cardiovascular readmission prediction models, highlighting the need for propensity score matching to address hidden confounding factors.

Keywords:
AI biasClassificationConfounding biasFeature EngineeringMachine LearningQuality ControlReadmission risk prediction

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

900
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.9K

Related Experiment Videos

Last Updated: Sep 12, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

900
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.9K

Area of Science:

  • Health Informatics
  • Machine Learning in Healthcare
  • Clinical Prediction Models

Background:

  • Machine learning offers opportunities in healthcare but faces challenges like data variability and AI bias.
  • Confounding risks can impact the performance of AI models in clinical settings.
  • Real-world health data often has limitations in scope, population coverage, and granularity.

Purpose of the Study:

  • To investigate the impact of hidden confounding factors on machine learning model performance in cardiovascular readmission prediction.
  • To evaluate the effectiveness of propensity score adjustment in mitigating confounding risks.
  • To identify potential confounding factors in predicting patient readmissions.

Main Methods:

  • Utilized real-life electronic health record data from the DREAM dataset.
  • Applied five machine learning models: k-nearest neighbors (KNN), random forest (RF), decision tree (DT), Catboost, and Xgboost.
  • Assessed model performance using Area Under the ROC Curve (AUC) and F1 score, comparing results before and after propensity score adjustment.

Main Results:

  • Propensity score adjustment revealed significant performance fluctuations, particularly for patients aged 20-40.
  • High-risk pregnant females were identified as a potential confounding factor, with a significantly higher pregnancy rate in the non-readmitted group (χ² = 10.2, p < 0.001).
  • Pregnancy status required data from an external system, posing integration challenges.

Conclusions:

  • Traditional machine learning pipelines without careful consideration of confounding risks may yield suboptimal clinical classifiers.
  • Incorporating propensity score matching is a viable strategy to randomize and account for invisible confounding factors.
  • Addressing hidden confounders like pregnancy status is crucial for developing robust and reliable AI tools in healthcare.