Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.5K
Significance Testing: Overview01:04

Significance Testing: Overview

5.0K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
5.0K
Statistical Significance01:50

Statistical Significance

20.5K
Once data is collected from both the experimental and the control groups, a statistical analysis is conducted to find out if there are meaningful differences between the two groups. A statistical analysis determines how likely any difference found is due to chance (and thus not meaningful). In psychology, group differences are considered meaningful, or significant, if the odds that these differences occurred by chance alone are 5 percent or less. Stated another way, if we repeated this...
20.5K
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

329
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
329
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

3.1K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
3.1K
Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cardiovascular Risk in Patients With Treated Isolated Diastolic Hypertension and Isolated Low Diastolic Blood Pressure.

Journal of the American Heart Association·2024
Same author

Prognostic Role of Pulmonary Function in Patients With Heart Failure With Reduced Ejection Fraction.

Journal of the American Heart Association·2022
Same author

The Comprehensive Machine Learning Analytics for Heart Failure.

International journal of environmental research and public health·2021
Same author

Extensions of the distributed lag non-linear model (DLNM) to account for cumulative mortality.

Environmental science and pollution research international·2021
Same author

A novel cross-validation strategy for artificial neural networks using distributed-lag environmental factors.

PloS one·2021
Same author

A novel machine learning strategy for model selections - Stepwise Support Vector Machine (StepSVM).

PloS one·2020
Same journal

Correction: Grewal et al. Diversity and Representation in Cardiovascular Research: Evidence Gaps, Emerging Models, and Policy Implications. <i>Int. J. Environ. Res. Public Health</i> 2026, <i>23</i>, 241.

International journal of environmental research and public health·2026
Same journal

Drinking Water Quality and Health Risk Assessment in Rural Ghana: Evidence from North-East and North Gonja Districts in the Savannah Region.

International journal of environmental research and public health·2026
Same journal

Physical Activity of University Students During COVID-19 Restrictions: Evidence from Poland.

International journal of environmental research and public health·2026
Same journal

Assessment of Occupational Health and Safety Hazards in Mosquito Control Personnel in North Carolina and Virginia, USA.

International journal of environmental research and public health·2026
Same journal

Association Between Dysfunctional Parenting Practices and Suspected Gaming Disorder Among Japanese Male Junior High School Students: A Cross-Sectional Study of Parental Assessment.

International journal of environmental research and public health·2026
Same journal

A National Virtual Peer Support Group for Women Veterans Living with Breast Cancer: Lessons from the Field.

International journal of environmental research and public health·2026
See all related articles

Related Experiment Video

Updated: Oct 2, 2025

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.3K

A Novel Algorithm to Estimate the Significance Level of a Feature Interaction Using the Extreme Gradient Boosting

Chao-Yu Guo1, Ke-Hao Chang1

  • 1Division of Biostatistics, Institute of Public Health, School of Medicine, National Yang Ming Chiao Tung University, Taipei 112, Taiwan.

International Journal of Environmental Research and Public Health
|February 25, 2022
PubMed
Summary
This summary is machine-generated.

We developed XGB-FI, a new machine learning algorithm, to calculate p-values for feature interactions. XGB-FI demonstrates superior power compared to traditional regression models in identifying significant interactions.

Keywords:
XGBcross-validationinteractionmachine learning

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K
Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines
08:27

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines

Published on: January 5, 2024

1.3K

Related Experiment Videos

Last Updated: Oct 2, 2025

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.3K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K
Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines
08:27

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines

Published on: January 5, 2024

1.3K

Area of Science:

  • Biostatistics
  • Machine Learning
  • Computational Biology

Background:

  • Interaction effects are crucial in cardiac research, but conventional analyses may yield erroneous conclusions if not properly addressed.
  • Statistical methods like regression can model interactions, but machine learning approaches often lack p-value assessment for specific feature interactions.

Purpose of the Study:

  • To propose a novel machine learning algorithm, XGB-FI (extreme gradient boosting machine for feature interaction), for assessing the p-value of feature interactions.
  • To address the limitation of existing machine learning strategies in quantifying the statistical significance of feature interactions.

Main Methods:

  • XGB-FI stratifies data into four subgroups based on interactive features.
  • It employs cross-validation to train four XGBoost models, preventing overfitting.
  • A novel feature interaction ratio (FIR) is computed, and an empirical p-value is derived from its distribution.

Main Results:

  • Computer simulations confirmed XGB-FI's valid type I error rate at the 0.05 nominal level.
  • XGB-FI consistently exhibited higher statistical power than multiple regression models across all examined scenarios.
  • The proposed algorithm effectively identifies significant interaction effects.

Conclusions:

  • The novel XGB-FI algorithm outperforms conventional statistical models in detecting feature interactions.
  • XGB-FI provides a robust machine learning solution for evaluating the statistical significance of feature interactions in complex datasets.