Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Analysis01:11

Regression Analysis

5.5K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.5K
Multiple Regression01:25

Multiple Regression

2.9K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
2.9K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

100
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
100
Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test01:09

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

1.5K
In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...
1.5K
Statistical Hypothesis Testing01:16

Statistical Hypothesis Testing

1.8K
Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...
1.8K
Data Validation01:15

Data Validation

125
Method validation is a crucial process in analytical chemistry designed to confirm that a given method consistently produces reliable and high-quality results. This process is essential when a method is applied to different sample matrices or when procedural modifications are made, ensuring that the results meet acceptable standards across various applications.
Key parameters for method validation include:
125

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Latent Space Projections and Atlases, a Cautionary Tale in Deep Neuroimaging using Autoencoders.

International journal of neural systems·2026
Same author

An explainable framework for the relationship between dementia and metabolism patterns.

NeuroImage·2026
Same author

An Explainable Framework for the Relationship between Dementia and Metabolism Patterns.

ArXiv·2026
Same author

Management of extensively drug-resistant <i>Acinetobacter baumannii</i> bacteremia with cefiderocol and sulbactam-durlobactam: a case report.

ASM case reports·2025
Same author

Nonlinear Weighting Ensemble Learning Model to Diagnose Parkinson's Disease Using Multimodal Data.

International journal of neural systems·2023
Same author

A Connection Between Pattern Classification by Machine Learning and Statistical Inference With the General Linear Model.

IEEE journal of biomedical and health informatics·2021
Same journal

Corrigendum to "Metformin suppresses vascular smooth muscle cell senescence by promoting autophagic flux" [J. Adv. Res. 41 (2022) 205-218].

Journal of advanced research·2026
Same journal

Interactive canopy nitrogen and water additions delay phenology in a warm-temperate forest.

Journal of advanced research·2026
Same journal

Combination of degrading with non-degrading endophytic bacteria synergistically enhances rice resistance to chlorpyrifos stress.

Journal of advanced research·2026
Same journal

Impact of technological heterogeneity on economic efficiency and total factor productivity change in developed and developing G20 economies.

Journal of advanced research·2026
Same journal

GhIMP10D, an inositol monophosphatase gene enhancing alkaline stress tolerance in plants.

Journal of advanced research·2026
Same journal

The heterogeneity of dopamine-mediated vasodilation in human intrarenal arteries.

Journal of advanced research·2026
See all related articles

Related Experiment Video

Updated: May 12, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Statistical agnostic regression: A machine learning method to validate regression models.

J M Gorriz1, J Ramirez2, F Segovia2

  • 1Dpt. of Psychiatry, University of Cambridge, UK; DaSCI Institute, University of Granada, Spain; ibs.Granada, Granada, Spain.

Journal of Advanced Research
|May 3, 2025
PubMed
Summary
This summary is machine-generated.

Statistical Agnostic Regression (SAR) provides a non-parametric method to assess the statistical significance of machine learning linear regression models. This novel approach controls false positive rates, offering a robust alternative to classical statistical methods.

Keywords:
K-fold cross-validationOrdinary least squaresPermutation testsStatistical learning theoryUpper boundinglinear support vector machines

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Related Experiment Videos

Last Updated: May 12, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • Regression analysis is fundamental for modeling relationships between variables.
  • Linear regression, including Ordinary Least Squares (OLS), Ridge, and Lasso, is widely used.
  • Machine learning (ML) models are increasingly applied but often lack formal statistical significance testing.

Purpose of the Study:

  • Introduce Statistical Agnostic Regression (SAR) for evaluating ML-based linear regression.
  • Develop a method to formally assess the statistical significance of linear relationships.
  • Provide a robust framework for hypothesis testing in regression analysis.

Main Methods:

  • Utilize concentration inequalities to analyze expected loss under worst-case scenarios.
  • Define a threshold for determining statistical significance with a specified probability (1-η).
  • Employ a non-parametric approach, avoiding assumptions of classical regression methods.

Main Results:

  • The SAR test demonstrates comparable analysis of variance to the classical multivariate F-test.
  • SAR effectively controls the false positive rate, unlike standard ML methods.
  • Residuals from SAR offer a balance between ML and OLS approaches.

Conclusions:

  • SAR offers a statistically rigorous and assumption-free method for linear regression significance testing.
  • The proposed method enhances the reliability of ML-based regression models.
  • SAR provides a valuable tool for researchers seeking robust statistical inference.