Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Testing a Claim about Standard Deviation

Testing a Claim about Standard Deviation

A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.
The...

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with data...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Author Correction: Methodological considerations for evaluating policy impacts on transgender and non-binary youth suicidality.

Nature human behaviour·2026

Same author

Probing the Hall anomaly and electronic structure in kagome metal RbV<sub>3</sub>Sb<sub>5</sub> under hydrostatic pressure.

Science and technology of advanced materials·2026

Same author

<i>Fusobacterium nucleatum-</i>Derived Isoleucine Exacerbates Aneurysm by Inducing Ferroptosis in Vascular Smooth Muscle Cells.

Arteriosclerosis, thrombosis, and vascular biology·2026

Same author

Methodological considerations for evaluating policy impacts on transgender and non-binary youth suicidality.

Nature human behaviour·2026

Same author

Dual-Similarity Driven Just-in-Time Modeling Enables Precise Real-Time Monitoring of Cell Cultures via Dielectric Spectroscopy.

Biotechnology journal·2026

Same author

Refining the Role of ERK Signaling in Anthracycline-Induced Cardiotoxicity.

JACC. CardioOncology·2026

Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026

Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026

Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026

Same journal

Beyond Fixed Thresholds: Optimizing Summaries of Wearable Device Data via Piecewise Linearization of Quantile Functions.

Statistics in medicine·2026

Same journal

A Causal Framework for Evaluating the Total Effect of Strategies Aiming to Expand Screening and to Improve Outcomes.

Statistics in medicine·2026

Same journal

Causal Effects on Nonterminal Event Time With Application to Antibiotic Usage and Future Resistance.

Statistics in medicine·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 15, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Testing for improvement in prediction model performance.

Margaret Sullivan Pepe¹, Kathleen F Kerr, Gary Longton

¹Biostatistics and Biomathematics, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA. mspepe@u.washington.edu

Statistics in Medicine

|January 9, 2013

Summary

This summary is machine-generated.

Testing for improved prediction performance is redundant if a new predictor is already a risk factor. Standard statistical tests for prediction improvement are overly conservative, leading to insensitivity. Focus on estimation rather than hypothesis testing for better insights.

More Related Videos

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Related Experiment Videos

Last Updated: May 15, 2026

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Area of Science:

Biostatistics
Epidemiology
Medical Informatics

Background:

Evaluating new predictors in risk models is crucial for improving prediction performance.
Existing methodologies for assessing prediction improvement can be complex and lead to redundant testing.

Purpose of the Study:

To theoretically prove the equivalence between testing for prediction improvement and testing for risk factor significance.
To investigate the properties of statistical tests for prediction improvement, particularly the area under the ROC curve (AUC).
To identify reasons for the perceived insensitivity of AUC to prediction performance improvements.

Main Methods:

Theoretical derivation of null hypothesis equivalence.
Simulation studies to evaluate statistical test properties.
Analysis of standard testing procedures for regression models.

Main Results:

Null hypotheses for prediction improvement are equivalent to testing if a new predictor is a risk factor.
Standard inference procedures without adjustments for regression coefficient variability are extremely conservative.
The insensitivity of AUC may stem from invalid inference methods, not the measure itself.

Conclusions:

Hypothesis testing for prediction improvement is redundant when the predictor's risk factor status is established.
Recommend focusing on estimation of prediction performance measures over hypothesis testing.
Advise using well-developed methods for evaluating risk factors to avoid redundant and problematic inference procedures.