Alternative performance measures for prediction models.

Yun-Chun Wu¹, Wen-Chung Lee²

¹Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.

March 11, 2014

Summary

This summary is machine-generated.

Two alternative performance measures for prediction models, the Pietra index and the scaled Brier score, are more sensitive than the AUC and the Gini index to improvements in the "gray zone." They are also easy to interpret and clinically relevant, which supports their use in evaluating prediction models.

Area of Science:

Biostatistics
Machine Learning
Medical Informatics

Background:

The area under the receiver operating characteristic curve (AUC) is a common performance measure for prediction models but can be insensitive to improvements from new markers, especially in the "gray zone."
Recently proposed relative-performance measures may yield contradictory conclusions.
Evaluating prediction model performance requires sensitive and interpretable metrics.
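
As a point of reference for the measures named above, here is a minimal Python sketch of how the AUC and a Gini index can be computed from predicted risks. It assumes the usual rank-based reading of the AUC (the probability that a randomly chosen case receives a higher score than a randomly chosen non-case, with ties counted as one half) and the common ROC-based relation Gini = 2 × AUC − 1; the study's exact definitions may differ. The function names and the four-subject data set are hypothetical and chosen only for illustration.

```python
import numpy as np


def auc_from_scores(y_true, scores):
    """AUC as P(score of a case > score of a non-case), ties counted as 1/2."""
    y = np.asarray(y_true, dtype=bool)
    s = np.asarray(scores, dtype=float)
    pos, neg = s[y], s[~y]
    if pos.size == 0 or neg.size == 0:
        raise ValueError("need at least one case and one non-case")
    # Compare every case with every non-case; fine for small illustrative data.
    diff = pos[:, None] - neg[None, :]
    return float(((diff > 0).sum() + 0.5 * (diff == 0).sum()) / diff.size)


def gini_from_auc(auc):
    """Gini index under the assumed relation Gini = 2 * AUC - 1."""
    return 2.0 * auc - 1.0


# Hypothetical toy data: outcomes (1 = case) and predicted risks.
y = [0, 0, 1, 1]
p = [0.1, 0.4, 0.35, 0.8]

auc = auc_from_scores(y, p)
gini = gini_from_auc(auc)
print(f"AUC = {auc:.3f}, Gini = {gini:.3f}")  # AUC = 0.750, Gini = 0.500
```

Both numbers depend only on how the model ranks subjects, not on the predicted probabilities themselves.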

Purpose of the Study:

To compare the sensitivity of various performance measures to changes in prediction model performance when a new marker is added.
To identify alternative measures that are more sensitive to improvements in the "gray zone" of prediction accuracy.

Main Methods:

Computer simulations were used to assess the performance of different measures.
The study evaluated the area under the receiver operating characteristic curve (AUC), Gini index, Pietra index, and scaled Brier score.
Simulations focused on scenarios where a new marker's discrimination power was concentrated in the "gray zone" of a baseline model.
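
The two alternative measures can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' code: the Pietra index is taken to be the maximum vertical distance between the ROC curve and the chance diagonal (the largest value of sensitivity minus false positive rate over all thresholds), and the scaled Brier score is taken to be 1 − Brier / [prevalence × (1 − prevalence)], a common scaling that may differ from the one used in the study. Function names and data are hypothetical.

```python
import numpy as np


def pietra_index(y_true, scores):
    """Assumed definition: max over thresholds of (sensitivity - false positive rate)."""
    y = np.asarray(y_true, dtype=bool)
    s = np.asarray(scores, dtype=float)
    if y.sum() == 0 or (~y).sum() == 0:
        raise ValueError("need at least one case and one non-case")
    best = 0.0
    for t in np.unique(s):
        tpr = (s[y] >= t).mean()   # sensitivity when "score >= t" is called positive
        fpr = (s[~y] >= t).mean()  # false positive rate at the same threshold
        best = max(best, tpr - fpr)
    return float(best)


def scaled_brier(y_true, probs):
    """Assumed definition: 1 - Brier / Brier of always predicting the prevalence."""
    y = np.asarray(y_true, dtype=float)
    p = np.asarray(probs, dtype=float)
    brier = np.mean((p - y) ** 2)
    prevalence = y.mean()
    brier_ref = prevalence * (1.0 - prevalence)
    if brier_ref == 0:
        raise ValueError("outcomes must include both cases and non-cases")
    return float(1.0 - brier / brier_ref)


# Hypothetical toy data: outcomes (1 = case) and predicted risks.
y = [0, 0, 1, 1]
p = [0.1, 0.4, 0.35, 0.8]

pietra = pietra_index(y, p)
sbrier = scaled_brier(y, p)
print(f"Pietra = {pietra:.4f}, scaled Brier = {sbrier:.4f}")  # Pietra = 0.5000, scaled Brier = 0.3675
```

Under these definitions the scaled Brier score uses the predicted probabilities directly, so it can change even when the ranking of subjects, and hence the AUC, stays the same.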

Main Results:

The area under the receiver operating characteristic curve (AUC) and the Gini index showed minimal performance improvements when the added marker's power was in the "gray zone."
The Pietra index and the scaled Brier score showed larger performance improvements in the same "gray zone" scenario.
These findings highlight differences in sensitivity among various prediction model performance metrics.

Conclusions:

The Pietra index and the scaled Brier score are recommended for prediction model performance measurement.
These measures are more sensitive than the AUC and the Gini index to markers that improve discrimination in the "gray zone."
Ease of interpretation and clinical relevance further support the use of the Pietra index and the scaled Brier score.