Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

2.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.5K
Multiple Regression01:25

Multiple Regression

3.3K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.3K
Variation01:19

Variation

6.4K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
6.4K
Methods of Medium Optimization01:28

Methods of Medium Optimization

70
Optimizing growth media enhances microbial proliferation and maximizes product yield. Statistical experimental design methodologies provide structured and reproducible approaches, offering progressively higher levels of robustness and efficiency.The One-Factor-at-a-Time (OFAT) MethodThe One-Factor-at-a-Time (OFAT) method involves adjusting a single variable while keeping all others constant. However, it cannot detect interactions between variables, often leading to suboptimal outcomes when...
70
Predicting Products: Substitution vs. Elimination02:52

Predicting Products: Substitution vs. Elimination

11.1K
When a nucleophile and an alkyl halide react, nucleophilic substitution and β-elimination reactions compete to generate products.
The following factors can influence the mechanisms competing against each other:
11.1K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

438
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
438

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Stage shifts in national lung adenocarcinoma and the impact of opportunistic self-initiated LDCT screening in Taiwan: a nationwide population-based cohort study.

The Lancet regional health. Western Pacific·2026
Same author

Evaluating risk prediction models: the Predictiveness curve and its geometric summaries.

American journal of epidemiology·2026
Same author

Age-related differences in outcomes and tumor characteristics of ultra-rare sarcomas: A Nation-wide Study.

Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology·2026
Same author

Association of Cancer Care Quality Certification With Survival Across Multiple Cancer Types: A Population-Based Cohort Study in Taiwan.

International journal of cancer·2026
Same author

Explainable artificial intelligence for personalized prognosis in pancreatic cancer: A nationwide study from Taiwan.

PLOS digital health·2026
Same author

Distribution of adverse pathological features and prognosis across tongue, buccal, gum, and other oral cancer subsites: A nationwide study.

American journal of otolaryngology·2026
Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026
Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026
Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026
Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026
Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026
Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026
See all related articles

Related Experiment Video

Updated: May 2, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K

Alternative performance measures for prediction models.

Yun-Chun Wu1, Wen-Chung Lee2

  • 1Institute of Epidemiology and Preventive Medicine, College of Public Health, National Taiwan University, Taipei, Taiwan.

Plos One
|March 11, 2014
PubMed
Summary
This summary is machine-generated.

New prediction model performance measures, the Pietra and scaled Brier indices, are more sensitive than AUC and Gini to improvements in the "gray zone." These indices offer better clinical relevance and interpretation for evaluating prediction models.

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

Related Experiment Videos

Last Updated: May 2, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

Area of Science:

  • Biostatistics
  • Machine Learning
  • Medical Informatics

Background:

  • The area under the receiver operating characteristic curve (AUC) is a common performance measure for prediction models but can be insensitive to improvements from new markers, especially in the "gray zone."
  • Recently proposed relative-performance measures may yield contradictory conclusions.
  • Evaluating prediction model performance requires sensitive and interpretable metrics.

Purpose of the Study:

  • To compare the sensitivity of various performance measures to changes in prediction model performance when a new marker is added.
  • To identify alternative measures that are more sensitive to improvements in the "gray zone" of prediction accuracy.

Main Methods:

  • Computer simulations were used to assess the performance of different measures.
  • The study evaluated the area under the receiver operating characteristic curve (AUC), Gini index, Pietra index, and scaled Brier score.
  • Simulations focused on scenarios where a new marker's discrimination power was concentrated in the "gray zone" of a baseline model.

Main Results:

  • The area under the receiver operating characteristic curve (AUC) and the Gini index showed minimal performance improvements when the added marker's power was in the "gray zone."
  • The Pietra index and the scaled Brier score demonstrated more significant performance improvements in the same "gray zone" scenario.
  • These findings highlight differences in sensitivity among various prediction model performance metrics.

Conclusions:

  • The Pietra index and the scaled Brier score are recommended for prediction model performance measurement.
  • These measures offer superior sensitivity to markers that improve discrimination in the "gray zone" compared to AUC and Gini.
  • Ease of interpretation and clinical relevance further support the use of the Pietra and scaled Brier indices.