Selective prediction-set models with coverage rate guarantees.

Jean Feng¹, Arjun Sondhi², Jessica Perry³

¹Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA.

December 2, 2021

Summary

This summary is machine-generated.

Selective prediction-set (SPS) models for healthcare machine learning (ML) can abstain from predicting when they are uncertain, which improves reliability on the cases they do answer. This offers a middle ground between full clinician oversight and fully automated prediction.

Keywords:

abstaining prediction models; cross-validation; ensemble methods; neural networks; prediction sets

Area of Science:

Machine Learning in Healthcare
Medical Informatics
Artificial Intelligence in Medicine

Background:

Current machine learning (ML) systems in healthcare either require full clinician oversight or operate without any human input.
This binary approach limits ML reliability and increases the burden on healthcare professionals.
A middle ground is needed to balance ML predictions with human expertise.

Purpose of the Study:

To develop a framework for selective prediction-set (SPS) models that can abstain from making predictions.
To improve the reliability of ML algorithms in healthcare by allowing them to abstain on difficult cases.
To reduce the workload on human experts by focusing their attention on cases requiring oversight.

Main Methods:

Introduced a general penalized loss minimization framework for training SPS models.
Developed a model-agnostic statistical inference procedure for evaluating the coverage rate of SPS models.
Ensembled individual SPS models trained using K-fold cross-validation for robust performance evaluation.
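
The idea of evaluating a coverage rate for a model that may abstain can be illustrated with a minimal Python sketch. It is not the authors' inference procedure: it assumes a held-out labelled set, represents an abstention as `None`, and uses a standard Wilson score interval for the proportion of accepted cases whose prediction set contains the true label. The function names `wilson_interval` and `selective_coverage` are hypothetical.

```python
import math
from typing import Dict, Optional, Sequence, Set, Tuple


def wilson_interval(successes: int, n: int, z: float = 1.96) -> Tuple[float, float]:
    """Wilson score interval for a binomial proportion (z=1.96 is roughly 95%)."""
    if n == 0:
        # No accepted cases: the data say nothing about coverage.
        return (0.0, 1.0)
    p = successes / n
    denom = 1.0 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return (max(0.0, center - half), min(1.0, center + half))


def selective_coverage(
    pred_sets: Sequence[Optional[Set[int]]],
    y_true: Sequence[int],
    z: float = 1.96,
) -> Dict[str, object]:
    """Coverage among non-abstained cases, with an interval estimate.

    pred_sets[i] is a set of candidate labels, or None when the model abstains.
    """
    if len(pred_sets) != len(y_true):
        raise ValueError("pred_sets and y_true must have the same length")
    # Keep only the cases where the model actually produced a prediction set.
    accepted = [(s, y) for s, y in zip(pred_sets, y_true) if s is not None]
    n = len(accepted)
    covered = sum(1 for s, y in accepted if y in s)
    return {
        "acceptance_rate": n / len(y_true) if len(y_true) else 0.0,
        "coverage": covered / n if n else float("nan"),
        "ci": wilson_interval(covered, n, z),
    }


if __name__ == "__main__":
    # Toy data: the third case is an abstention.
    sets = [{0, 1}, {1}, None, {2}, {0}]
    labels = [0, 1, 2, 1, 0]
    print(selective_coverage(sets, labels))
```

Reporting the acceptance rate alongside coverage matters here, because a model can trivially raise its coverage on accepted cases by abstaining more often.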

Main Results:

SPS models abstain from predictions when outcomes are difficult to predict accurately, particularly for out-of-distribution data.
Models achieve higher predictive accuracy on cases where they do provide a prediction.
SPS ensembles demonstrated coverage rates closer to the nominal level with narrower confidence intervals.
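
The abstain-or-predict behaviour described above can be made concrete with a small Python sketch. This is an illustrative heuristic only, not the penalized-loss training summarized under Main Methods: it assumes the model already outputs class probabilities, builds the smallest set of classes whose probability mass reaches `1 - alpha`, and abstains when that set would exceed `max_set_size`. The function name and both thresholds are hypothetical.

```python
from typing import List, Optional, Set


def predict_set_or_abstain(
    probs: List[float],
    alpha: float = 0.1,
    max_set_size: int = 2,
) -> Optional[Set[int]]:
    """Return a prediction set of class indices, or None to abstain.

    Assumption (not from the source): abstention is triggered when the set
    needed to reach probability mass 1 - alpha is larger than max_set_size.
    """
    # Visit classes from most to least probable.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen: List[int] = []
    mass = 0.0
    for i in order:
        chosen.append(i)
        mass += probs[i]
        if mass >= 1.0 - alpha:
            break
    if len(chosen) > max_set_size:
        # Too uncertain to give a useful set: defer to a human expert.
        return None
    return set(chosen)


if __name__ == "__main__":
    print(predict_set_or_abstain([0.95, 0.03, 0.02]))  # confident case
    print(predict_set_or_abstain([0.85, 0.10, 0.05]))  # two plausible classes
    print(predict_set_or_abstain([0.40, 0.35, 0.25]))  # diffuse case
```

With these hypothetical thresholds, a diffuse probability vector leads to abstention, which is the kind of case the summary describes as being routed to human oversight.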

Conclusions:

Selective prediction-set (SPS) models offer a promising approach to enhance ML reliability in healthcare.
The ability to abstain improves accuracy and reduces the burden on human experts.
The approach was demonstrated on ICU patient data and on MNIST image prediction, and it shows potential for improving diagnostic accuracy.