Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

2.4K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.4K
Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

788
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
788
Censoring Survival Data01:09

Censoring Survival Data

276
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
276
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

3.6K
A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n)  to the number of categories (k).
3.6K
Survival Tree01:19

Survival Tree

180
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
180
Kaplan-Meier Approach01:24

Kaplan-Meier Approach

299
The Kaplan-Meier estimator is a non-parametric method used to estimate the survival function from time-to-event data. In medical research, it is frequently employed to measure the proportion of patients surviving for a certain period after treatment. This estimator is fundamental in analyzing time-to-event data, making it indispensable in clinical trials, epidemiological studies, and reliability engineering. By estimating survival probabilities, researchers can evaluate treatment effectiveness,...
299

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Doubly regularized generalized linear models for spatial observations with high-dimensional covariates.

Journal of the Royal Statistical Society. Series C, Applied statistics·2026
Same author

Clinical trials for continuously monitored and updated AI systems.

Nature medicine·2026
Same author

Investigating the analytical robustness of the social and behavioural sciences.

Nature·2026
Same author

Differential expression analysis for spatially correlated data using smiDE.

Genome biology·2026
Same author

Minimally Invasive Distal Pancreatectomy as the Standard of Care in the US: Are We There Yet?

Cancers·2025
Same author

Firearm Injuries: Unveiling the Unmatched Healthcare Burden and Costs.

Annals of surgery open : perspectives of surgical history, education, and clinical approaches·2025
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Oct 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K

Selective prediction-set models with coverage rate guarantees.

Jean Feng1, Arjun Sondhi2, Jessica Perry3

  • 1Department of Epidemiology and Biostatistics, University of California, San Francisco, California, USA.

Biometrics
|December 2, 2021
PubMed
Summary
This summary is machine-generated.

New machine learning (ML) models for healthcare can abstain from predictions when uncertain, improving reliability. Selective prediction-set (SPS) models offer a balanced approach, enhancing accuracy for critical healthcare decisions.

Keywords:
abstaining prediction modelscross-validationensemble methodsneural networksprediction sets

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K

Related Experiment Videos

Last Updated: Oct 11, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.2K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K

Area of Science:

  • Machine Learning in Healthcare
  • Medical Informatics
  • Artificial Intelligence in Medicine

Background:

  • Current machine learning (ML) in healthcare requires full clinician oversight or operates without human input.
  • This binary approach limits ML reliability and increases the burden on healthcare professionals.
  • A middle ground is needed to balance ML predictions with human expertise.

Purpose of the Study:

  • To develop a framework for selective prediction-set (SPS) models that can abstain from making predictions.
  • To improve the reliability of ML algorithms in healthcare by allowing them to abstain on difficult cases.
  • To reduce the workload on human experts by focusing their attention on cases requiring oversight.

Main Methods:

  • Introduced a general penalized loss minimization framework for training SPS models.
  • Developed a model-agnostic statistical inference procedure for evaluating the coverage rate of SPS models.
  • Ensembled individual SPS models trained using K-fold cross-validation for robust performance evaluation.

Main Results:

  • SPS models abstain from predictions when outcomes are difficult to predict accurately, particularly for out-of-distribution data.
  • Models achieve higher predictive accuracy on cases where they do provide a prediction.
  • SPS ensembles demonstrated coverage rates closer to the nominal level with narrower confidence intervals.

Conclusions:

  • Selective prediction-set (SPS) models offer a promising approach to enhance ML reliability in healthcare.
  • The ability to abstain improves accuracy and reduces the burden on human experts.
  • This method shows potential for improving diagnostic accuracy, as demonstrated in ICU patient data and MNIST image prediction.