Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Uncertainty in Measurement: Accuracy and Precision03:37

Uncertainty in Measurement: Accuracy and Precision

73.7K
Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value. 
73.7K
Uncertainty: Confidence Intervals00:54

Uncertainty: Confidence Intervals

4.1K
The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...
4.1K
Uncertainty: Overview00:59

Uncertainty: Overview

554
In analytical chemistry, we often perform repetitive measurements to detect and minimize inaccuracies caused by both determinate and indeterminate errors. Despite the cares we take, the presence of random errors means that repeated measurements almost never have exactly the same magnitude. The collective difference between these measurements - observed values - and the estimated or expected value is called uncertainty. Uncertainty is conventionally written after the estimated or expected value.
554
Propagation of Uncertainty from Random Error00:59

Propagation of Uncertainty from Random Error

682
An experiment often consists of more than a single step. In this case, measurements at each step give rise to uncertainty. Because the measurements occur in successive steps, the uncertainty in one step necessarily contributes to that in the subsequent step. As we perform statistical analysis on these types of experiments, we must learn to account for the propagation of uncertainty from one step to the next. The propagation of uncertainty depends on the type of arithmetic operation performed on...
682
Propagation of Uncertainty from Systematic Error01:10

Propagation of Uncertainty from Systematic Error

518
The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...
518
Confidence Interval for Estimating Population Mean01:25

Confidence Interval for Estimating Population Mean

7.3K
A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Validating Physiologically-Based Pharmacokinetic Models Using the Continuous Ranked Probability Score: Beyond Being Correct on Average.

CPT: pharmacometrics & systems pharmacology·2026
Same author

Power and clinical utility of mesopic microperimetry analysis strategies in age-related macular degeneration.

Acta ophthalmologica·2025
Same author

Structural equation modeling to explore putative causal factors for chronic fatigue in childhood cancer survivors: a DCCSS LATER study.

Journal of cancer survivorship : research and practice·2025
Same author

Long-term sequelae of SARS-CoV-2 two years following infection: exploring the interplay of biological, psychological, and social factors.

Psychological medicine·2024
Same author

Effect of Early Levodopa Treatment on Mortality in People with Parkinson's Disease.

Movement disorders clinical practice·2024
Same author

The effect of cardiovascular risk on disease progression in <i>de novo</i> Parkinson's disease patients: An observational analysis.

Frontiers in neurology·2023
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Jul 1, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K

How to evaluate uncertainty estimates in machine learning for regression?

Laurens Sluijterman1, Eric Cator2, Tom Heskes3

  • 1Department of Mathematics, Radboud University, P.O. Box 9010-59, 6500 GL, Nijmegen, Netherlands.

Neural Networks : the Official Journal of the International Neural Network Society
|March 5, 2024
PubMed
Summary
This summary is machine-generated.

Current methods for evaluating neural network uncertainty estimates are flawed. We propose a new simulation-based approach for better assessment and development of uncertainty quantification techniques.

Keywords:
BootstrapDropoutNeural networksRegressionUncertainty

More Related Videos

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.2K
Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements
10:22

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Published on: September 7, 2019

8.3K

Related Experiment Videos

Last Updated: Jul 1, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K
Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma
04:09

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

8.2K
Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements
10:22

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Published on: September 7, 2019

8.3K

Area of Science:

  • Machine Learning
  • Artificial Intelligence
  • Statistical Modeling

Background:

  • Neural networks are increasingly used, necessitating reliable uncertainty estimation.
  • Existing methods for evaluating uncertainty estimates (log-likelihood for densities, coverage for prediction intervals) have limitations.
  • These methods struggle to disentangle components of predictive uncertainty and compare diverse estimation approaches.

Purpose of the Study:

  • To identify and analyze the fundamental flaws in current methods for evaluating uncertainty estimates from neural networks.
  • To demonstrate the limitations of log-likelihood and direct prediction interval testing.
  • To propose a novel, simulation-based approach for more robust and comparable evaluation of uncertainty quantification.

Main Methods:

  • Theoretical analysis of existing evaluation metrics (log-likelihood, prediction interval coverage).
  • Simulations to demonstrate the shortcomings of current methods, including issues with marginal vs. pointwise coverage.
  • Development and proposal of a simulation-based testing framework for uncertainty quantification.

Main Results:

  • Both log-likelihood and direct prediction interval testing exhibit significant flaws.
  • Current methods cannot reliably assess individual components of predictive uncertainty.
  • Testing on a single dataset can mask undesirable behaviors like over/underconfidence.
  • A better log-likelihood does not guarantee improved prediction intervals.

Conclusions:

  • Existing methods for evaluating neural network uncertainty estimates are inadequate.
  • A new simulation-based approach is proposed to overcome these limitations.
  • This new approach facilitates better comparison and development of uncertainty quantification methods.