Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Uncertainty in Measurement: Accuracy and Precision

Uncertainty in Measurement: Accuracy and Precision

Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value.

Uncertainty: Confidence Intervals

Uncertainty: Confidence Intervals

The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...

Uncertainty: Overview

Uncertainty: Overview

In analytical chemistry, we often perform repetitive measurements to detect and minimize inaccuracies caused by both determinate and indeterminate errors. Despite the cares we take, the presence of random errors means that repeated measurements almost never have exactly the same magnitude. The collective difference between these measurements - observed values - and the estimated or expected value is called uncertainty. Uncertainty is conventionally written after the estimated or expected value.

Propagation of Uncertainty from Random Error

Propagation of Uncertainty from Random Error

An experiment often consists of more than a single step. In this case, measurements at each step give rise to uncertainty. Because the measurements occur in successive steps, the uncertainty in one step necessarily contributes to that in the subsequent step. As we perform statistical analysis on these types of experiments, we must learn to account for the propagation of uncertainty from one step to the next. The propagation of uncertainty depends on the type of arithmetic operation performed on...

Propagation of Uncertainty from Systematic Error

Propagation of Uncertainty from Systematic Error

The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...

Confidence Interval for Estimating Population Mean

Confidence Interval for Estimating Population Mean

A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Validating Physiologically-Based Pharmacokinetic Models Using the Continuous Ranked Probability Score: Beyond Being Correct on Average.

CPT: pharmacometrics & systems pharmacology·2026

Same author

Power and clinical utility of mesopic microperimetry analysis strategies in age-related macular degeneration.

Acta ophthalmologica·2025

Same author

Structural equation modeling to explore putative causal factors for chronic fatigue in childhood cancer survivors: a DCCSS LATER study.

Journal of cancer survivorship : research and practice·2025

Same author

Long-term sequelae of SARS-CoV-2 two years following infection: exploring the interplay of biological, psychological, and social factors.

Psychological medicine·2024

Same author

Effect of Early Levodopa Treatment on Mortality in People with Parkinson's Disease.

Movement disorders clinical practice·2024

Same author

The effect of cardiovascular risk on disease progression in <i>de novo</i> Parkinson's disease patients: An observational analysis.

Frontiers in neurology·2023

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 1, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

How to evaluate uncertainty estimates in machine learning for regression?

Laurens Sluijterman¹, Eric Cator², Tom Heskes³

¹Department of Mathematics, Radboud University, P.O. Box 9010-59, 6500 GL, Nijmegen, Netherlands.

Neural Networks : the Official Journal of the International Neural Network Society

|March 5, 2024

Summary

This summary is machine-generated.

Current methods for evaluating neural network uncertainty estimates are flawed. We propose a new simulation-based approach for better assessment and development of uncertainty quantification techniques.

Keywords:

Bootstrap Dropout Neural networks Regression Uncertainty

More Related Videos

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Published on: September 7, 2019

Related Experiment Videos

Last Updated: Jul 1, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Predicting Treatment Response to Image-Guided Therapies Using Machine Learning: An Example for Trans-Arterial Treatment of Hepatocellular Carcinoma

Published on: October 10, 2018

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Split Point Analysis and Uncertainty Quantification of Thermal-Optical Organic/Elemental Carbon Measurements

Published on: September 7, 2019

Area of Science:

Machine Learning
Artificial Intelligence
Statistical Modeling

Background:

Neural networks are increasingly used, necessitating reliable uncertainty estimation.
Existing methods for evaluating uncertainty estimates (log-likelihood for densities, coverage for prediction intervals) have limitations.
These methods struggle to disentangle components of predictive uncertainty and compare diverse estimation approaches.

Purpose of the Study:

To identify and analyze the fundamental flaws in current methods for evaluating uncertainty estimates from neural networks.
To demonstrate the limitations of log-likelihood and direct prediction interval testing.
To propose a novel, simulation-based approach for more robust and comparable evaluation of uncertainty quantification.

Main Methods:

Theoretical analysis of existing evaluation metrics (log-likelihood, prediction interval coverage).
Simulations to demonstrate the shortcomings of current methods, including issues with marginal vs. pointwise coverage.
Development and proposal of a simulation-based testing framework for uncertainty quantification.

Main Results:

Both log-likelihood and direct prediction interval testing exhibit significant flaws.
Current methods cannot reliably assess individual components of predictive uncertainty.
Testing on a single dataset can mask undesirable behaviors like over/underconfidence.
A better log-likelihood does not guarantee improved prediction intervals.

Conclusions:

Existing methods for evaluating neural network uncertainty estimates are inadequate.
A new simulation-based approach is proposed to overcome these limitations.
This new approach facilitates better comparison and development of uncertainty quantification methods.