Uncertainty Quantification for Clinical Outcome Predictions with (Large) Language Models.

Zizhang Chen¹, Peizhao Li², Xiaomeng Dong²

¹Brandeis University.

Findings of ACL. NAACL

December 4, 2025

Summary

This summary is machine-generated.

This study enhances AI reliability in healthcare by quantifying uncertainty in language models (LMs) for electronic health records (EHRs). Methods like ensembling and multi-tasking reduce prediction uncertainty, improving AI transparency and patient safety.

Area of Science:

Artificial Intelligence in Medicine
Clinical Informatics
Machine Learning for Healthcare

Background:

Language models (LMs) show promise for clinical prediction using electronic health records (EHRs).
High-stakes healthcare applications demand reliable AI predictions, necessitating robust uncertainty quantification.
Current AI models often lack transparency, posing risks to patient safety and ethical standards.

Purpose of the Study:

To develop and validate a framework for uncertainty quantification of LMs in EHR tasks.
To address uncertainty in both white-box (accessible parameters) and black-box (proprietary LMs like GPT-4) settings.
To enhance the reliability and transparency of AI-driven clinical predictions.

Main Methods:

Quantified uncertainty in white-box LMs using multi-tasking and ensemble techniques.
Extended uncertainty quantification to black-box models, including proprietary LMs.
Validated the framework on longitudinal clinical data from over 6,000 patients across ten prediction tasks.
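
To make the ensemble idea concrete, the sketch below shows one common way to measure uncertainty from an ensemble of white-box classifiers on a binary clinical outcome: the entropy of the averaged prediction, split into the average per-member entropy and the remaining disagreement between members. This is an illustrative assumption, not the authors' implementation; the function names and the example probabilities are hypothetical.

```python
import math


def binary_entropy(p: float) -> float:
    """Entropy in nats of a Bernoulli(p) prediction; 0 at p = 0 or p = 1."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))


def ensemble_uncertainty(member_probs: list[float]) -> dict[str, float]:
    """Summarize uncertainty for one patient from ensemble member outputs.

    member_probs holds each member's predicted probability of the positive
    outcome (hypothetical inputs, e.g. from separately fine-tuned LMs).
    """
    if not member_probs:
        raise ValueError("member_probs must contain at least one probability")
    mean_prob = sum(member_probs) / len(member_probs)
    # Total uncertainty: entropy of the averaged prediction.
    total = binary_entropy(mean_prob)
    # Expected uncertainty: average entropy of the individual members.
    expected = sum(binary_entropy(p) for p in member_probs) / len(member_probs)
    # Disagreement between members; clamp tiny negative rounding errors.
    disagreement = max(total - expected, 0.0)
    return {
        "mean_prob": mean_prob,
        "total": total,
        "expected": expected,
        "disagreement": disagreement,
    }


if __name__ == "__main__":
    print(ensemble_uncertainty([0.9, 0.9, 0.9]))  # members agree
    print(ensemble_uncertainty([0.1, 0.9]))       # members disagree
```

When members agree, the disagreement term is close to zero even if each member is individually unsure; when they diverge, it grows, which is the signal an ensemble adds over a single model.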

Main Results:

Proposed multi-tasking and ensemble methods effectively reduced model uncertainty in EHR tasks.
Ensembling and multi-task prediction prompts demonstrated uncertainty reduction across various clinical prediction scenarios.
The framework successfully increased model transparency in both white-box and black-box settings.
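
For the black-box setting, where model parameters and token probabilities are not accessible, one generic approach is to query the model several times and measure how much the returned answers vary. The sketch below is a minimal illustration under that assumption and is not taken from the paper; `answer_entropy` is a hypothetical helper, and the answer strings stand in for responses that would come from repeated calls to a proprietary LM.

```python
import math
from collections import Counter


def answer_entropy(sampled_answers: list[str]) -> tuple[str, float, float]:
    """Return (majority answer, its empirical frequency, entropy in nats).

    sampled_answers are the final labels parsed from repeated queries of the
    same prompt (hypothetical inputs); labels are compared case-insensitively.
    """
    if not sampled_answers:
        raise ValueError("sampled_answers must contain at least one answer")
    counts = Counter(answer.strip().lower() for answer in sampled_answers)
    n = sum(counts.values())
    probs = {label: count / n for label, count in counts.items()}
    # Shannon entropy of the empirical answer distribution.
    entropy = -sum(p * math.log(p) for p in probs.values())
    majority = counts.most_common(1)[0][0]
    return majority, probs[majority], entropy


if __name__ == "__main__":
    print(answer_entropy(["yes", "yes", "yes", "yes"]))   # consistent answers
    print(answer_entropy(["Yes", "no", "yes", "No "]))    # split answers
```

Low entropy means the model answers consistently across samples; high entropy flags a prediction that may warrant clinician review.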

Conclusions:

Uncertainty quantification using ensembling and multi-tasking improves the reliability of LMs for EHRs.
The developed framework enhances AI transparency and trustworthiness in clinical decision support.
This work advances the safe and ethical integration of AI in healthcare delivery.