Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Uncertainty: Confidence Intervals

Uncertainty: Confidence Intervals

The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...

Binomial Probability Distribution

Binomial Probability Distribution

A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...

Propagation of Uncertainty from Systematic Error

Propagation of Uncertainty from Systematic Error

The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...

Propagation of Uncertainty from Random Error

Propagation of Uncertainty from Random Error

An experiment often consists of more than a single step. In this case, measurements at each step give rise to uncertainty. Because the measurements occur in successive steps, the uncertainty in one step necessarily contributes to that in the subsequent step. As we perform statistical analysis on these types of experiments, we must learn to account for the propagation of uncertainty from one step to the next. The propagation of uncertainty depends on the type of arithmetic operation performed on...

Uncertainty: Overview

Uncertainty: Overview

In analytical chemistry, we often perform repetitive measurements to detect and minimize inaccuracies caused by both determinate and indeterminate errors. Despite the cares we take, the presence of random errors means that repeated measurements almost never have exactly the same magnitude. The collective difference between these measurements - observed values - and the estimated or expected value is called uncertainty. Uncertainty is conventionally written after the estimated or expected value.

Confidence Intervals

Confidence Intervals

An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A confidence...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

What, This "Base" Is Not a Base? Common Misconceptions about Aqueous Ionization That May Hinder Drug Discovery and Development.

Journal of medicinal chemistry·2025

Same author

The Value of Functional Fascicular Block in Patients With Atrioventricular Re-Entrant Tachycardia.

JACC. Clinical electrophysiology·2025

Same author

Identification of a Novel Indolizine RORγT Inverse Agonist Using the AI-Driven Drug Design Platform.

ACS medicinal chemistry letters·2025

Same author

Thermodynamics-informed neural networks and extensive data sets: key factors to accurate blind predictions of apparent p<i>K</i><sub>a</sub> values in the euroSAMPL challenge.

Physical chemistry chemical physics : PCCP·2025

Same author

Modeling Carbon Basicity.

Molecular informatics·2025

Same author

Approach to the Diagnosis and Management of Complex Fascicular Ventricular Tachycardias.

Circulation. Arrhythmia and electrophysiology·2024

Same journal

Multimodal feature fusion for molecular property classification.

Journal of cheminformatics·2026

Same journal

P2MAT: A machine learning (ML) driven software for Property Prediction of MATerial.

Journal of cheminformatics·2026

Same journal

Computational design of low-volatility lubricants for space using interpretable machine learning.

Journal of cheminformatics·2026

Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026

Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026

Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 27, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Using beta binomials to estimate classification uncertainty for ensemble models.

Robert D Clark¹, Wenkel Liang¹, Adam C Lee¹

¹Department of Life Sciences, Simulations Plus, Inc., 45205 10th Street West, Lancaster, CA 93534, USA.

Journal of Cheminformatics

|July 3, 2014

Summary

This summary is machine-generated.

Ensemble models improve drug discovery by assessing prediction confidence. Analyzing submodel agreement and error distributions accurately estimates reliability for individual predictions, reducing costs and animal testing.

Keywords:

ANNE Artificial neural network ensemble Classification Confidence Error estimation Predictive value QSAR Uncertainty

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Identification and Classification of Position-specific GABA_A Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

Related Experiment Videos

Last Updated: Apr 27, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Identification and Classification of Position-specific GABA_A Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

Area of Science:

Computational chemistry and cheminformatics
Pharmacology and toxicology

Background:

Quantitative structure-activity relationship (QSAR) models offer significant potential for reducing drug discovery costs and animal testing.
Estimating the overall reliability of QSAR models has advanced, but assessing confidence in individual predictions remains crucial for researchers and regulators.

Purpose of the Study:

To develop a method for estimating the confidence in individual predictions made by ensemble models.
To leverage the agreement among submodels within an ensemble to quantify prediction reliability.

Main Methods:

Utilized artificial neural network ensembles (ANNEs) with two classification methods: vote tallies and averaged outputs.
Modeled prediction and error distributions using the beta binomial distribution.
Validated the method on large datasets including logP, Ames mutagenicity, and CYP2D6 inhibition data.

Main Results:

The agreement among submodels in an ensemble provides information on prediction reliability.
Beta binomial distributions accurately modeled prediction and error distributions, enabling probability estimation of classification errors.
The method accurately predicted external validation set performance using training pool data, even with imbalanced datasets.
Prospective misclassification likelihood correlated with submodel consensus and was accurately estimated.

Conclusions:

Confidence in individual ensemble predictions can be accurately assessed by analyzing prediction and error distributions relative to submodel agreement.
Ensemble uncertainty estimation can be improved by adjusting classification thresholds based on error distribution parameters.
Model profiles can indicate unreliable uncertainty estimates without external validation.