Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Uncertainty: Confidence Intervals00:54

Uncertainty: Confidence Intervals

9.9K
The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...
9.9K
Binomial Probability Distribution01:15

Binomial Probability Distribution

13.1K
A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...
13.1K
Propagation of Uncertainty from Systematic Error01:10

Propagation of Uncertainty from Systematic Error

1.4K
The atomic mass of an element varies due to the relative ratio of its isotopes. A sample's relative proportion of oxygen isotopes influences its average atomic mass. For instance, if we were to measure the atomic mass of oxygen from a sample, the mass would be a weighted average of the isotopic masses of oxygen in that sample. Since a single sample is not likely to perfectly reflect the true atomic mass of oxygen for all the molecules of oxygen on Earth, the mass we obtain from this...
1.4K
Propagation of Uncertainty from Random Error00:59

Propagation of Uncertainty from Random Error

1.9K
An experiment often consists of more than a single step. In this case, measurements at each step give rise to uncertainty. Because the measurements occur in successive steps, the uncertainty in one step necessarily contributes to that in the subsequent step. As we perform statistical analysis on these types of experiments, we must learn to account for the propagation of uncertainty from one step to the next. The propagation of uncertainty depends on the type of arithmetic operation performed on...
1.9K
Uncertainty: Overview00:59

Uncertainty: Overview

1.6K
In analytical chemistry, we often perform repetitive measurements to detect and minimize inaccuracies caused by both determinate and indeterminate errors. Despite the cares we take, the presence of random errors means that repeated measurements almost never have exactly the same magnitude. The collective difference between these measurements - observed values - and the estimated or expected value is called uncertainty. Uncertainty is conventionally written after the estimated or expected value.
1.6K
Confidence Intervals01:21

Confidence Intervals

9.2K
An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A confidence...
9.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

What, This "Base" Is Not a Base? Common Misconceptions about Aqueous Ionization That May Hinder Drug Discovery and Development.

Journal of medicinal chemistry·2025
Same author

The Value of Functional Fascicular Block in Patients With Atrioventricular Re-Entrant Tachycardia.

JACC. Clinical electrophysiology·2025
Same author

Identification of a Novel Indolizine RORγT Inverse Agonist Using the AI-Driven Drug Design Platform.

ACS medicinal chemistry letters·2025
Same author

Thermodynamics-informed neural networks and extensive data sets: key factors to accurate blind predictions of apparent p<i>K</i><sub>a</sub> values in the euroSAMPL challenge.

Physical chemistry chemical physics : PCCP·2025
Same author

Modeling Carbon Basicity.

Molecular informatics·2025
Same author

Approach to the Diagnosis and Management of Complex Fascicular Ventricular Tachycardias.

Circulation. Arrhythmia and electrophysiology·2024
Same journal

Multimodal feature fusion for molecular property classification.

Journal of cheminformatics·2026
Same journal

P2MAT: A machine learning (ML) driven software for Property Prediction of MATerial.

Journal of cheminformatics·2026
Same journal

Computational design of low-volatility lubricants for space using interpretable machine learning.

Journal of cheminformatics·2026
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: Apr 27, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K

Using beta binomials to estimate classification uncertainty for ensemble models.

Robert D Clark1, Wenkel Liang1, Adam C Lee1

  • 1Department of Life Sciences, Simulations Plus, Inc., 45205 10th Street West, Lancaster, CA 93534, USA.

Journal of Cheminformatics
|July 3, 2014
PubMed
Summary
This summary is machine-generated.

Ensemble models improve drug discovery by assessing prediction confidence. Analyzing submodel agreement and error distributions accurately estimates reliability for individual predictions, reducing costs and animal testing.

Keywords:
ANNEArtificial neural network ensembleClassificationConfidenceError estimationPredictive valueQSARUncertainty

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K
Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

1.5K

Related Experiment Videos

Last Updated: Apr 27, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K
Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons
08:04

Identification and Classification of Position-specific GABAA Receptor Subunit Missense Variants for Their Role In Hippocampal Pyramidal Neurons

Published on: June 6, 2025

1.5K

Area of Science:

  • Computational chemistry and cheminformatics
  • Pharmacology and toxicology

Background:

  • Quantitative structure-activity relationship (QSAR) models offer significant potential for reducing drug discovery costs and animal testing.
  • Estimating the overall reliability of QSAR models has advanced, but assessing confidence in individual predictions remains crucial for researchers and regulators.

Purpose of the Study:

  • To develop a method for estimating the confidence in individual predictions made by ensemble models.
  • To leverage the agreement among submodels within an ensemble to quantify prediction reliability.

Main Methods:

  • Utilized artificial neural network ensembles (ANNEs) with two classification methods: vote tallies and averaged outputs.
  • Modeled prediction and error distributions using the beta binomial distribution.
  • Validated the method on large datasets including logP, Ames mutagenicity, and CYP2D6 inhibition data.

Main Results:

  • The agreement among submodels in an ensemble provides information on prediction reliability.
  • Beta binomial distributions accurately modeled prediction and error distributions, enabling probability estimation of classification errors.
  • The method accurately predicted external validation set performance using training pool data, even with imbalanced datasets.
  • Prospective misclassification likelihood correlated with submodel consensus and was accurately estimated.

Conclusions:

  • Confidence in individual ensemble predictions can be accurately assessed by analyzing prediction and error distributions relative to submodel agreement.
  • Ensemble uncertainty estimation can be improved by adjusting classification thresholds based on error distribution parameters.
  • Model profiles can indicate unreliable uncertainty estimates without external validation.