Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Confidence Coefficient01:24

Confidence Coefficient

7.5K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
7.5K
Interpretation of Confidence Intervals01:19

Interpretation of Confidence Intervals

5.6K
A confidence interval is a better estimate of the population than a point estimate, as it uses a range of values from a sample instead of a single value.
Confidence intervals have confidence coefficients that are crucial for their interpretation. The most common confidence coefficients are 0.90, 0.95, and 0.99, which can be written as percentages–90%, 95%, and 99%, respectively.
Suppose a person calculates a confidence interval with a confidence coefficient of 0.95. In that case, they can...
5.6K
Ordinal Level of Measurement00:55

Ordinal Level of Measurement

22.9K
The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using an ordinal scale are similar to nominal scale data, but there is one major difference. The ordinal scale data can be ordered. An example of ordinal scale data is a list of the top five national parks...
22.9K
Confidence Intervals01:21

Confidence Intervals

6.1K
An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a  sample proportion. However, unlike the point estimate which is a single value, the confidence interval  contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...
6.1K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Regression Analysis01:11

Regression Analysis

5.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

DNA-Encoded Chemical Library Screening with Target Titration Analysis: DELTA.

Journal of medicinal chemistry·2026
Same author

A Target Class Ligandability Evaluation of WD40 Repeat-Containing Proteins.

Journal of medicinal chemistry·2024
Same author

Extensive exploration of structure activity relationships for the SARS-CoV-2 macrodomain from shape-based fragment merging and active learning.

bioRxiv : the preprint server for biology·2024
Same author

Data Sharing in Chemistry: Lessons Learned and a Case for Mandating Structured Reaction Data.

Journal of chemical information and modeling·2023
Same author

Evolving symbolic density functionals.

Science advances·2022
Same author

Defining Levels of Automated Chemical Design.

Journal of medicinal chemistry·2022
Same journal

PACEff Builder: An Efficient Platform for Constructing PACE Hybrid-Resolution Models for Molecular Dynamics Simulations of Aqueous Protein, Peptide Assembly, and Membrane Protein Systems.

Journal of chemical information and modeling·2026
Same journal

TransKla: A Local-Global Cross-Attention Based Transformer Approach for Prediction of Lysine Lactylation Sites.

Journal of chemical information and modeling·2026
Same journal

CondenSimAdapter: A Versatile Builder for Multiscale Simulations of Protein Condensates with Broad Force-Field Compatibility and Robust Dense-Phase Relaxation.

Journal of chemical information and modeling·2026
Same journal

Simulation Guided Design of a Potentially Hyperactive Ice Nucleating Protein.

Journal of chemical information and modeling·2026
Same journal

Setting the Bases of the Photogenotoxicity of <i>p</i>-Aminobenzoic Acid.

Journal of chemical information and modeling·2026
Same journal

Probing Charge-Controlled Inter-Domain Flexibility: Integrating Experimental and Coarse-Grained Approaches.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K

Ordinal Confidence Level Assignments for Regression Model Predictions.

Steven Kearnes1, Patrick Riley1

  • 1Relay Therapeutics, Cambridge, Massachusetts 02142, United States.

Journal of Chemical Information and Modeling
|December 9, 2024
PubMed
Summary
This summary is machine-generated.

We developed a straightforward method to assign reliable confidence scores to molecular property predictions from regression models. This approach aids decision-making in drug discovery by providing interpretable confidence levels.

More Related Videos

Assessment and Communication for People with Disorders of Consciousness
07:37

Assessment and Communication for People with Disorders of Consciousness

Published on: August 1, 2017

9.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Related Experiment Videos

Last Updated: Jun 5, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.0K
Assessment and Communication for People with Disorders of Consciousness
07:37

Assessment and Communication for People with Disorders of Consciousness

Published on: August 1, 2017

9.0K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Area of Science:

  • Computational chemistry
  • Machine learning in drug discovery
  • Quantitative structure-property relationship (QSPR) modeling

Background:

  • Accurate prediction of molecular properties is crucial for efficient drug discovery.
  • Assessing the reliability of these predictions is essential for informed decision-making.
  • Existing methods may lack interpretability or straightforward application.

Purpose of the Study:

  • To introduce a simple and interpretable method for quantifying prediction confidence.
  • To enable better decision-making in drug discovery pipelines.
  • To validate the proposed confidence assignment method.

Main Methods:

  • Development of a novel confidence scoring technique for regression models.
  • Application of time-split validation for robust performance assessment.
  • Utilizing internal assay data from Relay Therapeutics for empirical evaluation.

Main Results:

  • The proposed method successfully assigns accurate and interpretable confidence levels to molecular property predictions.
  • Demonstrated effectiveness in a realistic drug discovery context using time-split validation.
  • Confidence levels proved valuable for guiding decisions in drug discovery programs.

Conclusions:

  • The presented method offers a practical solution for enhancing the reliability of molecular property predictions.
  • Improved confidence assessment facilitates more effective and data-driven drug discovery strategies.
  • This approach has the potential to accelerate the identification and optimization of drug candidates.