Updated: Jun 5, 2025

Ordinal Confidence Level Assignments for Regression Model Predictions.

Steven Kearnes¹, Patrick Riley¹

¹Relay Therapeutics, Cambridge, Massachusetts 02142, United States.

Journal of Chemical Information and Modeling

December 9, 2024

Summary

This summary is machine-generated.

We developed a straightforward method to assign reliable confidence scores to molecular property predictions from regression models. This approach aids decision-making in drug discovery by providing interpretable confidence levels.

Area of Science:

Computational chemistry
Machine learning in drug discovery
Quantitative structure-property relationship (QSPR) modeling

Background:

Accurate prediction of molecular properties is crucial for efficient drug discovery.
Assessing the reliability of these predictions is essential for informed decision-making.
Existing methods may lack interpretability or straightforward application.

Purpose of the Study:

To introduce a simple and interpretable method for quantifying prediction confidence.
To enable better decision-making in drug discovery pipelines.
To validate the proposed confidence assignment method.

Main Methods:

Development of a novel confidence scoring technique for regression models.
Application of time-split validation for robust performance assessment.
Use of internal assay data from Relay Therapeutics for empirical evaluation.
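
Time-split validation, mentioned in the methods above, can be illustrated with a minimal sketch. It assumes each assay record carries a measurement date; the record fields, the `time_split` helper, and the cutoff date are hypothetical and are not taken from the study.

```python
from datetime import date


def time_split(records, cutoff):
    """Split records into (train, test) by measurement date.

    Records dated before `cutoff` go to the training set and the rest go to
    the test set, so a model is only evaluated on measurements made after
    everything it was trained on.
    """
    train = [r for r in records if r["date"] < cutoff]
    test = [r for r in records if r["date"] >= cutoff]
    return train, test


# Hypothetical assay records (illustrative values only).
records = [
    {"compound": "A", "date": date(2023, 1, 10), "value": 6.1},
    {"compound": "B", "date": date(2023, 4, 2), "value": 7.3},
    {"compound": "C", "date": date(2023, 8, 15), "value": 5.8},
    {"compound": "D", "date": date(2023, 11, 30), "value": 6.9},
]

train, test = time_split(records, cutoff=date(2023, 7, 1))
```

Compared with a random split, this arrangement mimics prospective use: the test compounds come from a later stage of a program than the training compounds, which usually makes the evaluation harder and more realistic.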

Main Results:

The proposed method successfully assigns accurate and interpretable confidence levels to molecular property predictions.
The method was effective in a realistic drug discovery context, as assessed by time-split validation.
Confidence levels proved valuable for guiding decisions in drug discovery programs.
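
The summary does not describe how the confidence levels are computed, so the following is only a generic illustration of what an ordinal confidence assignment can look like, not the authors' method. It assumes each prediction comes with some uncertainty score (for example, the spread of an ensemble), chooses cut points on held-out data, and reports the mean absolute error observed at each level. All names and numbers are hypothetical.

```python
from statistics import mean

# Ordinal levels, ordered from most to least confident (hypothetical labels).
LEVELS = ["high", "medium", "low"]


def fit_thresholds(uncertainties):
    """Pick cut points that split held-out uncertainties into equal-size bins."""
    ranked = sorted(uncertainties)
    n = len(LEVELS)
    return [ranked[len(ranked) * k // n] for k in range(1, n)]


def assign_level(uncertainty, thresholds):
    """Map an uncertainty score to an ordinal confidence level."""
    for level, cut in zip(LEVELS, thresholds):
        if uncertainty < cut:
            return level
    return LEVELS[-1]


def error_by_level(uncertainties, abs_errors, thresholds):
    """Mean absolute error of held-out predictions within each level."""
    grouped = {level: [] for level in LEVELS}
    for u, e in zip(uncertainties, abs_errors):
        grouped[assign_level(u, thresholds)].append(e)
    return {level: mean(errs) for level, errs in grouped.items() if errs}


# Hypothetical held-out uncertainty scores and absolute prediction errors.
uncertainties = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
abs_errors = [0.1, 0.2, 0.4, 0.3, 0.9, 0.7]

thresholds = fit_thresholds(uncertainties)
summary = error_by_level(uncertainties, abs_errors, thresholds)
```

A level is only interpretable if the observed error rises from one level to the next, so a table like `summary` is the kind of check that tells a user what "high" or "low" confidence means in practice.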

Conclusions:

The presented method offers a practical solution for enhancing the reliability of molecular property predictions.
Improved confidence assessment facilitates more effective and data-driven drug discovery strategies.
This approach has the potential to accelerate the identification and optimization of drug candidates.