Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.2K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.2K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Constraints and Statical Determinacy01:26

Constraints and Statical Determinacy

559
In structural engineering, the equilibrium of a system is not only determined by its equations of equilibrium but also with the help of constraints. Constraints refer to restrictions on the motion of a system. The proper combinations of constraints can minimize the total number of constraints needed to maintain a system in mechanical equilibrium. When this happens, the system is said to be statically determinate. For such systems, the unknown reaction supports can be estimated using equilibrium...
559
Goodness-of-Fit Test01:16

Goodness-of-Fit Test

3.3K
The goodness-of-fit test is a type of hypothesis test which determines whether the data "fits" a particular distribution. For example, one may suspect that some anonymous data may fit a binomial distribution. A chi-square test (meaning the distribution for the hypothesis test is chi-square) can be used to determine if there is a fit. The null and alternative hypotheses may be written in sentences or stated as equations or inequalities. The test statistic for a goodness-of-fit test is given as...
3.3K
Regression Analysis01:11

Regression Analysis

5.5K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.5K
Conserved Binding Sites01:49

Conserved Binding Sites

4.1K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Methyl Cation Affinity and Methyl Anion Affinity Prediction Using Uni-Mol-Based Models.

The journal of physical chemistry. A·2026
Same author

Machine Learning Prediction for Fe(II) Spin-Crossover Complex in the Same Spin State Using Geometrical and Topological Descriptors.

Journal of chemical information and modeling·2026
Same author

Cyanomethylation of Aldehydes on an Electrochemical Microflow System and Utility of Machine Learning-Assisted Examination of the Reaction Conditions.

Chemistry (Weinheim an der Bergstrasse, Germany)·2025
Same author

Analog Accessibility Score (AAscore) for Rational Compound Selection.

Journal of chemical information and modeling·2024
Same author

Chemical Graph-Based Transformer Models for Yield Prediction of High-Throughput Cross-Coupling Reaction Datasets.

ACS omega·2024
Same author

Bridging Structure- and Ligand-Based Virtual Screening through Fragmented Interaction Fingerprint.

ACS omega·2024
Same journal

PFASGroups: An Open-Source Framework for Automated Identification, Structural Classification, and Prioritization of Per- and Polyfluoroalkyl Substances.

Journal of chemical information and modeling·2026
Same journal

DeepKbhb: Context-Aware Prediction of Human Lysine β-Hydroxybutyrylation Sites.

Journal of chemical information and modeling·2026
Same journal

HyperDC: A Non-Uniform Hypergraph Framework for Dual- and Higher-Order Drug Combination Recommendation Across Diverse Complex Diseases.

Journal of chemical information and modeling·2026
Same journal

Correction to "AstraMEV (AI-Guided Structural Assembly of Multi-Epitope Vaccines) Against Infectious Bronchitis Virus".

Journal of chemical information and modeling·2026
Same journal

MolPy: A Large Language Model-Friendly Toolkit for Reactive Topology Editing in Polymer Simulations.

Journal of chemical information and modeling·2026
Same journal

Molecular Mechanisms of KIT Receptor Dimerization and Oncogenic Activation Revealed by Multiscale Simulations.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: May 21, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K

Understanding Conformation Importance in Data-Driven Property Prediction Models.

Yu Hamakawa1, Tomoyuki Miyao1,2

  • 1Graduate School of Science and Technology, Nara Institute of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan.

Journal of Chemical Information and Modeling
|March 18, 2025
PubMed
Summary
This summary is machine-generated.

Using multiple molecular conformers improves machine learning property prediction. An end-to-end model, Uni-Mol, utilizing atomic coordinates, achieved high accuracy, outperforming traditional descriptors, especially for conformation-sensitive properties.

More Related Videos

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

448

Related Experiment Videos

Last Updated: May 21, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K
Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications
03:31

Author Spotlight: Enhancement of Salient Object Detection for Smart Grid Applications

Published on: December 15, 2023

448

Area of Science:

  • Chemoinformatics
  • Computational Chemistry
  • Machine Learning

Background:

  • Accurate molecular property prediction is crucial for drug discovery and materials design.
  • The choice of molecular representation, particularly the inclusion of conformational information, significantly impacts prediction model performance.
  • A systematic analysis of how conformational data influences property prediction models is lacking.

Purpose of the Study:

  • To investigate the impact of using multiple molecular conformers on machine learning-based property prediction.
  • To compare the effectiveness of 2D and 3D molecular descriptors in property prediction tasks.
  • To evaluate the performance of an end-to-end model (Uni-Mol) against traditional descriptors using controlled datasets.

Main Methods:

  • Development and utilization of three controlled datasets varying in scale and property type (quantum mechanical, melting point, reaction data).
  • Comparison of property prediction models employing different molecular representations: 2D descriptors, 3D descriptors, and an end-to-end model (Uni-Mol).
  • Evaluation of aggregation approaches for multiple conformers, including data augmentation and mean aggregation.

Main Results:

  • Using all available conformers via data augmentation consistently yielded high prediction accuracy across datasets.
  • The Uni-Mol model, using atomic coordinates and ground-truth conformations, significantly outperformed traditional 2D and 3D descriptors.
  • Uni-Mol demonstrated high accuracy in predicting conformation-sensitive properties, though performance decreased with incorrect conformers.

Conclusions:

  • Multiple conformers, treated as data augmentation, enhance machine learning property prediction accuracy.
  • End-to-end models like Uni-Mol, leveraging 3D structural information, offer superior performance over traditional descriptors for molecular property prediction.
  • Careful consideration of conformational data is essential for developing robust and accurate predictive models in chemoinformatics.