Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Predicting Molecular Geometry02:27

Predicting Molecular Geometry

45.2K
VSEPR Theory for Determination of Electron Pair Geometries
45.2K
Molecular Models02:00

Molecular Models

43.5K
Physical models representing molecular architectures of chemical compounds play essential roles in understanding chemistry. The use of molecular models makes it easier to visualize the structures and shapes of atoms and molecules.
43.5K
Predicting Reaction Outcomes02:24

Predicting Reaction Outcomes

10.1K
Kinetics describes the rate and path by which a reaction occurs. In contrast, thermodynamics deals with state functions and describes the properties, behavior, and components of a system. It is not concerned with the path taken by the process and cannot address the rate at which a reaction occurs. Although it does provide information about what can happen during a reaction process, it does not describe the detailed steps of what appears on an atomic or a molecular level. On the other hand,...
10.1K
Ligand Binding and Linkage00:49

Ligand Binding and Linkage

5.5K
Allosteric proteins have more than one ligand binding site; the binding of a ligand to any of these sites influences the binding of ligands to the other sites. When a protein is allosteric, its binding sites are called coupled or linked.  In the case of enzymes, the site that binds to the substrate is known as the active site and the other site is known as the regulatory site. When a ligand binds to the regulatory site, this leads to conformational changes in the protein that can influence...
5.5K
Ligand Binding and Linkage00:49

Ligand Binding and Linkage

4.0K
4.0K
Improving Translational Accuracy02:07

Improving Translational Accuracy

14.1K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Local nonequilibrium thermodynamics of polymer collapse dynamics.

The Journal of chemical physics·2025
Same author

Hybrid coarse-grained and all-atom molecular dynamics simulation studies of binary biological lipid membranes containing chlorosulfolipids.

The Journal of chemical physics·2025
Same author

Pressure-Dependent Shape and Edge Configurations of MoS<sub>2</sub> by Kinetic Monte Carlo Simulation.

ACS nano·2024
Same author

Inhibition mechanism of testis-expressed gene 14 (TEX14) in cytokinetic abscission: Well-tempered metadynamics simulation studies.

The Journal of chemical physics·2023
Same author

Structure and stability of polydiacetylene membrane systems: Molecular dynamics simulation studies.

Journal of computational chemistry·2022
Same author

Chlorosulfolipid (Danicalipin A) Membrane Structure: Hybrid Molecular Dynamics Simulation Studies.

The journal of physical chemistry letters·2021
Same journal

The Anionic States of Ubiquinone Characterized by Second-Order Approximate Coupled-Cluster Theory.

Journal of computational chemistry·2026
Same journal

Hydrogen Bond Energy Estimation in Large Molecular Clusters via the Method of Synergistic Cyclic Cooperativity: A Software Update H-BEE 2.0.

Journal of computational chemistry·2026
Same journal

The Intricate Mechanism of Nitric Oxide Synthase.

Journal of computational chemistry·2026
Same journal

A Molecular "Thermometer" for Measuring Effective Non-Local Exchange.

Journal of computational chemistry·2026
Same journal

Insights to Orientation Dependence of Molecular Conduction Modeled by High-Level Quantum Embedding.

Journal of computational chemistry·2026
Same journal

AutoSTOP-RT-TDDFT: Adaptive and Selected Real-Time Time-Dependent Density Functional Theory for Simulation of X-Ray Absorptions.

Journal of computational chemistry·2026
See all related articles

Related Experiment Video

Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K

Evaluating In-Context Learning in Large Language Models for Molecular Property Regression.

Chan Young Joe1, Kyungwoo Song2,3, Rakwoo Chang1

  • 1Department of Applied Chemistry, University of Seoul, Seoul, Republic of Korea.

Journal of Computational Chemistry
|January 15, 2026
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) show promise but struggle with genuine in-context learning for scientific regression tasks. Machine learning models offer greater robustness in molecular property prediction, especially under challenging conditions.

Keywords:
SMILES representationfunctional out‐of‐distributionin‐context learninglarge language modelsmolecular property predictionshortcut learningstructure–activity landscape index

More Related Videos

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro
05:50

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Published on: September 26, 2025

1.4K

Related Experiment Videos

Last Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K
Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro
05:50

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Published on: September 26, 2025

1.4K

Area of Science:

  • Artificial Intelligence
  • Computational Chemistry
  • Machine Learning

Background:

  • Large language models (LLMs) excel at natural language tasks.
  • Their capability for in-context learning (ICL) in scientific regression is not well understood.
  • Assessing LLM performance in scientific domains requires specialized evaluation frameworks.

Purpose of the Study:

  • To systematically evaluate the in-context learning abilities of seven large language models (LLMs) in scientific regression tasks.
  • To investigate LLM performance on molecular property prediction under controlled conditions designed to isolate shortcut learning and induce out-of-distribution (OOD) behavior.
  • To compare LLM performance against traditional machine learning (ML) baselines.

Main Methods:

  • A controlled framework of 56 transformed tasks was used to assess seven LLMs on molecular property prediction.
  • Tasks were designed to isolate shortcut learning and induce functional out-of-distribution (OOD) behavior.
  • Performance was evaluated by comparing LLM results with machine learning (ML) baselines.

Main Results:

  • LLMs achieved near-perfect performance on raw molecular weight prediction, likely due to shortcut cues.
  • LLM performance significantly deteriorated under nonlinear transformations of the data.
  • Machine learning (ML) baselines demonstrated greater robustness, leading to a performance crossover where ML outperformed LLMs.
  • Meta-analysis identified distributional descriptors and structure-activity landscape indices (SALI) as predictors of task favorability.

Conclusions:

  • LLMs' in-context learning for scientific regression is limited and susceptible to shortcut learning.
  • Machine learning models offer more robust and reliable performance for molecular property prediction, particularly in out-of-distribution scenarios.
  • Distributional descriptors and SALI can guide the selection of appropriate AI/ML approaches for chemistry applications.