Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Predicting Molecular Geometry

Predicting Molecular Geometry

VSEPR Theory for Determination of Electron Pair Geometries

Molecular Models

Molecular Models

Physical models representing molecular architectures of chemical compounds play essential roles in understanding chemistry. The use of molecular models makes it easier to visualize the structures and shapes of atoms and molecules.

Predicting Reaction Outcomes

Predicting Reaction Outcomes

Kinetics describes the rate and path by which a reaction occurs. In contrast, thermodynamics deals with state functions and describes the properties, behavior, and components of a system. It is not concerned with the path taken by the process and cannot address the rate at which a reaction occurs. Although it does provide information about what can happen during a reaction process, it does not describe the detailed steps of what appears on an atomic or a molecular level. On the other hand,...

Ligand Binding and Linkage

Ligand Binding and Linkage

Allosteric proteins have more than one ligand binding site; the binding of a ligand to any of these sites influences the binding of ligands to the other sites. When a protein is allosteric, its binding sites are called coupled or linked. In the case of enzymes, the site that binds to the substrate is known as the active site and the other site is known as the regulatory site. When a ligand binds to the regulatory site, this leads to conformational changes in the protein that can influence...

Ligand Binding and Linkage

Ligand Binding and Linkage

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Local nonequilibrium thermodynamics of polymer collapse dynamics.

The Journal of chemical physics·2025

Same author

Hybrid coarse-grained and all-atom molecular dynamics simulation studies of binary biological lipid membranes containing chlorosulfolipids.

The Journal of chemical physics·2025

Same author

Pressure-Dependent Shape and Edge Configurations of MoS<sub>2</sub> by Kinetic Monte Carlo Simulation.

ACS nano·2024

Same author

Inhibition mechanism of testis-expressed gene 14 (TEX14) in cytokinetic abscission: Well-tempered metadynamics simulation studies.

The Journal of chemical physics·2023

Same author

Structure and stability of polydiacetylene membrane systems: Molecular dynamics simulation studies.

Journal of computational chemistry·2022

Same author

Chlorosulfolipid (Danicalipin A) Membrane Structure: Hybrid Molecular Dynamics Simulation Studies.

The journal of physical chemistry letters·2021

Same journal

The Anionic States of Ubiquinone Characterized by Second-Order Approximate Coupled-Cluster Theory.

Journal of computational chemistry·2026

Same journal

Hydrogen Bond Energy Estimation in Large Molecular Clusters via the Method of Synergistic Cyclic Cooperativity: A Software Update H-BEE 2.0.

Journal of computational chemistry·2026

Same journal

The Intricate Mechanism of Nitric Oxide Synthase.

Journal of computational chemistry·2026

Same journal

A Molecular "Thermometer" for Measuring Effective Non-Local Exchange.

Journal of computational chemistry·2026

Same journal

Insights to Orientation Dependence of Molecular Conduction Modeled by High-Level Quantum Embedding.

Journal of computational chemistry·2026

Same journal

AutoSTOP-RT-TDDFT: Adaptive and Selected Real-Time Time-Dependent Density Functional Theory for Simulation of X-Ray Absorptions.

Journal of computational chemistry·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evaluating In-Context Learning in Large Language Models for Molecular Property Regression.

Chan Young Joe¹, Kyungwoo Song^2,3, Rakwoo Chang¹

¹Department of Applied Chemistry, University of Seoul, Seoul, Republic of Korea.

Journal of Computational Chemistry

|January 15, 2026

Summary

This summary is machine-generated.

Large language models (LLMs) show promise but struggle with genuine in-context learning for scientific regression tasks. Machine learning models offer greater robustness in molecular property prediction, especially under challenging conditions.

Keywords:

SMILES representation functional out‐of‐distribution in‐context learning large language models molecular property prediction shortcut learning structure–activity landscape index

More Related Videos

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Published on: September 26, 2025

Related Experiment Videos

Last Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Pharmacophore Modeling for Targets with Extensive Ligand Libraries: A Case Study on SARS-CoV-2 Mpro

Published on: September 26, 2025

Area of Science:

Artificial Intelligence
Computational Chemistry
Machine Learning

Background:

Large language models (LLMs) excel at natural language tasks.
Their capability for in-context learning (ICL) in scientific regression is not well understood.
Assessing LLM performance in scientific domains requires specialized evaluation frameworks.

Purpose of the Study:

To systematically evaluate the in-context learning abilities of seven large language models (LLMs) in scientific regression tasks.
To investigate LLM performance on molecular property prediction under controlled conditions designed to isolate shortcut learning and induce out-of-distribution (OOD) behavior.
To compare LLM performance against traditional machine learning (ML) baselines.

Main Methods:

A controlled framework of 56 transformed tasks was used to assess seven LLMs on molecular property prediction.
Tasks were designed to isolate shortcut learning and induce functional out-of-distribution (OOD) behavior.
Performance was evaluated by comparing LLM results with machine learning (ML) baselines.

Main Results:

LLMs achieved near-perfect performance on raw molecular weight prediction, likely due to shortcut cues.
LLM performance significantly deteriorated under nonlinear transformations of the data.
Machine learning (ML) baselines demonstrated greater robustness, leading to a performance crossover where ML outperformed LLMs.
Meta-analysis identified distributional descriptors and structure-activity landscape indices (SALI) as predictors of task favorability.

Conclusions:

LLMs' in-context learning for scientific regression is limited and susceptible to shortcut learning.
Machine learning models offer more robust and reliable performance for molecular property prediction, particularly in out-of-distribution scenarios.
Distributional descriptors and SALI can guide the selection of appropriate AI/ML approaches for chemistry applications.