Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Molecular Models02:00

Molecular Models

38.2K
Physical models representing molecular architectures of chemical compounds play essential roles in understanding chemistry. The use of molecular models makes it easier to visualize the structures and shapes of atoms and molecules.
38.2K
Molecular Orbital Theory II03:51

Molecular Orbital Theory II

19.1K
Molecular Orbital Energy Diagrams
19.1K
Predicting Molecular Geometry02:27

Predicting Molecular Geometry

34.2K
VSEPR Theory for Determination of Electron Pair Geometries
34.2K
Molecular Shapes01:18

Molecular Shapes

56.8K
Molecules have characteristic shapes that are crucial for their function. The arrangement of various electron groups around the central atom dictates their molecular geometry. Electron pairs in the valence shell of a central atom will adopt an arrangement that minimizes repulsions between the electron pairs by maximizing the distance between them. The valence electrons form either bonding pairs, located primarily between bonded atoms, or lone pairs.
Two regions of electron density in a diatomic...
56.8K
Polymers: Defining Molecular Weight01:01

Polymers: Defining Molecular Weight

2.8K
Unlike small molecules with definite molecular weights, polymers are a mixture of individual polymer chains of varying lengths, each with a unique molecular weight.  So, the molecular weight of a polymer is expressed as an average value based on the average size of the polymer chains. The two most common forms of averages used for polymers are the number average molecular weight and weight average molecular weight.
The number average molecular weight (Mn) is the summation of the number...
2.8K
Classification of Elements and Compounds02:54

Classification of Elements and Compounds

66.5K
Pure substances consist of only one type of matter. A pure substance can be an element or a compound. An element consists of only one type of atom, while a compound consists of two or more types of atoms held together by a chemical bond. Elements are classified as atomic or molecular based on the nature of their basic units.
Compounds are pure substances composed of two or more elements in fixed, definite proportions. Compounds are classified as ionic or molecular (covalent) based on the bonds...
66.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Morphology-controlled copper nanostructures: synthesis, anti-oxidation strategy, and applications.

Nanoscale·2026
Same author

The pangenome of <i>Acanthopagrus</i> provides genomic variation evidence for genetic diversity.

iScience·2026
Same author

Underlying Framework of All-optical Controlled Synaptic Devices for Neuromorphic Computing.

Nano-micro letters·2026
Same author

Graded BMP signals modulate yellow and red color in fishes, impacting adult pigment patterns and conspecific shoaling behavior.

Current biology : CB·2026
Same author

A survey of contrastive learning methods in molecular representation.

Briefings in bioinformatics·2026
Same author

Regulation Engineering of Alkali Metal Interlayer Pillar in P2-Type Cathode for Ultra-High Rate and Long-Term Cycling Sodium-Ion Batteries.

Nano-micro letters·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 22, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

543

Can large language models understand molecules?

Shaghayegh Sadeghi1, Alan Bui2, Ali Forooghi2

  • 1School of Computer Science, Univeristy of Windsor, Sunset Ave, Windsor, ON, N9B 3P4, Canada. sadeghi3@uwindsor.ca.

BMC Bioinformatics
|June 26, 2024
PubMed
Summary
This summary is machine-generated.

Large Language Models (LLMs) like LLaMA show strong performance in generating molecular embeddings from Simplified Molecular Input Line Entry System (SMILES) strings. LLaMA-based embeddings outperform GPT and excel in drug-drug interaction prediction tasks.

Keywords:
GPTLLaMALarge language modelsSMILES embedding

More Related Videos

Modeling an Enzyme Active Site using Molecular Visualization Freeware
14:37

Modeling an Enzyme Active Site using Molecular Visualization Freeware

Published on: December 25, 2021

9.8K
Author Spotlight: Exploring Cellular Processes by Modeling Ligands in Cryo-EM Maps
09:30

Author Spotlight: Exploring Cellular Processes by Modeling Ligands in Cryo-EM Maps

Published on: July 19, 2024

1.3K

Related Experiment Videos

Last Updated: Jun 22, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

543
Modeling an Enzyme Active Site using Molecular Visualization Freeware
14:37

Modeling an Enzyme Active Site using Molecular Visualization Freeware

Published on: December 25, 2021

9.8K
Author Spotlight: Exploring Cellular Processes by Modeling Ligands in Cryo-EM Maps
09:30

Author Spotlight: Exploring Cellular Processes by Modeling Ligands in Cryo-EM Maps

Published on: July 19, 2024

1.3K

Area of Science:

  • Cheminformatics
  • Artificial Intelligence
  • Computational Chemistry

Background:

  • Large Language Models (LLMs) are emerging as powerful tools in cheminformatics.
  • Simplified Molecular Input Line Entry System (SMILES) is a crucial format for representing chemical structures.
  • LLMs can decode SMILES strings into meaningful vector representations for downstream tasks.

Purpose of the Study:

  • To evaluate the performance of Generative Pre-trained Transformer (GPT) and Large Language Model Meta AI (LLaMA) in generating SMILES embeddings.
  • To compare LLM-based embeddings against traditional pre-trained models on SMILES.
  • To assess the utility of these embeddings in molecular property and drug-drug interaction (DDI) prediction.

Main Methods:

  • Investigated the performance of GPT and LLaMA models.
  • Generated SMILES embeddings using both LLMs.
  • Evaluated embeddings on molecular property prediction and DDI prediction tasks.
  • Compared LLM-based embeddings with pre-trained SMILES models.

Main Results:

  • LLaMA-generated SMILES embeddings outperformed GPT-generated embeddings in both molecular property and DDI prediction.
  • LLaMA embeddings achieved performance comparable to pre-trained models in molecular prediction.
  • LLaMA embeddings surpassed pre-trained models in DDI prediction tasks.

Conclusions:

  • LLMs demonstrate significant potential for generating effective molecular embeddings.
  • LLaMA shows particular promise for molecular representation tasks.
  • Further research into LLMs for molecular embedding is warranted to bridge the gap between AI and cheminformatics.