Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

¹H NMR: Pople Notation01:09

¹H NMR: Pople Notation

1.6K
The Pople nomenclature system classifies spin systems based on the difference between their chemical shifts. Coupled spins are denoted by capital letters with subscripts indicating the number of equivalent nuclei. When the coupled nuclei have well-separated chemical shifts, they are assigned letters that are far apart in the alphabet, such as A and X. When the difference in chemical shifts is small, coupled nuclei are named using adjacent letters of the alphabet (AB, MN, or XY).
A proton...
1.6K
IUPAC Nomenclature of Carboxylic Acids01:16

IUPAC Nomenclature of Carboxylic Acids

8.7K
IUPAC names of carboxylic acids are systematically derived following a few rules discussed below.
For acyclic saturated monocarboxylic acids, the longest hydrocarbon chain containing the –COOH carbon is identified as the parent chain. Then, the last -e of the parent hydrocarbon name is replaced with a suffix -oic acid.
8.7K
Naming Enantiomers02:21

Naming Enantiomers

19.5K
The naming of enantiomers employs the Cahn–Ingold–Prelog rules that involve assigning priorities to different substituent groups at a chiral center. Each enantiomer, being a distinct molecule, is assigned a unique name by the Cahn–Ingold–Prelog (CIP) rules, also called the R–S system. The prefix R- or S- attached to the chiral centers in an enantiomer is dependent on the spatial arrangement of the four substituents on the chiral center. The R–S system...
19.5K
Expressing Solution Concentration02:48

Expressing Solution Concentration

57.8K
A solute is a component of a solution that is typically present at a much lower concentration than the solvent. Solute concentrations are often described with qualitative terms such as dilute (of relatively low concentration) and concentrated (of relatively high concentration).
Concentrations may be quantitatively assessed using a wide variety of measurement units, each convenient for particular applications. Molarity (M) is a useful concentration unit for many applications in chemistry.
57.8K
Molecular Compounds: Formulas and Nomenclature03:10

Molecular Compounds: Formulas and Nomenclature

42.3K
Molecular compounds or covalent compounds result when atoms share electrons to form covalent bonds. Since there is no electron transfer, molecular compounds do not contain ions; instead, they consist of discrete, neutral molecules. 
42.3K
Newman Projections02:06

Newman Projections

16.0K
Different notations are used to represent the three-dimensional structure of molecules on two-dimensional surfaces. One of the most commonly used representations is the dash-wedge formula. The dashed wedges, solid wedges, and the plane lines indicate the groups situated behind the plane, coming out of the plane, and in the plane, respectively.
The organic molecules rotate across the single bonds leading to numerous temporary three-dimensional structures of varying energy known as...
16.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cheminformatics Microservice V3: a web portal for chemical structure manipulation and analysis.

Journal of cheminformatics·2025
Same author

Analysis of metabolomics and transcriptomics data to assess interactions in microalgal co-culture of Skeletonema marinoi and Prymnesium parvum.

PloS one·2025
Same author

PySSA for Windows: End-User Protein Structure Prediction and Visual Analysis with ColabFold and PyMOL.

Journal of chemical information and modeling·2025
Same author

The evolution of open science in cheminformatics: a journey from closed systems to collaborative innovation.

Journal of cheminformatics·2025
Same author

COCONUT 2.0: a comprehensive overhaul and curation of the collection of open natural products database.

Nucleic acids research·2024
Same author

An automated calculation pipeline for differential pair interaction energies with molecular force fields using the Tinker Molecular Modeling Package.

Journal of cheminformatics·2024
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
Same journal

Sampling out-of-distribution chemical spaces via Bayesian flow.

Journal of cheminformatics·2026
Same journal

Hold on tight: the kinetic profiling of opioid receptor ligands using the CORAL-MD.

Journal of cheminformatics·2026
Same journal

Transformer-accelerated discovery of inhibitors targeting the RpsA<sub>Δ438</sub> deletion in PZA-resistant tuberculosis.

Journal of cheminformatics·2026
Same journal

DICL: a manually curated database of ion channels and ligands as a useful platform for drug discovery targeting ion channels.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: May 8, 2025

Utilization of Stop-flow Micro-tubing Reactors for the Development of Organic Transformations
13:09

Utilization of Stop-flow Micro-tubing Reactors for the Development of Organic Transformations

Published on: January 4, 2018

39.0K

STOUT V2.0: SMILES to IUPAC name conversion using transformer models.

Kohulan Rajan1, Achim Zielesny2, Christoph Steinbeck3

  • 1Institute for Inorganic and Analytical Chemistry, Friedrich Schiller University Jena, Lessingstr. 8, 07743, Jena, Germany.

Journal of Cheminformatics
|December 28, 2024
PubMed
Summary
This summary is machine-generated.

STOUT V2, a transformer-based model, accurately translates chemical structures from SMILES notation to IUPAC names. This tool simplifies chemical nomenclature, offering a powerful, open-source solution for chemists.

Keywords:
Chemical name translationDeep learningSMILES to IUPAC nameSTOUTTransformers

More Related Videos

Versatile CO2 Transformations into Complex Products: A One-pot Two-step Strategy
07:36

Versatile CO2 Transformations into Complex Products: A One-pot Two-step Strategy

Published on: November 9, 2019

7.9K
Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.1K

Related Experiment Videos

Last Updated: May 8, 2025

Utilization of Stop-flow Micro-tubing Reactors for the Development of Organic Transformations
13:09

Utilization of Stop-flow Micro-tubing Reactors for the Development of Organic Transformations

Published on: January 4, 2018

39.0K
Versatile CO2 Transformations into Complex Products: A One-pot Two-step Strategy
07:36

Versatile CO2 Transformations into Complex Products: A One-pot Two-step Strategy

Published on: November 9, 2019

7.9K
Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.1K

Area of Science:

  • Computational Chemistry
  • Cheminformatics
  • Artificial Intelligence in Chemistry

Background:

  • Systematic chemical naming follows International Union of Pure and Applied Chemistry (IUPAC) rules, which are complex and challenging to apply consistently.
  • Accurate translation from Simplified Molecular Input Line Entry System (SMILES) strings to IUPAC names is vital for streamlining chemical structure naming.

Purpose of the Study:

  • To present STOUT (SMILES-TO-IUPAC-name translator) V2, a novel transformer-based model for translating SMILES notation to IUPAC names.
  • To demonstrate the potential of neural networks in chemical nomenclature, complementing existing deterministic algorithms.

Main Methods:

  • Development of a transformer-based neural network model (STOUT V2).
  • Training the model on a large dataset of approximately 1 billion SMILES strings and their corresponding IUPAC names.
  • Leveraging OpenEye's Lexichem software for access to chemical naming functionalities.

Main Results:

  • STOUT V2 achieves exceptional accuracy in generating IUPAC names, even for complex chemical structures.
  • The model effectively captures intricate patterns and relationships within chemical structures for precise naming.
  • A web application was developed to enhance accessibility, and the model/source code are open-source.

Conclusions:

  • Transformer-based neural approaches, like STOUT V2, show significant potential for chemical nomenclature.
  • STOUT V2 offers a powerful, accurate, and accessible tool to aid chemists in systematic chemical naming.
  • The open-source nature of STOUT V2 encourages further research and development in computational chemistry.