Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Molecular Orbital Theory II03:51

Molecular Orbital Theory II

18.9K
Molecular Orbital Energy Diagrams
18.9K
Predicting Molecular Geometry02:27

Predicting Molecular Geometry

34.0K
VSEPR Theory for Determination of Electron Pair Geometries
34.0K
Ligand Binding Sites02:40

Ligand Binding Sites

12.7K
Proteins are dynamic macromolecules that carry out a wide variety of essential processes; however, the activities of most proteins depend on their interactions with other molecules or ions, known as ligands.
Protein-ligand interactions are quite specific; even though numerous potential ligands surround a cellular protein at any given time, only a particular ligand can bind to that protein. Moreover, a ligand binds only to a dedicated area on the surface of the protein, known as the...
12.7K
Structure of Benzene: Molecular Orbital Model01:18

Structure of Benzene: Molecular Orbital Model

8.7K
According to the molecular orbital (MO) model, benzene has a planar structure with a regular hexagon of six sp2 hybridized carbons. As shown in Figure 1, each carbon is bonded to three other atoms with C–C–C and H–C–C bond angles of 120°. The C–H bond length is 109 pm, and the C–C bond length is 139 pm which is midway between the single bond length of sp3 hybridized carbons (154 pm) and sp2 hybridized carbons (133 pm).
8.7K
Hybridization of Atomic Orbitals I03:24

Hybridization of Atomic Orbitals I

46.4K
The mathematical expression known as the wave function, ψ, contains information about each orbital and the wavelike properties of electrons in an isolated atom. When atoms are bound together in a molecule, the wave functions combine to produce new mathematical descriptions that have different shapes. This process of combining the wave functions for atomic orbitals is called hybridization and is mathematically accomplished by the linear combination of atomic orbitals. The new orbitals that...
46.4K
π Electron Effects on Chemical Shift: Overview01:27

π Electron Effects on Chemical Shift: Overview

1.0K
An applied magnetic field causes loosely bound π-electrons in organic molecules to circulate, producing a local or induced diamagnetic field over a large spatial volume. As the molecules tumble in solution, the field generated by π-electrons in spherical substituents results in a zero net field. However, the net field generated by π-electrons in non-spherical substituents is not zero. The effect of this induced field depends on the orientation of the molecule with respect to B0,...
1.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrating multimodal features with deep learning for protein solubility prediction.

Journal of cheminformatics·2026
Same author

Integrating Conformational Sampling and Siamese Learning to Predict Mutation-Induced Binding Affinity Changes in Abelson Tyrosine Kinase and Its Ligands.

Journal of chemical theory and computation·2026
Same author

TRACE-DDI: A Hybrid Framework of Transformer-GAT Context Encoder and Pathway-Anchored Knowledge Graphs for DDI Prediction.

Computational and structural biotechnology journal·2026
Same author

LABMA: Latent-bottleneck attention-based multimodal architecture for integration of transcriptomics, proteomics, and MRI in neurodegeneration.

Computers in biology and medicine·2026
Same author

Improved image reconstruction in coherent diffraction imaging using self-seeded XFEL pulses.

Journal of synchrotron radiation·2026
Same author

Encoding and Decoding of Brain Dynamic Functional Connectivity for ADHD Diagnosis.

IEEE journal of biomedical and health informatics·2026
Same journal

Multimodal feature fusion for molecular property classification.

Journal of cheminformatics·2026
Same journal

P2MAT: A machine learning (ML) driven software for Property Prediction of MATerial.

Journal of cheminformatics·2026
Same journal

Computational design of low-volatility lubricants for space using interpretable machine learning.

Journal of cheminformatics·2026
Same journal

OpenStats: how to combine statistics and research data management (RDM) to leverage efficient scientific data analysis by guided statistics.

Journal of cheminformatics·2026
Same journal

Unified heterogeneity-aware benchmark of drug synergy prediction: a cross-study analysis of traditional machine learning and graph deep learning models.

Journal of cheminformatics·2026
Same journal

Count your bits: fingerprint benchmarking to assess broad chemical space representation.

Journal of cheminformatics·2026
See all related articles

Related Experiment Video

Updated: May 29, 2025

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA
10:21

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Published on: February 23, 2024

2.3K

Positional embeddings and zero-shot learning using BERT for molecular-property prediction.

Medard Edmund Mswahili1, JunHa Hwang1, Jagath C Rajapakse2

  • 1Department of Computer Engineering, Chungbuk National University, Cheongju, 28644, South Korea.

Journal of Cheminformatics
|February 5, 2025
PubMed
Summary
This summary is machine-generated.

This study enhances molecular property prediction using transformer models by optimizing positional encodings (PEs) for chemical simplified molecular input line entry system (SMILES) data. The research demonstrates improved accuracy and generalization in predicting properties of unseen molecular representations.

Keywords:
BERTDeepSMILESMolecular-property predictionPositional embedding/encodingSMILESTransformersZero-shot learning

More Related Videos

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.3K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.5K

Related Experiment Videos

Last Updated: May 29, 2025

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA
10:21

Author Spotlight: Streamlining Protein Target Prediction and Validation via Molecular Docking and CETSA

Published on: February 23, 2024

2.3K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.3K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.5K

Area of Science:

  • Cheminformatics
  • Deep Learning
  • Natural Language Processing

Background:

  • Advancements in cheminformatics necessitate improved handling of chemical simplified molecular input line entry system (SMILES) data.
  • Transformer models, successful in natural language processing, show promise for molecular data analysis.
  • Positional encodings (PEs) are crucial for transformer models to capture sequential and contextual information in molecular representations.

Purpose of the Study:

  • To explore and optimize various positional encodings (PEs) within a transformer-based framework for enhanced molecular property prediction.
  • To investigate the impact of PEs on the accuracy and generalization of Bidirectional Encoder Representations from Transformer (BERT) models for chemical text analysis using SMILES and DeepSMILES.
  • To assess the zero-shot learning capabilities of BERT models with different PEs on diverse molecular datasets.

Main Methods:

  • Pretraining Bidirectional Encoder Representations from Transformer (BERT) models with various positional encodings (PEs) on SMILES strings.
  • Fine-tuning the best-performing BERT models on downstream molecular property prediction tasks.
  • Utilizing both SMILES and DeepSMILES representations for comprehensive evaluation across existing and newly proposed datasets.

Main Results:

  • Demonstrated improved accuracy and generalization in molecular property prediction through optimized PEs in BERT models.
  • Successfully applied BERT models with various PEs to diverse datasets, including COVID-19 and bioassay data.
  • Showcased the model's proficiency in zero-shot learning for predicting properties of unseen molecular representations.

Conclusions:

  • Positional encodings significantly enhance the performance of transformer models like BERT for molecular property prediction.
  • BERT models, when equipped with appropriate PEs, exhibit robustness and potential for broad applications in cheminformatics and bioinformatics.
  • The study highlights the effectiveness of PEs in capturing complex atomic relationships within SMILES strings for accurate chemical data analysis.