Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Molecular Models02:00

Molecular Models

43.6K
Physical models representing molecular architectures of chemical compounds play essential roles in understanding chemistry. The use of molecular models makes it easier to visualize the structures and shapes of atoms and molecules.
43.6K
Physiological Foundation of Stress01:24

Physiological Foundation of Stress

617
Stress triggers a coordinated physiological response involving the sympathetic nervous system (SNS) and the hypothalamic-pituitary-adrenal (HPA) axis. This dual activation ensures that the body is prepared for both immediate and prolonged stress management. The process begins with the perception of a stressor. This initial phase activates the SNS, leading to the rapid release of adrenaline (epinephrine) from the adrenal glands.
Role of the Sympathetic Nervous System
Adrenaline triggers the...
617
Social Foundations of Self II: The Generalized Other01:20

Social Foundations of Self II: The Generalized Other

245
According to George Herbert Mead, as children progress beyond the game stage, they develop a more comprehensive understanding of societal rules and norms. This cognitive and social development enables them to internalize the expectations of the broader community, refining their ability to regulate behavior.Consistent participation in organized activities is crucial in helping children recognize that their actions are not isolated but contribute to a more significant, interconnected group...
245
Theoretical Foundations of Nursing Practice01:30

Theoretical Foundations of Nursing Practice

17.3K
Theories play an essential role in organizing patient care. Theories refer to a proposed or followed belief, policy, or procedure that is the basis for action. Nursing theories are knowledge-based concepts that guide nurses' actions, influence nursing education and practice, and allow nurses to care for their patients.
Theories provide a perspective to assess patients' conditions and organize data and methods. They also assist in analyzing and interpreting information. They represent a...
17.3K
Social Foundations of Self I: Play and Game01:24

Social Foundations of Self I: Play and Game

197
The development of self in children is deeply rooted in social interactions, mainly through stages of play and structured games. These stages, outlined by sociologist George Herbert Mead, illustrate how children progressively learn to understand and adopt social roles, forming a cohesive sense of self.The Play Stage: Imitation and Simple Role-TakingIn the early years of childhood, the play stage is characterized by imitative behavior, where children engage in role-playing based on familiar...
197
Social Foundations of Self III: Self-Evaluation01:30

Social Foundations of Self III: Self-Evaluation

186
Self-evaluation is the process by which individuals assess their abilities, behaviors, and characteristics based on feedback from others. Charles H. Cooley observed that a person’s self-perception is primarily influenced by how others see and judge them. He suggested that individuals form their identities based on their interpretations of others' reactions. As a result, social interactions play a crucial role in shaping self-esteem and personal identity. These external evaluations often...
186

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

CLOUD: A Scalable and Physics-Informed Foundation Model for Crystal Representation Learning.

Nature communications·2026
Same author

Excess density as a descriptor for electrolyte solvent design.

The Journal of chemical physics·2025
Same author

Differentiable modeling and optimization of non-aqueous Li-based battery electrolyte solutions using geometric deep learning.

Nature communications·2024
Same author

Electrowinning for Room-Temperature Ironmaking: Mapping the Electrochemical Aqueous Iron Interface.

The journal of physical chemistry. C, Nanomaterials and interfaces·2024
Same author

Modeling Scanning Electrochemical Cell Microscopy (SECCM) in Twisted Bilayer Graphene.

The journal of physical chemistry letters·2024
Same author

Twisto-Electrochemical Activity Volcanoes in Trilayer Graphene.

Journal of the American Chemical Society·2024
Same journal

PFASGroups: An Open-Source Framework for Automated Identification, Structural Classification, and Prioritization of Per- and Polyfluoroalkyl Substances.

Journal of chemical information and modeling·2026
Same journal

DeepKbhb: Context-Aware Prediction of Human Lysine β-Hydroxybutyrylation Sites.

Journal of chemical information and modeling·2026
Same journal

HyperDC: A Non-Uniform Hypergraph Framework for Dual- and Higher-Order Drug Combination Recommendation Across Diverse Complex Diseases.

Journal of chemical information and modeling·2026
Same journal

Correction to "AstraMEV (AI-Guided Structural Assembly of Multi-Epitope Vaccines) Against Infectious Bronchitis Virus".

Journal of chemical information and modeling·2026
Same journal

MolPy: A Large Language Model-Friendly Toolkit for Reactive Topology Editing in Polymer Simulations.

Journal of chemical information and modeling·2026
Same journal

Molecular Mechanisms of KIT Receptor Dimerization and Oncogenic Activation Revealed by Multiscale Simulations.

Journal of chemical information and modeling·2026
See all related articles

Related Experiment Video

Updated: Jan 25, 2026

Interactive Molecular Model Assembly with 3D Printing
06:15

Interactive Molecular Model Assembly with 3D Printing

Published on: August 13, 2020

10.9K

Tokenization for Molecular Foundation Models.

Alexius Wadell1,2, Anoushka Bhutani1,2, Venkatasubramanian Viswanathan1,2

  • 1Department of Mechanical Engineering, University of Michigan, Ann Arbor, Michigan 48109, United States.

Journal of Chemical Information and Modeling
|January 23, 2026
PubMed
Summary
This summary is machine-generated.

Molecular foundation models advance science, but limited tokenizers hinder progress. New Smirk and Smirk-GPE tokenizers offer full coverage, enabling broader applications in chemistry and beyond.

More Related Videos

A Novel Surgical Technique As a Foundation for In Vivo Partial Liver Engineering in Rat
13:27

A Novel Surgical Technique As a Foundation for In Vivo Partial Liver Engineering in Rat

Published on: October 6, 2018

8.6K
Modeling an Enzyme Active Site using Molecular Visualization Freeware
14:37

Modeling an Enzyme Active Site using Molecular Visualization Freeware

Published on: December 25, 2021

11.4K

Related Experiment Videos

Last Updated: Jan 25, 2026

Interactive Molecular Model Assembly with 3D Printing
06:15

Interactive Molecular Model Assembly with 3D Printing

Published on: August 13, 2020

10.9K
A Novel Surgical Technique As a Foundation for In Vivo Partial Liver Engineering in Rat
13:27

A Novel Surgical Technique As a Foundation for In Vivo Partial Liver Engineering in Rat

Published on: October 6, 2018

8.6K
Modeling an Enzyme Active Site using Molecular Visualization Freeware
14:37

Modeling an Enzyme Active Site using Molecular Visualization Freeware

Published on: December 25, 2021

11.4K

Area of Science:

  • Cheminformatics and computational chemistry
  • Artificial intelligence in scientific discovery
  • Molecular representation and modeling

Background:

  • Text-based foundation models are crucial for scientific discovery, particularly in molecular design.
  • Existing molecular foundation models are limited by closed-vocabulary tokenizers, restricting their ability to represent the full molecular space.
  • A systematic evaluation of 35 tokenizers revealed significant gaps in SMILES representation coverage.

Purpose of the Study:

  • To systematically evaluate the coverage of existing molecular tokenizers.
  • To assess the impact of tokenizer choice on molecular property prediction.
  • To develop novel tokenizers with comprehensive coverage of molecular representations.

Main Methods:

  • Evaluated 35 tokenizers, including 20 chemistry-specific ones, for SMILES representation coverage.
  • Introduced n-gram language models as a proxy to assess tokenizer impact.
  • Pretrained and finetuned 18 RoBERTa-style encoders for molecular property prediction.
  • Developed two new tokenizers, Smirk and Smirk-GPE, with full OpenSMILES coverage.

Main Results:

  • Identified significant gaps in the coverage of existing tokenizers for SMILES.
  • Demonstrated the effectiveness of n-gram language models as a proxy for tokenizer evaluation.
  • Showcased the capability of pretrained RoBERTa-style encoders for molecular property prediction.
  • Introduced Smirk and Smirk-GPE tokenizers that integrate nuclear, electronic, and geometric degrees of freedom.

Conclusions:

  • Existing tokenizers inadequately cover the molecular space, necessitating open-vocabulary approaches.
  • The proposed Smirk and Smirk-GPE tokenizers provide full OpenSMILES coverage and enable broader applications.
  • Highlights the need for chemically diverse benchmarks and open-vocabulary modeling in cheminformatics.