Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.7K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.7K
Protein Organization01:24

Protein Organization

6.2K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
6.2K
Ligand Binding Sites02:40

Ligand Binding Sites

12.6K
Proteins are dynamic macromolecules that carry out a wide variety of essential processes; however, the activities of most proteins depend on their interactions with other molecules or ions, known as ligands.
Protein-ligand interactions are quite specific; even though numerous potential ligands surround a cellular protein at any given time, only a particular ligand can bind to that protein. Moreover, a ligand binds only to a dedicated area on the surface of the protein, known as the...
12.6K
From DNA to Protein03:06

From DNA to Protein

17.9K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
17.9K
Protein-protein Interfaces02:04

Protein-protein Interfaces

12.4K
Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a...
12.4K
Conserved Binding Sites01:49

Conserved Binding Sites

4.1K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Making invisible excited-state structures of pro-interleukin-18 visible by combining NMR and machine learning.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Scalable emulation of protein equilibrium ensembles with generative deep learning.

Science (New York, N.Y.)·2025
Same author

FlowPacker: protein side-chain packing with torsional flow matching.

Bioinformatics (Oxford, England)·2025
Same author

A High-Throughput Method for Screening Peptide Activators of G-Protein-Coupled Receptors.

ACS omega·2024
Same author

EuDockScore: Euclidean graph neural networks for scoring protein-protein interfaces.

Bioinformatics (Oxford, England)·2024
Same author

Antibody-SGM, a Score-Based Generative Model for Antibody Heavy-Chain Design.

Journal of chemical information and modeling·2024
Same journal

Tomogram exploration through template matching and deep learning.

Current opinion in structural biology·2026
Same journal

A comparative review of cryo-electron ptychography: Biological applications and future perspectives.

Current opinion in structural biology·2026
Same journal

Metabolic disruptions through a three-dimensional genomic lens.

Current opinion in structural biology·2026
Same journal

Collective variable design for biomolecular conformational dynamics.

Current opinion in structural biology·2026
Same journal

Polymer scaling in protein crowding: From dilute coils to semidilute meshes.

Current opinion in structural biology·2026
Same journal

Tuning the physicochemical properties of rationally designed protein-based biomolecular condensates.

Current opinion in structural biology·2026
See all related articles

Related Experiment Video

Updated: May 23, 2025

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K

Language models for protein design.

Jin Sub Lee1, Osama Abdin1, Philip M Kim2

  • 1Department of Molecular Genetics, University of Toronto, Toronto, M5S 1A8, Canada.

Current Opinion in Structural Biology
|March 7, 2025
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) are revolutionizing protein design by treating amino acid sequences like language. This review explores LLM applications in protein design, covering foundations, advances, and future research directions.

More Related Videos

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.1K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K

Related Experiment Videos

Last Updated: May 23, 2025

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.1K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Artificial Intelligence

Background:

  • Large language models (LLMs) demonstrate advanced capabilities in understanding and generating human language.
  • Proteins, composed of amino acid sequences, can be analogously modeled using language principles.
  • LLMs offer a powerful framework for learning representations of protein sequences.

Purpose of the Study:

  • To review the applications of language modeling in protein design.
  • To discuss the foundational concepts of protein language modeling.
  • To highlight recent advancements and future research directions in this field.

Main Methods:

  • Reviewing existing literature on protein language models.
  • Analyzing applications in protein structure prediction, design, and property prediction.
  • Discussing context-conditioned design and structure integration techniques.

Main Results:

  • Language modeling principles are effectively applied to learn informative protein representations.
  • Recent advances include context-conditioned design and the integration of structural information.
  • LLMs show significant potential for advancing protein design capabilities.

Conclusions:

  • Protein language modeling is a rapidly evolving field with substantial implications for protein design.
  • Further research is needed to address current shortcomings and unlock the full potential of LLMs in this domain.
  • Improved protein language models will facilitate the development of novel proteins with desired functions.