Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Organization01:24

Protein Organization

6.2K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
6.2K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.7K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.7K
Conserved Binding Sites01:49

Conserved Binding Sites

4.1K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.1K
Structural Protein Function01:56

Structural Protein Function

27.3K
Structural proteins are a category of proteins responsible for functions ranging from cell shape and movement to providing support to major structures such as bones, cartilage, hair, and muscles. This group includes proteins such as collagen, actin, myosin, and keratin.
Collagen, the most abundant protein in mammals, is found throughout the body. In connective tissue, such as skin, ligaments, and tendons, it provides tensile strength and elasticity.  In bones and teeth, it mineralizes to...
27.3K
Proteins: From Genes to Degradation02:11

Proteins: From Genes to Degradation

11.9K
Within a biological system, the DNA encodes the RNA, and the nucleotide sequence in the RNA further defines the amino acid sequence in the protein. This is referred to as “The Central Dogma of Molecular Biology” - a term coined by Francis Crick.  Central dogma is a firm principle in biology that defines the flow of genetic information within any life form. The two fundamental steps in central dogma are - transcription and translation.
Transcription is the synthesis of RNA...
11.9K
Protein Families02:47

Protein Families

15.2K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
15.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ProtAff: Protein Binding Affinity Prediction via LoRA-Finetuned ESM-2.

bioRxiv : the preprint server for biology·2026
Same author

Predictions from deep learning propose substantial protein-carbohydrate interplay.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Customizing CRISPR-Cas PAM specificity with protein language models.

Nature biotechnology·2026
Same author

Can We Extract Physics-like Energies from Generative Protein Diffusion Models?

bioRxiv : the preprint server for biology·2025
Same author

Adapting Co-Folding Models for Structure-Based Protein-Protein Docking Through Flow Matching.

bioRxiv : the preprint server for biology·2025
Same author

De novo design of metal-oxide templating proteins.

bioRxiv : the preprint server for biology·2025
Same journal

Glycoform engineering of a mammalian platform to sculpt a humanized recombinant bioscavenger.

Cell systems·2026
Same journal

Targeted genomic editing of human gut Bacteroides species based on CRISPR-associated transposases.

Cell systems·2026
Same journal

Scalable enumeration and sampling of minimal metabolic pathways for organisms and communities.

Cell systems·2026
Same journal

Deciphering protein mutation-phenotype linkages from CRISPR-based tiling mutagenesis screens.

Cell systems·2026
Same journal

High-throughput machine learning-aided antibody discovery for cell surface antigens.

Cell systems·2026
Same journal

Quantitative cytokine profiling of primary human macrophages reveals distinct single-cell modes of trained immunity.

Cell systems·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K

How well do contextual protein encodings learn structure, function, and evolutionary context?

Sai Pooja Mahajan1, Fátima A Dávila-Hernández1, Jeffrey A Ruffolo2

  • 1Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.

Cell Systems
|March 5, 2025
PubMed
Summary
This summary is machine-generated.

This study uses AI models to understand protein sequences, revealing that context is key to predicting amino acid residues and protein structure. Learned representations capture evolutionary and functional protein properties effectively.

Keywords:
antibody designbinder designcontext-aware designdeep learningequivariant graph transformersfinetuningmasked modelspretraining protein modelsprotein designprotein flexibility

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K

Related Experiment Videos

Last Updated: May 24, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Protein Engineering

Background:

  • Protein function is dictated by amino acid sequence, influenced by structural, evolutionary, and functional contexts.
  • Understanding these contextual relationships is crucial for predicting protein behavior and designing novel proteins.

Purpose of the Study:

  • To train masked label prediction models for learning residue representations in diverse protein contexts.
  • To investigate how pretraining and fine-tuning contextual encodings improve specialized protein representations.
  • To explore the utility of learned representations in predicting protein structure, flexibility, and interactions.

Main Methods:

  • Trained masked label prediction models on protein sequences to learn contextual residue representations.
  • Sampled sequences from learned representations to assess their ability to fold into template structures.
  • Evaluated generated sequences for evolutionary conservation, structural plasticity, and binding energies at protein-protein interfaces.

Main Results:

  • Learned representations successfully generated sequences that fold into template structures and reflect evolutionary variations.
  • For flexible proteins, sampled sequences explored the full conformational space, indicating encoded plasticity.
  • Generated sequences accurately replicated wild-type binding energies at protein-protein interfaces in silico.
  • Fine-tuning captured conserved patterns at antibody-antigen interfaces, while pretraining enhanced sequence recovery for the H3 loop.

Conclusions:

  • Contextual encodings derived from masked label prediction models provide powerful representations of amino acid residues.
  • These representations effectively capture structural, evolutionary, and functional properties of proteins.
  • The approach holds promise for protein design, understanding protein dynamics, and analyzing protein interactions.