Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.8K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.8K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.1K
3.1K
Protein Complex Assembly02:41

Protein Complex Assembly

2.0K
2.0K
Protein and Protein Structure02:15

Protein and Protein Structure

79.4K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.4K
Protein Networks02:26

Protein Networks

2.3K
2.3K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

3.9K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
3.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

EquiHGNN: Scalable rotationally equivariant hypergraph neural networks.

The Journal of chemical physics·2026
Same author

Enabling multi-target drug discovery through latent evolutionary optimization and synthesis-aware prioritization (EVOSYNTH).

Communications chemistry·2026
Same author

BioMedKG: multimodal contrastive representation learning in augmented BioMedical knowledge graphs.

Frontiers in systems biology·2025
Same author

DrugPipe: Generative artificial intelligence-assisted virtual screening pipeline for generalizable and efficient drug repurposing.

Biology methods & protocols·2025
Same author

Machine learning for automated electrical penetration graph analysis of aphid feeding behavior: Accelerating research on insect-plant interactions.

PloS one·2025
Same author

ProteinReDiff: Complex-based ligand-binding proteins redesign by equivariant diffusion-based generative models.

Structural dynamics (Melville, N.Y.)·2024
Same journal

Gap junction communication regulates luminal-myoepithelial crosstalk and cell differentiation in a bilayered human mammary epithelial cell model.

Biology methods & protocols·2026
Same journal

Nonlinear combinatorial analysis of blood transcriptomes identifies PRKAR1A as a regulator of TDP-43 pathophysiology in amyotrophic lateral sclerosis.

Biology methods & protocols·2026
Same journal

Assessing the feasibility of machine learning for ancient DNA age prediction: Limitations and insights.

Biology methods & protocols·2026
Same journal

Advances in the detection of antimicrobial resistance in aquatic environments: a methodological perspective.

Biology methods & protocols·2026
Same journal

sc-rDSeq: Droplet-based single-cell full-length total RNA-seq method.

Biology methods & protocols·2026
Same journal

Discovery of novel InhA inhibitors through structural bioinformatics and machine learning-driven QSAR screening of natural products.

Biology methods & protocols·2026
See all related articles

Related Experiment Video

Updated: Jun 21, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.0K

Multimodal pretraining for unsupervised protein representation learning.

Viet Thanh Duy Nguyen1, Truong Son Hy1,2

  • 1FPT Software AI Center, HCMC, Hanoi, Vietnam.

Biology Methods & Protocols
|July 10, 2024
PubMed
Summary
This summary is machine-generated.

Multimodal Protein Representation Learning (MPRL) unifies protein structures for better biological insights. This framework enhances predictions in protein-ligand binding, fold classification, and enzyme activity.

Keywords:
Geometric Deep LearningLarge Language ModelsMultimodal representationProtein representation learningUnsupervised pretraining

More Related Videos

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.8K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K

Related Experiment Videos

Last Updated: Jun 21, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.0K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.8K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K

Area of Science:

  • Molecular Biology
  • Structural Biology
  • Computational Biology

Background:

  • Proteins are essential biomolecules with complex structures and functions vital for biological processes.
  • Understanding protein structure-function relationships is key for advancements in medicine and drug design.
  • Current methods may not fully capture the intricate details of protein architecture.

Purpose of the Study:

  • To introduce a novel framework, Multimodal Protein Representation Learning (MPRL), for learning unified protein representations.
  • To integrate primary and tertiary protein structures using a multimodal pretraining approach.
  • To develop a symmetry-preserving method for enhanced protein analysis.

Main Methods:

  • MPRL employs Evolutionary Scale Modeling (ESM-2) for protein sequence analysis.
  • Variational Graph Auto-Encoders (VGAE) are used for residue-level graph representations.
  • PointNet Autoencoder (PAE) processes 3D atomic point clouds.
  • Auto-Fusion synthesizes joint representations from multiple pretrained models.

Main Results:

  • MPRL learns unsupervised, unified protein representations by integrating diverse structural data.
  • The framework preserves critical symmetries inherent in protein structures.
  • Significant performance enhancements were observed across multiple downstream tasks.

Conclusions:

  • MPRL offers a robust and comprehensive approach to protein representation learning.
  • The framework advances the understanding of protein dynamics and facilitates research in molecular biology and drug design.
  • Publicly available source code enables further research and development.