Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein and Protein Structure02:15

Protein and Protein Structure

86.9K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
86.9K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
Protein and Protein Structures02:15

Protein and Protein Structures

18.7K
18.7K
Protein Organization01:24

Protein Organization

9.1K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
9.1K
Protein Organization01:13

Protein Organization

155.9K
Overview
155.9K
Proteomics01:33

Proteomics

9.3K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
9.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Peculiarities of the Interaction of the Bacteriolytic Protease Blp from <i>Lysobacter capsici</i> XL1 with the Cell Wall of <i>Staphylococcus aureus</i> 209P.

International journal of molecular sciences·2026
Same author

Sign Epistasis Can be Absent in Multi-peaked Landscapes With Neutral Mutations.

Genome biology and evolution·2026
Same author

MPXV RNA-seq data provide evidence for protection of viral transcripts from APOBEC3 editing.

Journal of virology·2026
Same author

KRASAVA-An Expert System for Virtual Screening of KRAS G12D Inhibitors.

International journal of molecular sciences·2026
Same author

Convergent pairs of highly transcribed genes restrict chromatin looping in Dictyostelium discoideum.

Nucleic acids research·2025
Same author

Limitations of Out-of-Distribution Detection in 3D Medical Image Segmentation.

Journal of imaging·2023
Same journal

Correction to: Pathogenicity patterns in cytochrome P450 family.

Bioinformatics advances·2026
Same journal

Region-aware bridge modeling enables interpretable mesoscale representation of spatial transcriptomic tissue sections.

Bioinformatics advances·2026
Same journal

Microbiome differential abundance methodologies to detect relevant taxa associated with chemotherapy toxicity rate in colorectal cancer.

Bioinformatics advances·2026
Same journal

maldipickr dereplicates microbial MALDI-TOF spectra to facilitate multiplexed isolation.

Bioinformatics advances·2026
Same journal

RAM-MSA: an anytime memory-bounded method for exact multiple sequence alignment using path finding.

Bioinformatics advances·2026
Same journal

Interpretable machine learning for low-sample multi-omics: a case study of ferret vaccine response.

Bioinformatics advances·2026
See all related articles

Related Experiment Video

Updated: Jan 17, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.7K

MULAN: multimodal protein language model for sequence and structure encoding.

Daria Frolova1,2, Marina Pak1,3, Anna Litvin1,3,4

  • 1Ligand Pro, Moscow 121205, Russia.

Bioinformatics Advances
|September 24, 2025
PubMed
Summary
This summary is machine-generated.

MULAN, a multimodal protein language model (PLM), enhances protein representations by combining sequence and structure data efficiently. This structure-aware PLM improves predictions, particularly for protein-protein interactions, without extensive retraining.

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

949

Related Experiment Videos

Last Updated: Jan 17, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.7K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

949

Area of Science:

  • Computational biology
  • Bioinformatics
  • Artificial intelligence in biology

Background:

  • Protein language models (PLMs) primarily utilize protein sequences for high-quality representations.
  • Incorporating protein structural information is crucial for various prediction tasks, driving interest in structure-aware PLMs.
  • Existing structure-aware PLMs often require training from scratch or substantial parameter additions for structure encoding.

Purpose of the Study:

  • To introduce MULAN, a multimodal protein language model (PLM) that integrates both sequence and angle-based structural information.
  • To develop a parameter-efficient approach for incorporating structural awareness into pre-trained PLMs.
  • To evaluate MULAN's performance across diverse downstream tasks compared to existing models.

Main Methods:

  • Developed MULAN, a multimodal PLM featuring a pre-trained sequence encoder and a parameter-efficient Structure Adapter.
  • Fused the sequence encoder and Structure Adapter, training them together for multimodal representation learning.
  • Evaluated MULAN on nine downstream prediction tasks, assessing its performance against sequence-only and structure-aware baselines.

Main Results:

  • MULAN models demonstrated improved performance across various sizes compared to sequence-only ESM2 and structure-aware SaProt.
  • Significant improvements were observed in protein-protein interaction prediction, with AUROC increasing by up to 0.12.
  • MULAN achieved enhanced structural awareness through efficient fine-tuning of existing PLMs, avoiding costly training from scratch.

Conclusions:

  • MULAN offers an effective and computationally inexpensive method to imbue protein representations with structural awareness.
  • The proposed Structure Adapter provides a parameter-efficient way to integrate structural information into PLMs.
  • MULAN represents a valuable advancement for structure-aware protein modeling and prediction tasks.