Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Networks02:26

Protein Networks

2.8K
2.8K
Protein Networks02:26

Protein Networks

4.4K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.4K
Protein-protein Interfaces02:04

Protein-protein Interfaces

14.4K
Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a...
14.4K
Protein-Protein Interfaces02:04

Protein-Protein Interfaces

4.4K
4.4K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

14.0K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
14.0K
Proteomics01:33

Proteomics

9.2K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
9.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Characterizing the MSMP-CCR2 Interaction through Molecular Dynamics Simulations and Machine Learning Approaches.

Journal of chemical information and modeling·2026
Same author

Exploring the association between common genetic deletions and aging: insights from the Canadian Longitudinal Study on Aging.

GeroScience·2026
Same author

THe Biom: a platform for visualization and exploration of cancer transcriptomic biomarkers identified by robust feature selection.

Bioinformatics advances·2026
Same author

De novo variants in the splicing factor gene SF3B1 are associated with neurodevelopmental disorders.

Nature communications·2026
Same author

Investigating the neuronal role of the proteasomal ATPase subunit gene PSMC5 in neurodevelopmental proteasomopathies.

Nature communications·2025
Same author

A systematic benchmark of integrative strategies for microbiome-metabolome data.

Communications biology·2025
Same journal

Molecular Solution to the Paradox of Ancient Brain Preservation.

Journal of proteome research·2026
Same journal

From Method-Defined Signals to Reference Measurement Procedures: Two Decades of Mass Spectrometry-Based ProGRP Quantification.

Journal of proteome research·2026
Same journal

Proteomic Profiling of Extracellular Vesicle-Enriched Plasma Using Mag-Net for Biomarker Discovery in Pancreatic Ductal Adenocarcinoma.

Journal of proteome research·2026
Same journal

Computationally Efficient Bayesian Estimation of Graphical Networks for Omics Data.

Journal of proteome research·2026
Same journal

Hierarchy of MS-Based Evidence.

Journal of proteome research·2026
Same journal

Proteomic Profiling of Exosomes from HPV-Positive and HPV-Negative Head and Neck Squamous Cell Carcinoma: Selective Cargo Packaging.

Journal of proteome research·2026
See all related articles

Related Experiment Video

Updated: Jan 7, 2026

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

888

Protein Language Models: Applications and Perspectives.

Mickael Leclercq1, Arnaud Droit1,2

  • 1Axe Endo-Nephro, Centre de recherche du CHU de Québec-Université Laval, Québec, QC G1 V 4G2, Canada.

Journal of Proteome Research
|December 26, 2025
PubMed
Summary
This summary is machine-generated.

Protein language models (pLMs), adapted from large language models (LLMs), rapidly analyze protein sequences. These advanced AI tools accelerate biological research and drug discovery by predicting structures and functions.

Keywords:
biophysical property predictioncomputational scalability and efficiencyde novo protein sequence generationpost-translational modification predictionprotein function annotationprotein language models (pLMs)protein structure predictionprotein−protein interaction modelingsequence embeddingstransformer architectures

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

13.3K

Related Experiment Videos

Last Updated: Jan 7, 2026

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

888
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

13.3K

Area of Science:

  • Proteomics
  • Bioinformatics
  • Computational Biology

Background:

  • Large language models (LLMs) originally designed for human text have been adapted for proteomics as protein language models (pLMs).
  • pLMs process amino acid sequences similarly to how LLMs process text, learning patterns from vast datasets.
  • This adaptation enables novel applications in understanding protein behavior and function.

Purpose of the Study:

  • To highlight the capabilities and applications of protein language models (pLMs) in proteomics.
  • To discuss the advantages of pLMs over traditional methods in terms of speed and insight generation.
  • To address the challenges and future directions in pLM development, including resource requirements and bias reduction.

Main Methods:

  • Treating amino acid sequences as 'sentences' for pattern recognition.
  • Utilizing large-scale sequence databases for model training.
  • Applying pLMs to tasks such as protein structure prediction, function annotation, and interaction mapping.

Main Results:

  • pLMs offer faster insights compared to traditional proteomics approaches.
  • Key applications include predicting protein structures, annotating functions, designing novel sequences, and mapping molecular interactions.
  • Current research focuses on enhancing prediction accuracy and reducing biases through efficient training and smaller models.

Conclusions:

  • pLMs are revolutionizing proteomics by providing rapid, large-scale insights.
  • Continued development, driven by growing sequence databases, will accelerate drug discovery and basic research.
  • Future pLMs will offer deeper understanding of protein functions and disease pathways, aiding experimental design.