Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

11.2K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
11.2K
Proteomics01:33

Proteomics

7.7K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
7.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ANARCII enables alignment-free antigen receptor numbering using a generalised language model.

Communications biology·2026
Same author

LICHEN enables light-chain immunoglobulin sequence generation conditioned on the heavy chain and experimental needs.

Communications biology·2026
Same author

PIPENN-EMB ensemble net and protein embeddings generalise protein interface prediction beyond homology.

Scientific reports·2025
Same author

Impact of pathogenic mutations of the GLUT1 glucose transporter on solute carrier dynamics using ComDYN enhanced sampling.

F1000Research·2025
Same author

Mining literature and pathway data to explore the relations of ketamine with neurotransmitters and gut microbiota using a knowledge-graph.

Bioinformatics (Oxford, England)·2023
Same author

Ten quick tips for sequence-based prediction of protein properties using machine learning.

PLoS computational biology·2022
Same journal

Integrated multi-assessment and structural performance index framework for stacking-sequence optimisation of natural fibre reinforced laminates.

Scientific reports·2026
Same journal

SuperiorGAT: graph attention networks for sparse LiDAR point cloud reconstruction in autonomous systems.

Scientific reports·2026
Same journal

The effect of stretching the pectoralis major, sternocleidomastoid, and iliopsoas muscles on 800 m swimming performance in master swimmers.

Scientific reports·2026
Same journal

ISNR-PQC: isometry noise resilience post quantum cryptography primitive.

Scientific reports·2026
Same journal

Identification of high-yielding and stable genotypes of barley in the cold climate of Iran using AMMI and GGE biplot models.

Scientific reports·2026
Same journal

Bayesian negative binomial modelling of spatial and temporal patterns of road traffic deaths in Ghana.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Aug 27, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.8K

ProteinGLUE multi-task benchmark suite for self-supervised protein modeling.

Henriette Capel1, Robin Weiler1, Maurits Dijkstra1

  • 1Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands.

Scientific Reports
|September 26, 2022
PubMed
Summary
This summary is machine-generated.

Self-supervised language models analyze protein sequences. We introduce ProteinGLUE, a benchmark for evaluating protein representations across seven tasks, aiding model comparison and development.

More Related Videos

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

301
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.0K

Related Experiment Videos

Last Updated: Aug 27, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.8K
Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

301
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.0K

Area of Science:

  • Computational biology
  • Bioinformatics
  • Machine learning in biology

Background:

  • Self-supervised language modeling is advancing protein sequence analysis.
  • Current methods lack standardization, hindering model comparison and generalizability.
  • Existing evaluations often focus on limited downstream tasks.

Purpose of the Study:

  • Introduce ProteinGLUE, a standardized benchmark for evaluating protein representations.
  • Facilitate rigorous comparison of diverse self-supervised learning models for protein sequences.
  • Provide baseline models and code to accelerate research in protein property prediction.

Main Methods:

  • Developed ProteinGLUE, a benchmark comprising seven per-amino-acid tasks.
  • Implemented two baseline models trained on masked symbol and next sentence prediction.
  • Evaluated model performance on downstream tasks including secondary structure and interface prediction.

Main Results:

  • Pre-training significantly improved performance on various downstream tasks compared to no pre-training.
  • The larger base model did not consistently outperform the smaller medium model.
  • ProteinGLUE provides a robust framework for assessing learned protein representations.

Conclusions:

  • ProteinGLUE offers a valuable resource for the protein sequence analysis community.
  • Standardized evaluation is crucial for advancing self-supervised learning in proteomics.
  • The benchmark facilitates the development of more generalizable and effective protein representation models.