Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.7K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.7K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.8K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.8K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.1K
3.1K
Protein Networks02:26

Protein Networks

3.9K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
3.9K
Conserved Binding Sites01:49

Conserved Binding Sites

4.1K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.1K
Ligand Binding and Linkage00:49

Ligand Binding and Linkage

4.7K
Allosteric proteins have more than one ligand binding site; the binding of a ligand to any of these sites influences the binding of ligands to the other sites. When a protein is allosteric, its binding sites are called coupled or linked.  In the case of enzymes, the site that binds to the substrate is known as the active site and the other site is known as the regulatory site. When a ligand binds to the regulatory site, this leads to conformational changes in the protein that can influence...
4.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Effectiveness of Clinical Pathway-Based Rehabilitation Nursing on Swallowing Function in Stroke Patients With Dysphagia: A Retrospective Cohort Study.

Journal of visualized experiments : JoVE·2026
Same author

High-Performance Ink-Writable Polyurethane Elastomers Based on Crystalline PCL and Hindered Urea Bonds.

Macromolecular rapid communications·2026
Same author

Dynamic liquid film crystallization (DLFC): An additive-free strategy for spherical curcumin particles with superior powder flowability for functional foods.

Food chemistry·2026
Same author

SynFit: Synergistic Contrastive Learning for Multi-Objective Protein Fitness Prediction and Optimization.

bioRxiv : the preprint server for biology·2026
Same author

Ischaemic stroke recurrence in patients with symptomatic intracranial atherosclerotic stenosis in China (PROMISE): a multivariable prediction model development and validation study.

The Lancet. Digital health·2026
Same author

Epidemiological characteristics and viral molecular evolution analysis of severe fever with thrombocytopenia syndrome in Shandong Province, China.

Ticks and tick-borne diseases·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K

Learning maximally spanning representations improves protein function annotation.

Jiaqi Luo1, Yunan Luo1

  • 1School of Computational Science and Engineering, Georgia Institute of Technology.

Biorxiv : the Preprint Server for Biology
|March 3, 2025
PubMed
Summary
This summary is machine-generated.

MSRep, a new deep learning framework, improves protein function annotation by addressing data imbalance. It enhances prediction accuracy for both common and rare protein functions, aiding the study of uncharacterized proteins.

More Related Videos

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.5K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Related Experiment Videos

Last Updated: May 24, 2025

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.2K
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.5K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Area of Science:

  • Computational biology
  • Bioinformatics
  • Machine learning

Background:

  • Automated protein function annotation is vital for understanding biological processes, medicine, and biotechnology.
  • Existing methods struggle with imbalanced data, leading to poor performance on understudied protein functions.
  • This imbalance stems from biases in data collection and protein evolution.

Purpose of the Study:

  • To develop MSRep, a novel deep learning framework to address data imbalance in protein function annotation.
  • To improve prediction accuracy for both well-represented and underrepresented protein functions.
  • To enhance the generalizability of protein function prediction models.

Main Methods:

  • MSRep refines a pre-trained protein language model using a novel loss function inspired by neural collapse (NC).
  • The framework induces an NC-like structure to ensure balanced representation of all function classes in the embedding space.
  • Evaluated across four diverse protein function annotation tasks (EC numbers, Gene3D, Pfam, GO terms).

Main Results:

  • MSRep demonstrated superior predictive performance across all tested annotation tasks.
  • The framework significantly improved accuracy for both well-studied and understudied protein functions.
  • MSRep outperformed several existing state-of-the-art protein annotation tools.

Conclusions:

  • MSRep effectively addresses the challenge of imbalanced data in protein function annotation.
  • The approach enhances the annotation of understudied functions and uncharacterized proteins.
  • MSRep holds promise for advancing protein function studies and accelerating biological discovery.