Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Peptide Identification Using Tandem Mass Spectrometry01:33

Peptide Identification Using Tandem Mass Spectrometry

Tandem mass spectrometry, also known as MS/MS or MS2, is an analytical technique that employs two mass analyzers. Essentially it is a series of mass spectrometers that helps isolate a particular biomolecule and then helps study its chemical properties.
This technique helps gather information regarding the protein from which the peptide was obtained and to study the peptides’ amino acid sequence. Identifying peptides from a complex mixture is an important component of the growing field of...
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Protein and Protein Structure02:15

Protein and Protein Structure

Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme can...
Protein Families02:47

Protein Families

Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key locations, protein...
Protein Families02:47

Protein Families

Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key locations, protein...
Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

DeepPI: Alignment-Free Analysis of Flexible Length Proteins Based on Deep Learning and Image Generator.

Interdisciplinary sciences, computational life sciences·2024
Same author

cPlot: Contig-Plotting Visualization for the Analysis of Short-Read Nucleotide Sequence Alignments.

International journal of molecular sciences·2022
Same journal

SA-MTP: a structure-aware framework for multifunctional therapeutic peptide annotation.

Briefings in bioinformatics·2026
Same journal

Genome assemblies and annotations are not static and need support for tracking their evolution.

Briefings in bioinformatics·2026
Same journal

A historical journey of metabolite-protein interaction discovery: from data harmonization to AI-driven prediction.

Briefings in bioinformatics·2026
Same journal

Bridging local-global transmembrane protein contexts with contrastive pretraining for alignment-free pathogenicity prediction.

Briefings in bioinformatics·2026
Same journal

Prediction of drug hypersensitivity by comprehensive modeling of HLA-peptidomes.

Briefings in bioinformatics·2026
Same journal

EssTFNet: integration of adaptive time-frequency and DNA language models for interpretable human essential gene prediction.

Briefings in bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 7, 2026

Computational Prediction of Amino Acid Preferences of Potentially Multispecific Peptide-Binding Domains Involved in Protein-Protein Interactions
06:50

Computational Prediction of Amino Acid Preferences of Potentially Multispecific Peptide-Binding Domains Involved in Protein-Protein Interactions

Published on: January 26, 2024

ProtDML: label-aware representation learning for broad-spectrum protein function prediction.

Yejin Kan1, Gangman Yi1

  • 1Department of Computer Science and Artificial Intelligence, Dongguk University, 30, Pildong-ro 1-gil, Jung-gu, Seoul 04620, South Korea.

Briefings in Bioinformatics
|June 5, 2026
PubMed
Summary
This summary is machine-generated.

ProtDML, a novel framework, enhances protein function prediction by explicitly modeling label correlations using distance metric learning. This approach improves accuracy and scalability for diverse biological tasks.

Keywords:
distance metric learningmulti-label classificationprotein function predictionprotein representation learning

More Related Videos

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Related Experiment Videos

Last Updated: Jun 7, 2026

Computational Prediction of Amino Acid Preferences of Potentially Multispecific Peptide-Binding Domains Involved in Protein-Protein Interactions
06:50

Computational Prediction of Amino Acid Preferences of Potentially Multispecific Peptide-Binding Domains Involved in Protein-Protein Interactions

Published on: January 26, 2024

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning

Background:

  • Protein function prediction is vital for understanding biological processes, yet experimental annotations lag behind sequence data growth.
  • Existing protein language models (pLMs) often fail to capture complex correlations among functional labels, limiting their predictive power.
  • There is a need for scalable and accurate methods to predict protein function, especially considering protein multifunctionality.

Purpose of the Study:

  • To develop ProtDML, a framework that improves protein function prediction by explicitly modeling label correlations.
  • To enhance the modeling of protein multifunctionality through a novel distance metric learning approach.
  • To integrate complementary protein representations for improved sequence-based prediction.

Main Methods:

  • ProtDML utilizes distance metric learning with a similarity-weighted pull-push objective to model label co-occurrence patterns.
  • A multi-view strategy integrates diverse sequence-derived protein representations.
  • The framework learns a task-specific distance metric, creating a label-aware feature space.

Main Results:

  • ProtDML significantly outperforms sequence-based baselines on Pfam multi-label classification by creating distinct, separable clusters.
  • The method demonstrates structural generalization capabilities on SCOP benchmarks without explicit structural input.
  • ProtDML achieves state-of-the-art performance on the CAFA3 benchmark, showing robustness in large-scale, imbalanced prediction tasks.

Conclusions:

  • ProtDML offers an accurate and scalable solution for sequence-based protein function prediction by effectively modeling label correlations.
  • The framework is model-agnostic and complements existing protein language model embeddings.
  • ProtDML shows promise for real-world applications, including viral data analysis.