Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Networks02:26

Protein Networks

3.9K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
3.9K
Proteomics01:33

Proteomics

7.0K
A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term...
7.0K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.6K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.6K
Protein-protein Interfaces02:04

Protein-protein Interfaces

12.4K
Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a...
12.4K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

18.7K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
18.7K
Protein and Protein Structure02:15

Protein and Protein Structure

77.4K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
77.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluating the role of pretraining dataset size and diversity on single-cell foundation model performance.

Nature methods·2026
Same author

Scalable and cost-efficient custom gene library assembly from oligopools.

Science advances·2026
Same author

Closing the loop: Experimentally validated methods in artificial intelligence-driven protein design.

Current opinion in structural biology·2026
Same author

Sequence Display enables large-scale sequence-activity datasets for rapid protein evolution.

Nature biotechnology·2026
Same author

Trainable subnetworks reveal insights into structure knowledge organization in protein language models.

PLoS computational biology·2026
Same author

Improving atlas-scale single-cell annotation models with hierarchical cross-entropy loss.

Nature computational science·2026
Same journal

ARISE: RNA-Anchored Shared-Edge Topology and Hierarchical Fusion for Spatial Multi-Omics Integration.

Bioinformatics (Oxford, England)·2026
Same journal

Interactive exploration of biobank-scale ancestral recombination graphs with Lorax.

Bioinformatics (Oxford, England)·2026
Same journal

PepMCP: A Graph-Based Membrane Contact Probability Predictor for Membrane-Lytic Antimicrobial Peptides.

Bioinformatics (Oxford, England)·2026
Same journal

ARGscape: A modular, interactive tool for manipulation of spatiotemporal ancestral recombination graphs.

Bioinformatics (Oxford, England)·2026
Same journal

A-liner: linear alignment visualizer for genome comparisons.

Bioinformatics (Oxford, England)·2026
Same journal

Interacting Species Database (ISDB): Comprehensive Resource for Interspecies Interactions at the Molecular Level.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: May 13, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K

ProtNote: a multimodal method for protein-function annotation.

Samir Char1, Nathaniel Corley2, Sarah Alamdari3

  • 1Microsoft Cloud & AI, Microsoft, Redmond, WA 98052, United States.

Bioinformatics (Oxford, England)
|April 15, 2025
PubMed
Summary
This summary is machine-generated.

ProtNote, a new deep-learning model, predicts protein functions using text, even for novel ones. This advances protein biology by enabling flexible function discovery beyond trained data.

More Related Videos

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.6K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Related Experiment Videos

Last Updated: May 13, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.4K
Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.6K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.3K

Area of Science:

  • Molecular Biology
  • Bioinformatics
  • Computational Biology

Background:

  • The protein sequence-function relationship is crucial for protein biology and engineering.
  • Less than 1% of known protein sequences have experimentally verified functions.
  • Current deep learning models for protein function prediction are limited to trained functions.

Purpose of the Study:

  • To introduce ProtNote, a multimodal deep-learning model for protein function prediction.
  • To enable both supervised and zero-shot protein function prediction using free-form text.
  • To generalize predictions to unseen and novel protein functions.

Main Methods:

  • Developed a multimodal deep-learning model named ProtNote.
  • Leveraged free-form text for protein function prediction.
  • Evaluated performance on supervised and zero-shot tasks, including novel Gene Ontology and Enzyme Commission number predictions.

Main Results:

  • ProtNote achieves near state-of-the-art performance on its training set.
  • ProtNote generalizes effectively to unseen and novel protein functions in zero-shot settings.
  • ProtNote outperforms baseline models in predicting novel Gene Ontology and Enzyme Commission numbers, capturing nuanced sequence-function relationships.

Conclusions:

  • ProtNote enhances protein function discovery by allowing unrestricted free text inputs.
  • The model unlocks biological use cases previously inaccessible due to limitations of predefined labels.
  • ProtNote is a valuable tool for navigating the dynamic landscape of protein biology and engineering.