Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.5K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.5K
Protein Networks02:26

Protein Networks

4.1K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.1K
Protein Organization01:24

Protein Organization

7.8K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
7.8K
Protein-protein Interfaces02:04

Protein-protein Interfaces

13.9K
Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a...
13.9K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

12.7K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
12.7K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.3K
3.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Folding the unfoldable 2: using AlphaFold and ESMFold to explore spurious proteins.

Bioinformatics advances·2026
Same author

InterProScan 6: a modern large-scale protein function annotation pipeline.

Bioinformatics advances·2026
Same author

Viral non-coding RNA structure annotation and API-based data retrieval with Rfam and R2DT.

bioRxiv : the preprint server for biology·2026
Same author

Amos Bairoch (1957-2025): pioneer of bioinformatics and founder of Swiss-Prot.

Bioinformatics advances·2026
Same author

Expression Atlas in 2026: enabling FAIR and open expression data through community collaboration and integration.

Nucleic acids research·2025
Same author

Exploring the Terra incognita of AI-based domain classifications.

Protein science : a publication of the Protein Society·2025
Same journal

Prime editing for precise genome engineering and modulation of fungal metabolism.

Nature biotechnology·2026
Same journal

Retargeted serine integrases for one-step, precise integration of large DNA sequences in human cells.

Nature biotechnology·2026
Same journal

A retargeted recombinase for precise insertion of large DNA.

Nature biotechnology·2026
Same journal

Experiment-guided AlphaFold3 resolves measurement-consistent protein ensembles.

Nature biotechnology·2026
Same journal

Spatially resolved profiling of extracellular vesicles in tissues with Spatial-EV-seq.

Nature biotechnology·2026
Same journal

Mapping the spatial landscape of extracellular vesicles in tissues with Spatial-EV-seq.

Nature biotechnology·2026
See all related articles

Related Experiment Video

Updated: Oct 3, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K

Using deep learning to annotate the protein universe.

Maxwell L Bileschi1, David Belanger2, Drew H Bryant2

  • 1Google Research, Cambridge, MA, USA. mlbileschi@google.com.

Nature Biotechnology
|February 22, 2022
PubMed
Summary
This summary is machine-generated.

Deep learning models accurately predict protein function from amino acid sequences, improving protein annotation and extending the coverage of the protein families database (Pfam). This advance aids in understanding microbial protein function and discovering new human protein annotations.

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.6K
Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification
10:37

Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification

Published on: November 15, 2017

12.1K

Related Experiment Videos

Last Updated: Oct 3, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.3K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.6K
Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification
10:37

Deep Proteome Profiling by Isobaric Labeling, Extensive Liquid Chromatography, Mass Spectrometry, and Software-assisted Quantification

Published on: November 15, 2017

12.1K

Area of Science:

  • * Computational biology
  • * Bioinformatics
  • * Structural biology

Background:

  • * Predicting protein function from amino acid sequences is crucial for biological and translational research.
  • * Current alignment-based methods fail to annotate approximately one-third of microbial protein sequences, limiting data utilization.
  • * Gaps in functional annotation hinder the exploitation of genomic data from diverse organisms.

Purpose of the Study:

  • * To develop and evaluate deep learning models for accurate functional annotation of unaligned amino acid sequences.
  • * To assess the models' ability to infer evolutionary patterns and cluster novel protein families.
  • * To integrate deep learning with existing methods for enhanced remote homology detection.

Main Methods:

  • * Training deep learning models on a comprehensive dataset from the Protein Families database (Pfam).
  • * Rigorous benchmark assessments using 17,929 Pfam families to evaluate model performance.
  • * Combining deep learning predictions with established alignment-based techniques.

Main Results:

  • * Deep learning models accurately predict functional annotations for unaligned sequences, inferring evolutionary substitution patterns.
  • * Models successfully cluster sequences from previously unseen protein families.
  • * Integration with existing methods significantly improves remote homology detection, indicating complementary information capture.
  • * Pfam coverage is extended by over 9.5%, surpassing additions from the past decade.
  • * Functional annotations were predicted for 360 human reference proteome proteins lacking prior Pfam annotation.

Conclusions:

  • * Deep learning offers a powerful approach to overcome limitations of alignment-based protein function prediction.
  • * These models enhance protein annotation tools, significantly expanding the functional knowledge of protein databases like Pfam.
  • * The findings suggest deep learning will be integral to future protein annotation strategies and biological discovery.