Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.0K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.0K
Protein Networks02:26

Protein Networks

2.4K
2.4K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

11.0K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
11.0K
Protein and Protein Structure02:15

Protein and Protein Structure

79.9K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.9K
Protein-protein Interfaces02:04

Protein-protein Interfaces

12.6K
Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a...
12.6K
Conservation of Protein Domains02:26

Conservation of Protein Domains

3.1K
3.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

On the state of protein function prediction: a report on the fourth CAFA challenge.

bioRxiv : the preprint server for biology·2026
Same author

PhenoRareAI: Phenotype-based intelligent diagnosis for rare neuromuscular disorders of glycogen storage disease and spinal muscular atrophy.

Health information science and systems·2026
Same author

NPRL2 restricts porcine reproductive and respiratory syndrome virus replication by targeting viral Nsp1α for autophagic degradation.

Veterinary research·2026
Same author

Predictive Factors for Ciliary Block after Cataract and Goniosynechialysis Surgery in Primary Angle-Closure Disease.

Ophthalmology. Glaucoma·2026
Same author

VACmap: an accurate long-read aligner for unraveling complex genomic rearrangements.

Nature communications·2026
Same author

Vaner2: towards more general biomedical named entity recognition using multi-task large language model encoders.

Health information science and systems·2025
Same journal

Tranquillyzer: A Neural Network Framework for Long-read Annotation and Demultiplexing.

Genomics, proteomics & bioinformatics·2026
Same journal

Advancing Functional Transcriptomics in Zebrafish with High-accuracy Full-length RNA Sequencing.

Genomics, proteomics & bioinformatics·2026
Same journal

NanoRAPID: A Deep Learning-based Framework for Single-molecule RNA Structure Analysis Using Nanopore Direct RNA Sequencing.

Genomics, proteomics & bioinformatics·2026
Same journal

Single-cell Multiomic and Spatiotemporal Dissection of the Liver Circadian Clock.

Genomics, proteomics & bioinformatics·2026
Same journal

Ï€-HelixNovo2: Making Accurate Online De Novo Peptide Sequencing Available to All.

Genomics, proteomics & bioinformatics·2026
Same journal

CTSC-RAB38 Potentiates Responsiveness to PD-1 Blockade in Esophageal Squamous Cell Carcinoma.

Genomics, proteomics & bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Aug 2, 2025

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

9.3K

NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations.

Shaojun Wang1, Ronghui You1, Yunjia Liu2

  • 1Institute of Science and Technology for Brain-Inspired Intelligence and MOE Frontiers Center for Brain Science, Fudan University, Shanghai 200433, China.

Genomics, Proteomics & Bioinformatics
|April 19, 2023
PubMed
Summary
This summary is machine-generated.

NetGO 3.0 enhances automated protein function prediction by integrating protein language models, specifically Evolutionary Scale Modeling (ESM)-1b embeddings, to leverage information from unannotated proteins, significantly improving performance.

Keywords:
Large-scale multi-label learningLearning to rankProtein function predictionProtein language modelWeb service

More Related Videos

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.7K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.5K

Related Experiment Videos

Last Updated: Aug 2, 2025

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes
09:10

A Fast and Quantitative Method for Post-translational Modification and Variant Enabled Mapping of Peptides to Genomes

Published on: May 22, 2018

9.3K
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.7K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.5K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Automated Function Prediction (AFP) methods like NetGO 2.0 are crucial for understanding protein functions.
  • Current AFP methods primarily rely on experimentally annotated proteins, overlooking the vast potential of unannotated protein data.
  • Protein language models (PLMs) offer a novel approach to learn sequence-based representations through self-supervision.

Purpose of the Study:

  • To develop an improved AFP method by incorporating information from unannotated proteins.
  • To evaluate the efficacy of using protein language model embeddings for AFP.
  • To enhance the NetGO 2.0 framework with a new PLM-based component.

Main Methods:

  • Representing proteins using Evolutionary Scale Modeling (ESM)-1b embeddings.
  • Training a logistic regression (LR) model, termed LR-ESM, for AFP using ESM-1b representations.
  • Integrating the developed LR-ESM model into the existing NetGO 2.0 framework to create NetGO 3.0.

Main Results:

  • The LR-ESM model demonstrated performance comparable to the top components of NetGO 2.0.
  • The integration of LR-ESM into NetGO 2.0 resulted in substantial performance improvements in automated function prediction.
  • NetGO 3.0 effectively leverages information from unannotated proteins, expanding the scope of AFP.

Conclusions:

  • Protein language models, exemplified by ESM-1b, provide valuable representations for improving automated function prediction.
  • NetGO 3.0 represents a significant advancement in AFP by successfully integrating PLM-based features.
  • The enhanced NetGO 3.0 tool offers improved performance and broader applicability in protein function annotation.