Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Networks02:26

Protein Networks

4.1K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.1K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

19.3K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
19.3K
Protein Families02:47

Protein Families

3.4K
3.4K
Gene Families01:57

Gene Families

9.2K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
9.2K
Multi-pass Transmembrane Proteins and β-barrels01:09

Multi-pass Transmembrane Proteins and β-barrels

5.6K
In multi-pass transmembrane proteins, the polypeptide chain crosses the membrane more than once. The transmembrane polypeptide chain either forms an α-helix or β-strand structure. α-Helix containing multi-pass transmembrane proteins are ubiquitous, whereas β-strand containing ones are mainly found in gram-negative bacteria, mitochondria, and chloroplasts.
α-Helix containing multi-pass transmembrane proteins
Multi-pass transmembrane proteins such as...
5.6K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.3K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Posttransplant mortality risk assessment for adult-to-adult right-lobe living donor liver recipients with benign end-stage liver disease.

Scandinavian journal of gastroenterology·2012
Same author

Sodium nitrite protects against kidney injury induced by brain death and improves post-transplant function.

Kidney international·2012
Same author

OIC-A006-loaded true bone ceramic heals rabbit critical-sized segmental radial defect.

Die Pharmazie·2012
Same author

Structural and functional characterization of mature forms of metalloprotease E495 from Arctic sea-ice bacterium Pseudoalteromonas sp. SM495.

PloS one·2012
Same author

Liquid chromatography-mass spectrometric multiple reaction monitoring-based strategies for expanding targeted profiling towards quantitative metabolomics.

Current drug metabolism·2012
Same author

Practical access to four stereoisomers of naftidrofuryl and their binding affinity towards 5-hydroxytryptamine 2A receptor.

Bioorganic & medicinal chemistry letters·2012
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Sep 16, 2025

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.9K

ProtGO: universal protein function prediction utilizing multi-modal gene ontology knowledge.

Boyan Wang1,2, Yangliao Geng3, Xingyi Cheng4

  • 1School of Intelligence Science and Technology, Nanjing University, Suzhou, Jiangsu 215163, China.

Bioinformatics (Oxford, England)
|July 9, 2025
PubMed
Summary
This summary is machine-generated.

ProtGO enhances protein function prediction by integrating multi-modal data with Gene Ontology (GO) knowledge. This AI framework significantly improves predictions, addressing challenges in life sciences and biomedicine.

More Related Videos

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.8K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

12.9K

Related Experiment Videos

Last Updated: Sep 16, 2025

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

9.9K
Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics
08:09

Annotation of Plant Gene Function via Combined Genomics, Metabolomics and Informatics

Published on: June 17, 2012

19.8K
Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames
07:38

Mass Spectrometry-Based Proteomics Analyses Using the OpenProt Database to Unveil Novel Proteins Translated from Non-Canonical Open Reading Frames

Published on: April 11, 2019

12.9K

Area of Science:

  • Biochemistry and Molecular Biology
  • Bioinformatics
  • Artificial Intelligence in Life Sciences

Background:

  • Protein function prediction is a critical challenge in life sciences, exacerbated by the increasing number of AI-designed proteins and the need to process multi-modal data.
  • Traditional low-throughput experimental methods are insufficient to keep pace with the growing volume of known protein sequences and the demand for functional prediction.

Purpose of the Study:

  • To develop a universal multi-modal method for protein function prediction that addresses the limitations of current approaches.
  • To leverage the Gene Ontology (GO) knowledge base and integrate diverse data modalities for enhanced prediction accuracy.

Main Methods:

  • ProtGO framework utilizes pre-trained protein language models (PLMs) for sequence representation.
  • Integrates specialized modules for text descriptions (text alignment), species-specific taxonomy (taxonomy encoding), and biological relationships (GO graph embedding).
  • Combines four distinct knowledge representations to maximize the utility of GO resources.

Main Results:

  • ProtGO demonstrated significant improvements in protein function prediction, with an 8% to 27% increase in the maximum F1 measure (Fmax) compared to baseline models.
  • The framework effectively enhances the performance of standard PLMs and biological language models (LMs) in GO prediction tasks.
  • Achieved outstanding performance by integrating functional and evolutionary knowledge from multiple data sources.

Conclusions:

  • ProtGO offers a robust and adaptable framework for accurate protein function prediction.
  • The multi-modal approach effectively harnesses biological knowledge, advancing the field of bioinformatics.
  • This work provides a valuable tool for researchers in life sciences and biomedicine facing big data challenges.