Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

22.2K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
22.2K
Protein Networks02:26

Protein Networks

2.9K
2.9K
Protein Networks02:26

Protein Networks

4.7K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.7K
Graphs of Functions01:30

Graphs of Functions

510
Graphs of functions provide a visual representation of how output values change in response to varying inputs. Each point on the graph corresponds to an ordered pair, where the x-coordinate (independent variable) determines the horizontal position and the y-coordinate (dependent variable) determines the vertical position. Linear functions like y = x give a straight line, indicating a constant rate of change.Nonlinear functions display more complex behaviors. Even power functions generate...
510
Gene Families01:57

Gene Families

10.2K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
10.2K
Gene Families01:57

Gene Families

4.0K
4.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Promera: a unified model for biomolecular structure prediction, filtering, and design.

bioRxiv : the preprint server for biology·2026
Same author

Memorization in large language models in medicine prevalence characteristics and implications.

Nature communications·2026
Same author

Privacy-enhancing sequential learning under heterogeneous selection bias in multi-site electronic health records data.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Thousandfold Expansion Microscopy.

bioRxiv : the preprint server for biology·2026
Same author

SwitchCraft: A Programmatic Framework for Designing State-Switching Proteins.

ArXiv·2026
Same author

Multi-resolution modeling of a discrete stochastic process identifies causes of cancer.

... International Conference on Learning Representations·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Apr 10, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.3K

Exploiting ontology graph for predicting sparsely annotated gene function.

Sheng Wang1, Hyunghoon Cho1, ChengXiang Zhai1

  • 1Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, Computer Science and Artificial Intelligence Laboratory, MIT, Cambridge, MA and Department of Mathematics, MIT, Cambridge, MA, USA.

Bioinformatics (Oxford, England)
|June 15, 2015
PubMed
Summary
This summary is machine-generated.

Predicting gene function is improved by clusDCA, a novel algorithm that transfers information between similar functions. This method effectively addresses sparsely annotated functions across species, outperforming existing approaches.

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.5K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.6K

Related Experiment Videos

Last Updated: Apr 10, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.3K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.5K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.6K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Systems Biology

Background:

  • Predicting gene and protein function using molecular interaction networks aids in refining biological databases like Gene Ontology (GO).
  • Sparsely annotated functions (<10 genes) present a significant challenge for prediction algorithms due to limited data, leading to overfitting and poor performance.
  • Existing function prediction algorithms struggle with the 'overfitting' issue of sparsely annotated functions and scalability to large annotation catalogs.

Purpose of the Study:

  • To develop a novel algorithm for predicting gene function that effectively handles sparsely annotated functions.
  • To improve the scalability and accuracy of function prediction methods for large biological datasets.
  • To address the overfitting problem inherent in predicting functions with limited annotated genes.

Main Methods:

  • Proposed a novel function prediction algorithm named clusDCA.
  • Implemented an information transfer mechanism between similar functional labels to mitigate overfitting.
  • Ensured the method's scalability for datasets with a large number of annotations.

Main Results:

  • clusDCA significantly outperformed state-of-the-art algorithms in predicting sparsely annotated functions across yeast, mouse, and human datasets.
  • The method maintained high performance on labels with sufficient annotations.
  • Demonstrated accurate prediction of genes for previously unannotated functional labels by leveraging ontology graph structure and related gene annotations.

Conclusions:

  • clusDCA effectively addresses the challenge of predicting sparsely annotated functions by transferring information between similar labels.
  • The algorithm is scalable and improves prediction accuracy without compromising performance on well-annotated functions.
  • clusDCA demonstrates a robust approach to gene function prediction, highlighting the utility of inter-label similarity in biological networks.