Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

20.2K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
20.2K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K
Genomics02:02

Genomics

39.3K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
39.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

On the state of protein function prediction: a report on the fourth CAFA challenge.

bioRxiv : the preprint server for biology·2026
Same author

Open and sustainable AI: challenges, opportunities and the road ahead in the life sciences.

Nature methods·2026
Same author

OMAnnotator: a novel approach to building an annotated consensus genome sequence.

Bioinformatics advances·2026
Same author

The missing link in FAIR data policy: biodata resources in life sciences.

Scientific data·2026
Same author

Quest for Orthologs in the era of Data Deluge and AI: Challenges and Innovations in Orthology Prediction and Data Integration.

Journal of molecular evolution·2025
Same author

Structural phylogenetics unravels the evolutionary diversification of communication systems in gram-positive bacteria and their viruses.

Nature structural & molecular biology·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Dec 15, 2025

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.0K

Benchmarking gene ontology function predictions using negative annotations.

Alex Warwick Vesztrocy1,2,3, Christophe Dessimoz1,2,3,4,5

  • 1Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT, UK.

Bioinformatics (Oxford, England)
|July 14, 2020
PubMed
Summary
This summary is machine-generated.

A new benchmark for protein function annotation improves precision by incorporating negative experimental data, addressing limitations in current methods like CAFA. This approach enhances the accuracy of gene function characterization in diverse species.

More Related Videos

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.2K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

Related Experiment Videos

Last Updated: Dec 15, 2025

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

2.0K
Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information
09:37

Navigating MARRVEL, a Web-Based Tool that Integrates Human Genomics and Model Organism Genetics Information

Published on: August 15, 2019

10.2K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K

Area of Science:

  • Genomics and Bioinformatics
  • Computational Biology
  • Protein Function Prediction

Background:

  • Automated gene function characterization is crucial due to the increasing volume and diversity of sequenced species.
  • Current methods, including the Critical Assessment of protein Function Annotation (CAFA), rely heavily on automated electronic methods.
  • Existing benchmarks may underestimate precision due to an incomplete handling of the open world assumption (OWA) and a lack of negative experimental annotations.

Purpose of the Study:

  • To introduce a novel benchmark for evaluating protein function annotation algorithms that is compliant with the open world assumption (OWA).
  • To address the systematic underestimation of precision in existing benchmarks by incorporating negative experimental annotations.

Main Methods:

  • Developed a new benchmark utilizing a balanced test set of both positive and negative protein annotations.
  • Derived negative annotations from expert-curated protein family annotations on phylogenetic trees.
  • Tested the benchmark using baseline methods (naïve, BLAST) and orthology-based methods.

Main Results:

  • The new benchmark significantly increases the average information content of negative annotations.
  • The OWA-compliant benchmark provides a more accurate assessment of prediction precision.
  • The benchmark was successfully tested with various established annotation methods.

Conclusions:

  • The proposed benchmark offers a more robust evaluation of protein function annotation tools.
  • This new benchmark can serve as a valuable complement to existing benchmarks in future CAFA experiments.
  • Improved benchmarking is essential for accurate gene function characterization in the era of large-scale sequencing.