Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Filtering erroneous protein annotation.

D Wieser1, E Kretschmann, R Apweiler

  • 1Sequence Database Group, European Bioinformatics Institute, Cambridge, UK.

Bioinformatics (Oxford, England)
|July 21, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Gene Ontology annotations and resources.

Nucleic acids research·2012
Same author

Refractive index of LiF films as a function of time.

Applied optics·2010
Same author

IntAct--open source resource for molecular interaction data.

Nucleic acids research·2006
Same author

The transcriptional landscape of the mammalian genome.

Science (New York, N.Y.)·2005
Same author

InterProScan: protein domains identifier.

Nucleic acids research·2005
Same author

E-MSD: an integrated data resource for bioinformatics.

Nucleic acids research·2004
Same journal

Biomedical Concept Recognition with Error-aware Negative-enhanced Ranking Framework.

Bioinformatics (Oxford, England)·2026
Same journal

TEDLH: Domain HMMs for sensitive detection of remote homologues.

Bioinformatics (Oxford, England)·2026
Same journal

PLNFGL: Joint Estimation of Multi-Condition Gene Networks from Single-cell RNA-seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
See all related articles

Xanthippe, a post-processing system, effectively identifies and flags annotation errors in protein data, enhancing the reliability of UniProt (Universal Protein Resource) entries. This data-mining approach improves automated annotation accuracy and data quality for researchers.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Proteomics

Background:

  • Automated annotation of protein data for UniProt (Universal Protein Resource) is set to increase significantly, with over 500,000 TrEMBL entries expected to be enhanced.
  • Automated annotation pipelines and other data sources can introduce errors, compromising the reliability of protein information.

Purpose of the Study:

  • To present Xanthippe, a novel post-processing system designed to detect and correct errors in automatically generated protein annotations.
  • To enhance the overall quality and reliability of protein data within the UniProt database.

Main Methods:

  • Xanthippe employs a simple exclusion mechanism combined with a decision tree approach utilizing the C4.5 data-mining algorithm.
  • The system was cross-validated against Swiss-Prot data to assess its error detection capabilities.

Related Experiment Videos

Main Results:

  • Xanthippe successfully detects and flags a significant portion of annotation errors, improving data reliability.
  • The system effectively filters out errors in protein descriptions, comments, and keywords, as confirmed by Swiss-Prot cross-validation.
  • Xanthippe can be integrated with predictive systems to enhance precision or recall in automated annotation.

Conclusions:

  • Xanthippe significantly increases the reliability of protein annotations in UniProt, addressing the challenge of erroneous data.
  • The system offers flexibility in improving automated annotation by balancing precision and recall.
  • The Xanthippe rules and application are accessible for review at http://www.ebi.uniprot.org/.