Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Dictionary-driven protein annotation.

Isidore Rigoutsos1, Tien Huynh, Aris Floratos

  • 1Bioinformatics and Pattern Discovery Group, IBM TJ Watson Research Center, Yorktown Heights, NY 10598, USA. rigoutso@us.ibm.com

Nucleic Acids Research
|August 31, 2002
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Publisher Correction: Tumor transcriptional state predicts survival in immune-checkpoint-blockade-treated glioblastoma.

Nature cancer·2026
Same author

Quantum ensembling methods for healthcare and life science.

Briefings in bioinformatics·2026
Same author

Tumor transcriptional state predicts survival in immune-checkpoint-blockade-treated glioblastoma.

Nature cancer·2026
Same author

Remics: a redescription-based framework for multi-omics analysis.

Frontiers in cell and developmental biology·2026
Same author

Genomic modifiers of malignant and neurodevelopmental phenotypes in individuals with PTEN hamartoma tumor syndrome.

NPJ genomic medicine·2026
Same author

In-Pocket 3D Graphs Enhance Ligand-Target Compatibility in Generative Small-Molecule Creation: A Dopamine D2 Receptor Model System.

The journal of physical chemistry. B·2026
Same journal

Correction to 'New origin firing is inhibited by APC/CCdh1 activation in S-phase after severe replication stress'.

Nucleic acids research·2026
Same journal

VeloRM: disentangling pre- and post-splicing RNA modification dynamics at single-cell resolution.

Nucleic acids research·2026
Same journal

Accessibility of telomeric overhangs to stabilizing small-molecule ligands.

Nucleic acids research·2026
Same journal

Multivalent interactions mediate SNAIL transcription factor stimulation of the nucleosome deacetylase activity of the CoREST complex.

Nucleic acids research·2026
Same journal

Genome-wide mapping of DNA G-quadruplexes in Trypanosoma brucei chromatin reveals enrichment in coding regions and transcription start sites.

Nucleic acids research·2026
Same journal

Correction to 'The Gene Ontology knowledgebase in 2026'.

Nucleic acids research·2026
See all related articles

This study introduces a novel computational method for rapid, automated protein annotation using a Bio-Dictionary. The approach efficiently predicts protein properties, aiding in the analysis of vast sequence data.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Increasing volume of protein sequence data necessitates automated annotation tools.
  • Existing methods struggle with speed, exhaustiveness, and objectivity.

Purpose of the Study:

  • To develop an automated computational method for protein property determination directly from amino acid sequences.
  • To create a system that can rapidly and exhaustively annotate individual sequences and entire genomes.

Main Methods:

  • Utilized a Bio-Dictionary of amino acid patterns to capture evolutionary conserved signals.
  • Employed a weighted, position-specific scoring scheme robust to database biases.
  • Developed a single-pass method for analyzing query sequences against public databases.

Related Experiment Videos

Main Results:

  • The method determines local/global similarities, sequence likeness across domains (archaeal, bacterial, eukaryotic, viral), secondary structure, subcellular localization (cytoplasmic, transmembrane, extracellular), and functional sites (binding domains, active sites, post-translational modification sites, signal peptides).
  • Demonstrated exhaustive, objective, and rapid annotation capabilities on individual sequences and complete genomes.
  • Annotated over 70 complete genomes, made publicly available online.

Conclusions:

  • The developed computational approach provides an efficient and comprehensive solution for protein sequence annotation.
  • The Bio-Dictionary and scoring scheme enable accurate prediction of diverse protein properties.
  • The availability of genome-wide annotations facilitates further biological research.