An expert system for processing sequence homology data

E L Sonnhammer¹, R Durbin

  • ¹ Sanger Centre, Hinxton, Cambridge, UK.

Proceedings. International Conference on Intelligent Systems for Molecular Biology
|January 1, 1994
Summary
This summary is machine-generated.

A new expert system, HSPcrunch, automatically analyzes large sequence datasets, identifying distant homologies with high accuracy. This tool reduces information overload from BLAST searches, enabling faster and more reliable homology assessment.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Database searching tools like BLAST and FASTA generate excessive data for large-scale homology searches.
  • Manual analysis of extensive sequence homology results is time-consuming and requires expert interpretation.

Purpose of the Study:

  • To develop an automated system for analyzing large sequence homology search outputs.
  • To improve the efficiency and accuracy of identifying distant homologies in biological sequences.
  • To present relevant homology findings concisely, minimizing user effort.

Main Methods:

  • Developed a rule-based expert system, HSPcrunch, integrated with an algorithm to filter biased residue composition matches.
  • HSPcrunch processes output from BLAST suite programs.

  • Employs rules to detect distant similarities by identifying consistent weak matches forming larger gapped alignments, even when BLAST splits them into smaller ungapped alignments.
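
The idea of rescuing weak matches that fit a larger gapped alignment can be illustrated with a minimal Python sketch. This is not HSPcrunch's actual rule set, which the summary does not spell out: the `HSP` record, the `consistent` and `rescue_weak_hsps` functions, and every threshold (`strong`, `weak`, `max_gap`, `max_shift`) are hypothetical. The sketch assumes all HSPs come from the same query/subject pair and keeps a weak HSP only when another HSP follows or precedes it on nearly the same diagonal, as would happen if BLAST had split one gapped alignment into ungapped pieces.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class HSP:
    """One ungapped match (hypothetical record; coordinates are inclusive)."""
    q_start: int
    q_end: int
    s_start: int
    s_end: int
    score: float


def consistent(a, b, max_gap=30, max_shift=10):
    """Return True if b could follow a within one gapped alignment.

    Assumed criteria: b starts after a ends in both sequences, the gap
    between them is short, and the two gaps differ by only a small indel.
    """
    q_gap = b.q_start - a.q_end
    s_gap = b.s_start - a.s_end
    if q_gap < 0 or s_gap < 0:
        return False
    if q_gap > max_gap or s_gap > max_gap:
        return False
    return abs(q_gap - s_gap) <= max_shift


def rescue_weak_hsps(hsps, strong=100.0, weak=40.0):
    """Keep strong HSPs, plus weak HSPs supported by a consistent neighbor."""
    kept = []
    for h in hsps:
        if h.score >= strong:
            kept.append(h)  # strong matches stand on their own
        elif h.score >= weak and any(
            other is not h
            and other.score >= weak
            and (consistent(h, other) or consistent(other, h))
            for other in hsps
        ):
            kept.append(h)  # weak match rescued by a consistent partner
    return kept
```

Under these assumed thresholds, two weak HSPs separated by a 10 to 12 residue gap on neighboring diagonals would both be kept, while an isolated weak HSP far from any other match would be discarded.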

Main Results:

  • HSPcrunch effectively identifies distant homologies by recognizing patterns of weak matches indicative of larger alignments.
  • The system reduces spurious matches while detecting more remote similarities.
  • Empirically derived rules, operating at different scoring levels for weak and medium-weak matches, ensure robust separation of true homologies from false positives.
  • A key rule limits overlapping matches to a query sequence region, significantly reducing output volume.
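
The overlap-limiting rule can be sketched in a similar hedged way. The summary does not state how HSPcrunch decides which overlapping matches to discard, so the function name `limit_coverage`, the `(start, end, score)` tuple layout, and the `max_depth` parameter below are assumptions made for illustration. The sketch processes matches from best to worst score and keeps a match only if some part of its query range is still covered by fewer than `max_depth` already-kept matches.

```python
def limit_coverage(matches, query_len, max_depth=2):
    """Limit how many kept matches may stack on one query region.

    matches   -- list of (start, end, score) tuples, 1-based inclusive
                 query coordinates with start <= end (assumed layout)
    query_len -- length of the query sequence
    max_depth -- hypothetical cap on overlapping matches per position
    """
    depth = [0] * (query_len + 1)  # depth[i] = kept matches covering position i
    kept = []
    # Visit the highest-scoring matches first so they claim coverage first.
    for start, end, score in sorted(matches, key=lambda m: m[2], reverse=True):
        # Keep the match if at least one of its positions is under the cap.
        if min(depth[start:end + 1]) < max_depth:
            kept.append((start, end, score))
            for pos in range(start, end + 1):
                depth[pos] += 1
    return kept
```

With `max_depth=2`, a third match lying entirely inside a region already covered by two higher-scoring matches is dropped, while a match on an otherwise uncovered part of the query is kept regardless of its rank.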

Conclusions:

  • HSPcrunch provides a robust and efficient method for homology detection in large sequence datasets.
  • The expert system enhances the reliability of homology assessment by minimizing information overload and spurious results.
  • This approach facilitates quicker and more accurate identification of evolutionary relationships between sequences.