Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Issues in searching molecular sequence databases

S F Altschul1, M S Boguski, W Gish

  • 1National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894.

Nature Genetics
|February 1, 1994
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Expression of the Escherichia coli glutamate dehydrogenase gene in the cyanobacterium Synechococcus PCC6301 causes ammonium tolerance.

Plant molecular biology·2013
Same author

Isolation and characterization of yeast DNA repair genes : I. Cloning of the RAD52 gene.

Current genetics·2013
Same author

Rural medicine: implications for the future.

Canadian family physician Medecin de famille canadien·2011
Same author

[Not Available].

Canadian family physician Medecin de famille canadien·2011
Same author

Sequence databases: integrated information retrieval and data submission.

Current protocols in human genetics·2008
Same author

Sequence databases: integrated information retrieval and data submission.

Current protocols in molecular biology·2008
Same journal

Lineage tracing from cellular heritage to disease destiny.

Nature genetics·2026
Same journal

Multiomics analysis of primary metabolism reveals the genetic basis of nitrogen partitioning modulated by ZmAVT1A-1 in maize.

Nature genetics·2026
Same journal

No evidence of immunosurveillance in mutation-hotspot-driven clonal hematopoiesis.

Nature genetics·2026
Same journal

Near-perfect genome sequencing in medical genetics.

Nature genetics·2026
Same journal

Three decades of cancer genetics.

Nature genetics·2026
Same journal

Advances and challenges of splicing prediction with AI.

Nature genetics·2026
See all related articles

Sequence similarity searches aid molecular biologists in identifying DNA coding regions and understanding gene/protein functions. This study explores crucial associated issues beyond algorithms, like scoring systems and database redundancy, for maximizing these tools' potential.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Molecular Biology

Background:

  • Sequence similarity search programs are vital for molecular biologists.
  • These tools help identify DNA coding regions and infer gene/protein structure and function.
  • Existing research often focuses on algorithms and speed, overlooking other critical factors.

Purpose of the Study:

  • To highlight important, often overlooked, factors influencing the effectiveness of sequence similarity searches.
  • To provide a comprehensive overview of considerations beyond core algorithms.

Main Methods:

  • Discussion and analysis of associated issues in sequence similarity searching.
  • Review of scoring systems, statistical significance of alignments, and sequence masking.

Related Experiment Videos

  • Examination of database redundancy and network access considerations.
  • Main Results:

    • The choice of scoring systems significantly impacts alignment quality.
    • Statistical significance assessment is crucial for reliable similarity detection.
    • Sequence redundancy in databases can affect search outcomes.
    • Effective masking of non-informative regions enhances search accuracy.

    Conclusions:

    • Optimizing sequence similarity searches requires careful consideration of scoring, statistical validation, and data preprocessing.
    • Addressing database redundancy and network access is essential for efficient and reliable results.
    • A holistic approach, beyond just algorithms, is key to fully leveraging these powerful bioinformatics tools.