Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Correcting sequencing errors in DNA coding regions using a dynamic programming approach

Y Xu1, R J Mural, E C Uberbacher

  • 1Engineering Physics and Mathematics Division, Oak Ridge National Laboratory, TN 37831-6364, USA.

Computer Applications in the Biosciences : CABIOS
|April 1, 1995
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Computational tools for protein modeling.

Current protein & peptide science·2002
Same author

ARTEMIS: a tool for displaying and annotating DNA sequence.

Briefings in bioinformatics·2001
Same author

The sequence of the human genome.

Science (New York, N.Y.)·2001
Same author

Sequence-structure specificity of a knowledge based energy function at the secondary structure level.

Bioinformatics (Oxford, England)·2000
Same author

Current status of computational gene finding: a perspective.

Methods in enzymology·1999
Same author

The Genome Channel: a browser to a uniform first-pass annotation of genomic DNA.

Trends in genetics : TIG·1999
Same journal

DCA: an efficient implementation of the divide-and-conquer approach to simultaneous multiple sequence alignment.

Computer applications in the biosciences : CABIOS·1998
Same journal

Two applications to facilitate the viewing of database search result files on the Macintosh.

Computer applications in the biosciences : CABIOS·1998
Same journal

BioWish: a molecular biology command extension to Tcl/Tk.

Computer applications in the biosciences : CABIOS·1998
Same journal

The Sequence Alerting Server--a new WEB server.

Computer applications in the biosciences : CABIOS·1998
Same journal

A software tool for the analysis of mass spectrometric disulfide mapping experiments.

Computer applications in the biosciences : CABIOS·1998
Same journal

SAMBA: hardware accelerator for biological sequence comparison.

Computer applications in the biosciences : CABIOS·1998
See all related articles

This study introduces an algorithm to detect and correct DNA sequencing errors like indels, improving gene identification efficiency and reducing costs. The method enhances DNA sequence analysis, making low-redundancy data more informative for genetic research.

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • DNA sequencing technologies are prone to errors, particularly insertions and deletions (indels).
  • High-redundancy sequencing is often required for accurate gene identification, increasing costs and reducing efficiency.
  • Existing DNA sequence analysis systems may struggle with error-containing data.

Purpose of the Study:

  • To develop and evaluate an algorithm for detecting and correcting indel sequencing errors in DNA coding regions.
  • To enhance the informativeness of single-pass or low-redundancy DNA sequence data.
  • To reduce the need for high-redundancy sequencing, thereby improving efficiency and lowering genome sequencing costs.

Main Methods:

  • The algorithm identifies sequencing errors by detecting shifts in the statistically preferred reading frame within putative coding regions.

Related Experiment Videos

  • It corrects errors by inserting neutral bases at perceived reading frame transition points to maintain frame consistency.
  • The algorithm was implemented as a subsystem of the GRAIL DNA sequence analysis system.
  • Main Results:

    • The algorithm detected and corrected 76% of randomly generated indels in a test set of 68 human DNA sequences.
    • The average distance between the actual and predicted indel position was 9.4 bases.
    • GRAIL's coding message prediction accuracy improved from 69% to 89% with the error-correction subsystem, with a similar false positive rate.

    Conclusions:

    • The developed algorithm effectively detects and corrects indel sequencing errors in DNA coding regions.
    • This error-correction capability significantly improves the accuracy of gene prediction from noisy sequence data.
    • The algorithm shows promise for enhancing DNA sequence analysis efficiency and reducing overall genome sequencing costs.