Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

An experimentally derived data set constructed for testing large-scale DNA sequence assembly algorithms

D Seto1, B F Koop, L Hood

  • 1Division of Biology, California Institute of Technology, Pasadena 91125.

Genomics
|March 1, 1993
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Detection of selection signatures in farmed coho salmon (Oncorhynchus kisutch) using dense genome-wide information.

Scientific reports·2021
Same author

Carotenoid pigmentation in salmon: variation in expression at <i>BCO2-l</i> locus controls a key fitness trait affecting red coloration.

Proceedings. Biological sciences·2019
Same author

Examining risk factors for cardiovascular disease among food bank members in Vancouver.

Preventive medicine reports·2018
Same author

NORTH AMERICAN BLACK BEAR mtDNA PHYLOGEOGRAPHY: IMPLICATIONS FOR MORPHOLOGY AND THE HAIDA GWAII GLACIAL REFUGIUM CONTROVERSY.

Evolution; international journal of organic evolution·2017
Same author

Identification of olfactory receptor genes in Atlantic salmon Salmo salar.

Journal of fish biology·2012
Same author

Expression of olfactory receptors in different life stages and life histories of wild Atlantic salmon (Salmo salar).

Molecular ecology·2011
Same journal

Integrating transcriptomics and metabolomics reveals the molecular landscape of sperm maturation driven by regional differentiation in the epididymis of Guizhou-Guiqian semi-fine wool sheep.

Genomics·2026
Same journal

Impact of genotype on histopathology and clinical characters in a Chinese cohort with obstructive hypertrophic cardiomyopathy.

Genomics·2026
Same journal

A novel reusable transcriptome-wide association study workflow used to map key genes linked to important cattle traits.

Genomics·2026
Same journal

The large mitochondrial genome of Syndiclis anlungensis (Lauraceae): Genome structure, comparative analysis, and phylogenetic relationships with other Syndiclis species.

Genomics·2026
Same journal

DeepGEP: Deep learning for gene expression prediction from multi-omics in mammals.

Genomics·2026
Same journal

Molecular features of external Auditory Canal cholesteatoma by microbial metagenomic sequencing.

Genomics·2026
See all related articles

A new standard DNA sequencing dataset is available for algorithm testing. This resource, including raw and refined sequences, aims to improve DNA sequencing project efficiency and data accessibility.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Large-scale DNA sequencing projects generate vast amounts of data.
  • Standardized datasets are crucial for evaluating and developing new bioinformatics algorithms.
  • Existing datasets may not be optimized for testing diverse algorithmic approaches.

Purpose of the Study:

  • To propose a standardized genomic DNA sequencing dataset for algorithm benchmarking.
  • To facilitate the development and validation of computational tools for DNA sequence analysis.
  • To accelerate the availability of high-quality genomic data for research.

Main Methods:

  • Collection and public release of a comprehensive DNA sequence dataset.
  • Division of the dataset into raw (1023 clones) and refined (820 clones) subsets.

Related Experiment Videos

  • Presentation of suggested criteria for DNA sequence data refinement.
  • Main Results:

    • A publicly accessible dataset comprising raw and partially refined DNA sequences from a large-scale project.
    • Guidelines for data refinement to aid in the development of preprocessing and screening algorithms.
    • Establishment of a benchmark for testing various DNA sequencing data analysis algorithms.

    Conclusions:

    • The proposed dataset will serve as a valuable resource for the bioinformatics community.
    • Development of optimized algorithms will expedite large-scale DNA sequencing and data analysis.
    • Improved computational tools will enhance the efficiency of both large-scale and routine sequencing projects.