Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mismatch Repair01:36

Mismatch Repair

38.1K
Overview
38.1K
Mismatch Repair01:20

Mismatch Repair

5.4K
Organisms are capable of detecting and fixing nucleotide mismatches that occur during DNA replication. This sophisticated process requires identifying the new strand and replacing the erroneous bases with correct nucleotides. Mismatch repair is coordinated by many proteins in both prokaryotes and eukaryotes.
The Mutator Protein Family Plays a Key Role in DNA Mismatch Repair
The human genome has more than 3 billion base pairs of DNA per cell. Prior to cell division, that vast amount of genetic...
5.4K
Proofreading01:31

Proofreading

7.5K
Synthesis of new DNA molecules is carried out by the enzyme DNA polymerase, which adds nucleotides on the daughter strand complementary to the template DNA strand. DNA polymerase has a higher affinity to add the correct base and ensures fidelity during DNA replication. Furthermore,  it exhibits proofreading activity during replication, using an exonuclease domain that cuts off incorrect nucleotides from the nascent DNA strand.
Errors During Replication are Corrected by the DNA Polymerase...
7.5K
Proofreading01:43

Proofreading

52.0K
Overview
52.0K
Genome Copying Errors02:46

Genome Copying Errors

4.3K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
4.3K
Next-generation Sequencing03:00

Next-generation Sequencing

87.9K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
87.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pilliga Ghosts: The Novel Fungi of the Rivers, Creeks, Lakes, and Dams of the Narrabri Region, Australia.

Environmental microbiology reports·2026
Same author

BINge: Multispecies Ortholog Clustering for Differential Gene Expression Analyses.

Molecular ecology resources·2026
Same author

MLDeCNV: A machine learning approach for predicting copy number variation types in plant genomes.

Computers in biology and medicine·2025
Same author

Democratising high performance computing for bioinformatics through serverless cloud computing: A case study on CRISPR-Cas9 guide RNA design with Crackling Cloud.

PLoS computational biology·2025
Same author

Bioinformatic assessment of allergenicity, virulence, and secondary metabolites in Aspergillus species for industrial applications.

Computational biology and chemistry·2025
Same author

Correction: Filippou et al. Transcriptomic Analysis Reveals Molecular Mechanisms Underpinning Mycovirus-Mediated Hypervirulence in <i>Beauveria bassiana</i> Infecting <i>Tenebrio molitor</i>. <i>J. Fungi</i> 2025, <i>11</i>, 63.

Journal of fungi (Basel, Switzerland)·2025
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Apr 28, 2026

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms
09:30

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms

Published on: September 13, 2018

7.9K

Blue: correcting sequencing errors using consensus and context.

Paul Greenfield1, Konsta Duesing2, Alexie Papanicolaou2

  • 1CSIRO Computational Informatics, School of IT, University of Sydney, CSIRO Animal, Food and Health Sciences, Sydney, NSW 2113, and CSIRO Ecosystem Sciences, Canberra, ACT 2601, Australia CSIRO Computational Informatics, School of IT, University of Sydney, CSIRO Animal, Food and Health Sciences, Sydney, NSW 2113, and CSIRO Ecosystem Sciences, Canberra, ACT 2601, Australia.

Bioinformatics (Oxford, England)
|June 13, 2014
PubMed
Summary
This summary is machine-generated.

Blue is a novel bioinformatics tool that corrects sequencing errors in DNA data. It improves read accuracy and contig assembly, making it essential for high-quality genome sequencing.

More Related Videos

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

14.6K
Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

10.9K

Related Experiment Videos

Last Updated: Apr 28, 2026

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms
09:30

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms

Published on: September 13, 2018

7.9K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

14.6K
Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

10.9K

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • High-throughput sequencing generates large datasets requiring accurate analysis.
  • Bioinformatics tools like assemblers and aligners depend on high-quality input data.
  • Sequencing errors (substitutions, insertions, deletions, uncalled bases) reduce downstream analysis accuracy.

Purpose of the Study:

  • To develop and present Blue, a fast, scalable, and accurate DNA sequence error-correction algorithm.
  • To create a transparent bioinformatics tool that improves sequence data quality for downstream applications.
  • To enable correction of various error types across different data formats and organism types.

Main Methods:

  • Blue utilizes a k-mer consensus and context-based algorithm for error detection and correction.
  • The algorithm corrects substitution, deletion, and insertion errors, as well as uncalled bases.
  • It processes data in FASTQ and FASTA formats, corrects quality scores, and maintains read pairing.

Main Results:

  • Blue demonstrates higher accuracy than other published algorithms on tested datasets.
  • It results in more accurate read alignments and the assembly of longer, higher-quality contigs.
  • The algorithm is memory-efficient, scalable, and faster than existing tools for large datasets.

Conclusions:

  • Blue effectively corrects diverse sequencing errors, enhancing the quality of genomic data.
  • Its performance and scalability make it suitable for large-scale sequencing projects.
  • The ability to use cross-correction with different data types offers significant advantages for genome assembly and finishing.