Blue: correcting sequencing errors using consensus and context.

Paul Greenfield¹, Konsta Duesing², Alexie Papanicolaou²

¹CSIRO Computational Informatics, School of IT, University of Sydney, CSIRO Animal, Food and Health Sciences, Sydney, NSW 2113, and CSIRO Ecosystem Sciences, Canberra, ACT 2601, Australia.

Bioinformatics (Oxford, England)

June 13, 2014

Summary

This summary is machine-generated.

Blue is a bioinformatics tool that corrects sequencing errors in DNA reads. Corrected reads align more accurately and assemble into longer, higher-quality contigs.

Area of Science:

Bioinformatics
Genomics
Computational Biology

Background:

High-throughput sequencing generates large datasets requiring accurate analysis.
Bioinformatics tools like assemblers and aligners depend on high-quality input data.
Sequencing errors (substitutions, insertions, deletions, uncalled bases) reduce downstream analysis accuracy.

Purpose of the Study:

To develop and present Blue, a fast, scalable, and accurate DNA sequence error-correction algorithm.
To create a transparent bioinformatics tool that improves sequence data quality for downstream applications.
To enable correction of various error types across different data formats and organism types.

Main Methods:

Blue utilizes a k-mer consensus and context-based algorithm for error detection and correction.
The algorithm corrects substitution, deletion, and insertion errors, as well as uncalled bases.
It processes data in FASTQ and FASTA formats, corrects quality scores, and maintains read pairing.
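
The general idea behind k-mer consensus correction can be illustrated with a minimal Python sketch. This is not Blue's implementation: the function names (`count_kmers`, `correct_read`), the `min_count` threshold, and the toy reads are all assumptions made for illustration. The sketch handles only substitutions and uncalled bases (`N`), and ignores insertions, deletions, quality scores, and read pairing.

```python
from collections import Counter

def count_kmers(reads, k):
    """Tally every k-mer across all reads (hypothetical helper)."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def correct_read(read, counts, k, min_count=2):
    """Scan left to right; when a k-mer is rare, try replacing its last
    base with the alternative that the consensus supports best.

    Limitation of this sketch: an error in the first k-1 bases of a read
    is never the last base of any k-mer, so it is left uncorrected.
    """
    bases = list(read)
    for i in range(len(bases) - k + 1):
        kmer = "".join(bases[i:i + k])
        if counts[kmer] >= min_count:
            continue  # k-mer is well supported; nothing to do
        prefix = kmer[:-1]
        # Pick the base whose resulting k-mer is most frequent in the data.
        best = max("ACGT", key=lambda b: counts[prefix + b])
        if counts[prefix + best] >= min_count:
            bases[i + k - 1] = best
    return "".join(bases)

# Toy data (assumed): five clean copies of a sequence plus one read
# carrying a single substitution (A -> T at position 8).
truth = "ACGTTGCAAGGCT"
noisy = "ACGTTGCATGGCT"
counts = count_kmers([truth] * 5 + [noisy], k=5)

print(correct_read(noisy, counts, k=5))  # prints ACGTTGCAAGGCT
```

In this sketch the preceding k-1 bases serve as the context, and the k-mer counts serve as the consensus: a base is changed only when the replacement yields a k-mer seen at least `min_count` times. A production tool would also need to choose `k` and the threshold from the data and guard against overcorrecting genuine low-coverage variants.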

Main Results:

Blue demonstrates higher accuracy than other published algorithms on tested datasets.
It results in more accurate read alignments and the assembly of longer, higher-quality contigs.
The algorithm is memory-efficient, scalable, and faster than existing tools for large datasets.

Conclusions:

Blue effectively corrects diverse sequencing errors, enhancing the quality of genomic data.
Its performance and scalability make it suitable for large-scale sequencing projects.
The ability to use cross-correction with different data types offers significant advantages for genome assembly and finishing.