CARE 2.0: reducing false-positive sequencing error corrections using machine learning.

Felix Kallenborn¹, Julian Cascitti², Bertil Schmidt²

¹Department of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. kallenborn@uni-mainz.de.

BMC Bioinformatics | June 13, 2022

Summary

This summary is machine-generated.

Error correction tools for next-generation sequencing data can introduce false-positive corrections. CARE 2.0 significantly reduces them using a machine learning approach, improving downstream analyses such as k-mer statistics and de novo assembly.

Keywords:

Error correction, Machine learning, Next-generation sequencing

Area of Science:

Genomics
Bioinformatics

Background:

Next-generation sequencing (NGS) reads contain errors that are commonly corrected in a preprocessing step.
Existing tools correct most errors but introduce false positives, impacting downstream analyses.
There is a need for more precise sequencing error correction methods.

Purpose of the Study:

To develop a more precise sequencing read error correction tool.
To minimize false-positive corrections while maintaining high true-positive rates.

Main Methods:

Developed CARE 2.0, a context-aware read error correction tool.
Utilized multiple sequence alignment and a random decision forest classifier trained on Illumina data.
Implemented in C++/CUDA for CPU and GPU execution.
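
The idea of correcting a read from a multiple sequence alignment can be sketched as follows. This is a toy illustration only, not CARE 2.0's implementation: the function name `correct_anchor`, the `-` gap convention, and the fixed `min_support` threshold are all assumptions made for the example, and the threshold merely stands in for the trained random forest classifier that decides whether to accept a correction.

```python
from collections import Counter

def correct_anchor(anchor, aligned_reads, min_support=0.8):
    """Correct `anchor` column by column from reads aligned to it.

    `aligned_reads` are strings of the same length as `anchor`; '-' marks
    positions a read does not cover. A base is replaced by the column
    consensus only if the consensus differs from the anchor base and its
    share of the column is at least `min_support` (hypothetical stand-in
    for a learned classifier).
    """
    corrected = []
    for i, base in enumerate(anchor):
        # Column = anchor base plus every overlapping read base at position i.
        column = [base] + [r[i] for r in aligned_reads if r[i] != "-"]
        consensus, n = Counter(column).most_common(1)[0]
        support = n / len(column)
        if consensus != base and support >= min_support:
            corrected.append(consensus)
        else:
            corrected.append(base)
    return "".join(corrected)

reads = ["ACGTACGT", "ACGTACGT", "ACGTACGT", "-CGTACG-", "ACGTAC--"]
print(correct_anchor("ACGAACGT", reads))  # ACGTACGT
```

Raising `min_support` makes the sketch more conservative: fewer corrections are applied, which trades true positives for fewer false positives.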

Main Results:

CARE 2.0 achieved up to two orders of magnitude fewer false positives than state-of-the-art tools.
Maintained comparable true-positive correction rates.
Demonstrated improved de novo assembly and k-mer analysis on simulated and real-world data.
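
To make the terms concrete, the sketch below counts per-base outcomes for a single read when the true sequence is known, as it is for simulated data. The function name `classify_corrections` and the choice to count a miscorrected error base as a false negative are assumptions of this example; the paper's exact evaluation protocol is not given in this summary.

```python
def classify_corrections(raw, corrected, truth):
    """Count per-base correction outcomes for one read.

    TP: erroneous base changed to the true base.
    FP: correct base changed to a wrong base.
    FN: erroneous base still wrong after correction.
    TN: correct base left unchanged.
    """
    if not (len(raw) == len(corrected) == len(truth)):
        raise ValueError("sequences must have equal length")
    counts = {"TP": 0, "FP": 0, "FN": 0, "TN": 0}
    for r, c, t in zip(raw, corrected, truth):
        if r != t:  # the raw base was a sequencing error
            counts["TP" if c == t else "FN"] += 1
        else:       # the raw base was already correct
            counts["FP" if c != t else "TN"] += 1
    return counts

# One error fixed at index 3, one new error introduced at index 6.
print(classify_corrections(raw="ACGAACGT", corrected="ACGTACCT", truth="ACGTACGT"))
# {'TP': 1, 'FP': 1, 'FN': 0, 'TN': 6}
```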

Conclusions:

CARE 2.0 significantly reduces false-positive error corrections, enhancing data quality.
Machine learning approaches are effective for improving read error correction.
The tool's precision benefits downstream genomic analyses, and the tool is publicly available.