Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

11.8K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.8K
Next-generation Sequencing03:00

Next-generation Sequencing

92.5K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
92.5K
Mismatch Repair01:36

Mismatch Repair

40.6K
Overview
40.6K
Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

2.3K
Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
2.3K
Genome Copying Errors02:46

Genome Copying Errors

4.4K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
4.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

RMapAlign3N: fast mapping of 3N-Reads.

Bioinformatics advances·2025
Same author

GPU-accelerated homology search with MMseqs2.

Nature methods·2025
Same author

RabbitSketch: a high-performance sketching library for genome analysis.

Bioinformatics (Oxford, England)·2025
Same author

CAREx: context-aware read extension of paired-end sequencing data.

BMC bioinformatics·2024
Same author

RabbitKSSD: accelerating genome distance estimation on modern multi-core architectures.

Bioinformatics (Oxford, England)·2023
Same author

MetaTransformer: deep metagenomic sequencing read classification using self-attention models.

NAR genomics and bioinformatics·2023
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Sep 8, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K

CARE 2.0: reducing false-positive sequencing error corrections using machine learning.

Felix Kallenborn1, Julian Cascitti2, Bertil Schmidt2

  • 1Department of Computer Science, Johannes Gutenberg University Mainz, Mainz, Germany. kallenborn@uni-mainz.de.

BMC Bioinformatics
|June 13, 2022
PubMed
Summary
This summary is machine-generated.

Next-generation sequencing error correction tools can introduce false positives. CARE 2.0 significantly reduces these errors using a machine learning approach, improving downstream analysis like k-mer statistics and de novo assembly.

Keywords:
Error correctionMachine learningNext-generation sequencing

More Related Videos

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms
09:30

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms

Published on: September 13, 2018

9.6K
Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

8.9K

Related Experiment Videos

Last Updated: Sep 8, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.2K
Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms
09:30

Genome-wide Surveillance of Transcription Errors in Eukaryotic Organisms

Published on: September 13, 2018

9.6K
Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example
05:45

Validating Whole Genome Nanopore Sequencing, using Usutu Virus as an Example

Published on: March 11, 2020

8.9K

Area of Science:

  • Genomics
  • Bioinformatics

Background:

  • Next-generation sequencing (NGS) requires preprocessing for error correction.
  • Existing tools correct most errors but introduce false positives, impacting downstream analyses.
  • There is a need for more precise sequencing error correction methods.

Purpose of the Study:

  • To develop a more precise sequencing read error correction tool.
  • To minimize false-positive corrections while maintaining high true-positive rates.

Main Methods:

  • Developed CARE 2.0, a context-aware read error correction tool.
  • Utilized multiple sequence alignment and a random decision forest classifier trained on Illumina data.
  • Implemented in C++/CUDA for CPU and GPU execution.

Main Results:

  • CARE 2.0 achieved up to two orders of magnitude fewer false positives than state-of-the-art tools.
  • Maintained comparable true-positive correction rates.
  • Demonstrated improved de novo assembly and k-mer analysis on simulated and real-world data.

Conclusions:

  • CARE 2.0 significantly reduces false-positive sequencing errors, enhancing data quality.
  • Machine learning approaches are effective for improving read error correction.
  • The tool's precision benefits downstream genomic analyses and is publicly available.