Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A hybrid gazelle optimization and reptile search algorithm for optimal clustering in wireless sensor networks.

Scientific reports·2025
Same author

Towards precision oncology: a multi-level cancer classification system integrating liquid biopsy and machine learning.

BioData mining·2025
Same author

A Multi-model Deep Learning Architecture for Diagnosing Multi-class Skin Diseases.

Journal of imaging informatics in medicine·2024
Same author

Fine tuning deep learning models for breast tumor classification.

Scientific reports·2024
Same author

Precision cancer classification using liquid biopsy and advanced machine learning techniques.

Scientific reports·2024
Same author

Unboxing machine learning models for concrete strength prediction using XAI.

Scientific reports·2023
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jul 4, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.1K

MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.

Amira Sami1, Sara El-Metwally2,3, M Z Rashad1

  • 1Department of Computer Science, Faculty of Computers and Information, Mansoura University, P.O. Box: 35516, Mansoura, Egypt.

BMC Bioinformatics
|February 6, 2024
PubMed
Summary
This summary is machine-generated.

Machine learning effectively filters errors in next-generation sequencing (NGS) reads, improving data quality and genomic assembly. This approach enhances accuracy in genomics and personalized medicine research.

Keywords:
ClassificationError filtrationFeature extractionMachine learningNext-generation sequencing

More Related Videos

Targeted DNA Methylation Analysis by Next-generation Sequencing
08:38

Targeted DNA Methylation Analysis by Next-generation Sequencing

Published on: February 24, 2015

37.1K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.2K

Related Experiment Videos

Last Updated: Jul 4, 2025

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.1K
Targeted DNA Methylation Analysis by Next-generation Sequencing
08:38

Targeted DNA Methylation Analysis by Next-generation Sequencing

Published on: February 24, 2015

37.1K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.2K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Next-generation sequencing (NGS) generates vast biological data, but increased speed and affordability lead to more errors.
  • Data quality assurance is shifting from hardware to software preprocessing stages.
  • Effective error detection and filtration are crucial for reliable NGS data analysis.

Purpose of the Study:

  • To introduce MAC-ErrorReads, a novel Machine learning-Assisted Classifier for filtering erroneous NGS reads.
  • To evaluate the performance of machine learning algorithms in identifying and removing low-quality reads.
  • To enhance the accuracy and reliability of genomic data.

Main Methods:

  • Developed MAC-ErrorReads, a binary classification model using five supervised machine learning algorithms.
  • Extracted features using Term Frequency-Inverse Document Frequency (TF_IDF) from diverse biological datasets.
  • Trained and tested models on datasets including E. coli, S. aureus, H. Chr14, Arabidopsis thaliana Chr1, and M. zebra.

Main Results:

  • Naive Bayes (NB) within MAC-ErrorReads demonstrated robust performance across datasets with high accuracy, precision, recall, and F1-scores.
  • MAC-ErrorReads NB achieved a 38.69% alignment rate for S. aureus reads, outperforming many error correction tools.
  • MAC-ErrorReads showed competitive alignment rates (e.g., >99% for H. Chr14, ~90% for A. thaliana Chr1, ~83% for M. zebra) compared to other tools.

Conclusions:

  • Machine learning effectively filters NGS reads, significantly improving assembly quality and genomic coverage.
  • Integrating artificial intelligence with genomics enhances NGS data quality and downstream analysis.
  • This approach holds promise for advancing genetics, genomics, and personalized medicine research.