Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Mar 7, 2026

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

4.4K

Removing duplicate reads using graphics processing units.

Andrea Manconi1, Marco Moscatelli2, Giuliano Armano3

  • 1Institute for Biomedical Technologies, National Research Council, Via Fratelli Cervi, 93, Segrate (Mi), 20090, Italy. andrea.manconi@itb.cnr.it.

BMC Bioinformatics
|February 11, 2017
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Decoding polymorphous low-grade neuroepithelial tumor of the young (PLNTY): Electroclinical features and molecular signatures in epilepsy surgery candidates.

Epilepsia·2026
Same author

Feature representation for explainable CRISPR off-target prediction and base editing efficiency.

Frontiers in bioinformatics·2026
Same author

Repetitive Head Movements: An Unusual Subcortical Myoclonus Presentation.

Neuropediatrics·2026
Same author

A comprehensive European Colorectal Cancer Cohort dataset.

Scientific data·2026
Same author

Uncovering miRNA-Disease Associations Through Graph Based Neural Network Representations.

Biomedicines·2026
Same author

Adrenoleukodystrophy in adults: phenotypic characterisation and natural history in a large cohort.

Journal of neurology, neurosurgery, and psychiatry·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

This study introduces GPU-DupRemoval, a faster method for removing duplicate DNA sequences from sequencing data. It improves accuracy and handles paired-end libraries, essential for reliable genomic analysis.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Polymerase chain reaction (PCR) during DNA library preparation generates duplicate sequences.
  • Duplicate reads can lead to misinterpretation of sequencing data.
  • Existing methods for removing identical and nearly-identical duplicates have limitations.

Purpose of the Study:

  • To develop an improved GPU-accelerated method for removing duplicate sequencing reads.
  • To overcome limitations of previous duplicate removal techniques, including prefix length constraints and paired-end read support.

Main Methods:

  • Developed GPU-DupRemoval, a novel implementation utilizing graphics processing units (GPUs) for parallel processing.
  • Implemented prefix-suffix comparison for identifying potential duplicates.
Keywords:
CUDADuplicate readsGraphics processing unitsNext generation sequencing

More Related Videos

Capturing Chromosome Conformation Across Length Scales
10:15

Capturing Chromosome Conformation Across Length Scales

Published on: January 20, 2023

4.2K
G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.3K

Related Experiment Videos

Last Updated: Mar 7, 2026

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

4.4K
Capturing Chromosome Conformation Across Length Scales
10:15

Capturing Chromosome Conformation Across Length Scales

Published on: January 20, 2023

4.2K
G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome
06:40

G2-seq: A High Throughput Sequencing-based Technique for Identifying Late Replicating Regions of the Genome

Published on: March 22, 2018

6.3K
  • Enhanced clustering algorithms to handle reads without prefix length constraints and analyze large duplicate clusters.
  • Main Results:

    • GPU-DupRemoval supports both single- and paired-end read libraries.
    • The method effectively clusters reads without maximum prefix length limitations.
    • Analyzes large clusters of potential duplicates, improving accuracy.

    Conclusions:

    • GPU-DupRemoval offers faster duplicate read removal compared to existing solutions.
    • The method demonstrates superior performance in the quantity of duplicates removed.
    • Leverages massive parallelization on GPUs for efficient genomic data processing.