Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

11.6K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
11.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comprehensive molecular profiling of Hispanic, Black, Asian, and White patients with atopic dermatitis via tape strip RNA sequencing and serum proteomics.

The Journal of allergy and clinical immunology·2026
Same author

Orthostatic headache and aseptic cerebrospinal fluid pleocytosis from an intrasacral meningocele.

European spine journal : official publication of the European Spine Society, the European Spinal Deformity Society, and the European Section of the Cervical Spine Research Society·2026
Same author

A Structured Framework for Electronic Health Record Optimization: The EASY Program.

Applied clinical informatics·2026
Same author

Why is this journal called Protoplasma? A history of protoplasm theory and the divisions in cell biology before 1926.

Protoplasma·2026
Same author

Long-Term Real-World Evaluation of Abrocitinib in Moderate-to-Severe Atopic Dermatitis Across a Diverse Patient Population in a Multi-site Hospital System.

American journal of clinical dermatology·2026
Same author

Skin tape strips identify age-specific immune, metabolic, and barrier dysregulation signatures in children, adolescents, and adults with allergic asthma.

The Journal of allergy and clinical immunology·2026
Same journal

Association between intestinal functional disorders and anal fistula: evidence from a retrospective case-control study.

PeerJ·2026
Same journal

Automated recognition of Meso-Cenozoic foraminifera from Senegalese sedimentary deposits using convolutional neural networks.

PeerJ·2026
Same journal

Genome-wide analysis of <i>HSP70</i> gene superfamily in kelp (<i>Saccharina japonica</i>): identification, characterization, and heat stress-responsive expression profiles.

PeerJ·2026
Same journal

Morphological and molecular evidence of the Antarctic sleeper shark <i>Somniosus antarcticus</i> (Somniosidae) in northern Chile.

PeerJ·2026
Same journal

Stroboscopic balance training enhances dynamic stability and postural control in collegiate badminton players: a randomized controlled trial.

PeerJ·2026
Same journal

Frequent exposure to biologics is associated with small intestinal bacterial overgrowth in patients with Crohn's disease: a retrospective case-control study.

PeerJ·2026
See all related articles

Related Experiment Video

Updated: Jan 1, 2026

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.5K

Algorithms for efficiently collapsing reads with Unique Molecular Identifiers.

Daniel Liu1,2

  • 1Torrey Pines High School, San Diego, CA, United States of America.

Peerj
|December 25, 2019
PubMed
Summary
This summary is machine-generated.

This study introduces UMICollapse, a new method for removing PCR duplicates using Unique Molecular Identifiers (UMIs). It efficiently handles errors and scales to large datasets, improving sequencing data accuracy.

Keywords:
BAMDeduplicateFASTQRNA-seqUnique molecular identifier

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K
In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses
12:23

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses

Published on: September 7, 2022

2.0K

Related Experiment Videos

Last Updated: Jan 1, 2026

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.5K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K
In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses
12:23

In Vitro Selection of Aptamers to Differentiate Infectious from Non-Infectious Viruses

Published on: September 7, 2022

2.0K

Area of Science:

  • Bioinformatics
  • Genomics
  • Molecular Biology

Background:

  • Unique Molecular Identifiers (UMIs) are crucial for identifying and removing PCR duplicates in high-throughput sequencing experiments.
  • Existing deduplication tools struggle with substitution errors and inefficient pairwise comparisons, limiting scalability for large datasets.

Purpose of the Study:

  • To develop a more efficient and scalable method for Unique Molecular Identifier (UMI) deduplication.
  • To address limitations in current tools, particularly regarding substitution errors and computational complexity.

Main Methods:

  • Reformulated the UMI deduplication problem to enable advanced optimizations and data structures.
  • Implemented a novel tool, UMICollapse, leveraging these optimizations for efficient processing.

Main Results:

  • UMICollapse successfully deduplicated over one million unique UMIs (length 9) at a single alignment position in approximately 26 seconds.
  • The tool operates efficiently on a single thread with low memory usage (<10 GB).

Conclusions:

  • A novel formulation of the UMI deduplication problem has been presented.
  • The new approach allows for faster and more sophisticated solutions using advanced data structures.