Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Zseq: An Approach for Preprocessing Next-Generation Sequencing Data.

Abedalrhman Alkhateeb1, Luis Rueda1

  • 1School of Computer Science, University of Windsor , Windsor, Canada .

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology
|April 18, 2017
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A survey of contrastive learning methods in molecular representation.

Briefings in bioinformatics·2026
Same author

PPISHES-an enhanced physicochemical approach for predicting protein interaction sites using graph neural networks.

Protein science : a publication of the Protein Society·2025
Same author

Gene expression alterations in testicular biopsies from males with spermatogenesis arrest identified by transcriptome analysis.

PloS one·2025
Same author

Angioplasty and Stenting of Persistent Primitive Trigeminal Artery Stenosis in Acute Ischemic Stroke.

Clinical neuroradiology·2025
Same author

Prediction of obesity levels based on physical activity and eating habits with a machine learning model integrated with explainable artificial intelligence.

Frontiers in physiology·2025
Same author

Two-stage object detection in low-light environments using deep learning image enhancement.

PeerJ. Computer science·2025
Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026
See all related articles

Zseq is a novel linear method that preprocesses next-generation sequencing data by identifying informative genomic sequences and filtering artifacts. This improves read mapping, reduces ambiguous bases, and enhances transcript assembly for better sample discrimination.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Next-generation sequencing (NGS) generates vast genomic data but is prone to artifacts.
  • Preprocessing NGS reads is crucial for accurate downstream analysis.
  • Existing methods struggle with sequence bias, duplication, and ambiguity.

Purpose of the Study:

  • To introduce Zseq, a linear method for efficient preprocessing of NGS data.
  • To enhance the quality of genomic sequences for improved downstream applications.
  • To develop a robust filtering strategy for NGS reads.

Main Methods:

  • Zseq calculates sequence complexity using unique k-mer counts and considers factors like ambiguous nucleotides and GC-content.
  • A z-score threshold is applied to filter low-complexity or biased sequences.
Keywords:
RNA-SEQ analysismachine learningnext-generation sequencingpreprocessing

Related Experiment Videos

  • The method was evaluated using read mapping, reference-based, and de novo transcript assembly.
  • Main Results:

    • Zseq significantly reduces ambiguous bases and improves read mapping rates compared to other methods.
    • Transcripts assembled from Zseq-filtered reads show enhanced discriminative ability for cancer vs. normal samples.
    • De novo assembled transcripts generated longer genomic sequences than those from alternative methods.

    Conclusions:

    • Zseq provides an effective linear approach for NGS data preprocessing.
    • The method improves the quality of genomic data, leading to better transcript assembly and biological insights.
    • Zseq offers a valuable tool for researchers working with large-scale sequencing data.