Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Halvade-RNA: Parallel variant calling from transcriptomic data using MapReduce.

Dries Decap1,2, Joke Reumers3,2, Charlotte Herzeel4,2

  • 1Department of Information Technology, IDLab, Ghent University - imec, Ghent, Belgium.

Plos One
|March 31, 2017
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Data-driven hypothesis discovery from disease trajectories in multiple sclerosis.

Frontiers in immunology·2026
Same author

Run-length compressed metagenomic read classification with SMEM-finding and tagging.

iScience·2026
Same author

Columba: fast approximate pattern matching with optimized search schemes.

Bioinformatics (Oxford, England)·2025
Same author

b-move: faster lossless approximate pattern matching in a run-length compressed index.

Algorithms for molecular biology : AMB·2025
Same author

Run-length compressed metagenomic read classification with SMEM-finding and tagging.

bioRxiv : the preprint server for biology·2025
Same author

ELLIPSIS: robust quantification of splicing in scRNA-seq.

Bioinformatics (Oxford, England)·2025
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Halvade-RNA is a new parallel pipeline for RNA-seq variant calling. It significantly reduces processing time from 28 hours to 2 hours, making RNA-seq data analysis more cost-effective.

Area of Science:

  • Genomics
  • Bioinformatics

Background:

  • Next-generation sequencing (NGS) generates vast amounts of DNA-seq and RNA-seq data.
  • Efficient genetic variant calling is crucial for analyzing NGS data.
  • Current RNA-seq variant calling pipelines lack efficient parallelization.

Purpose of the Study:

  • Introduce Halvade-RNA, a parallel, multi-node pipeline for RNA-seq variant calling.
  • Optimize runtime and cost-effectiveness for processing large RNA-seq datasets.
  • Adhere to GATK Best Practices for variant calling.

Main Methods:

  • Utilizes the MapReduce programming model for parallel data stream management.
  • Runs multiple instances of tools like STAR and GATK concurrently.
  • Implemented in Java using the Hadoop MapReduce 2.0 API.

Related Experiment Videos

Main Results:

  • Reduces RNA-seq variant calling runtime from approximately 28 hours to 2 hours on a small cluster.
  • Achieves significant runtime reduction even on a single multi-core workstation.
  • Demonstrates cost-effectiveness for RNA-seq data processing.

Conclusions:

  • Halvade-RNA provides an efficient and parallelized solution for RNA-seq variant calling.
  • The pipeline is compatible with various Hadoop distributions (Cloudera, Amazon EMR).
  • Enables faster and more economical analysis of transcriptomic data for variant discovery.