Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Precipitation and Co-precipitation01:17

Precipitation and Co-precipitation

Precipitation and coprecipitation methods can be used to separate a mixture of ions in a solution. In qualitative inorganic analysis, ions that form sparingly soluble precipitates with the same reagent are separated based on the differences in solubility products. For example, consider the separation of Cu(II) and Fe(II) ions by precipitation as insoluble sulfides. First, copper(II) sulfide is precipitated by the addition of acidic H2S, where the dissociation of H2S is suppressed. Adding H2S...
Precipitation Processes01:12

Precipitation Processes

The experimental conditions in a gravimetric analysis should be optimized to maximize the particle size and purity of the obtained precipitate. Ideally, the concentration of the precipitating reagent should be low with effective stirring to maintain low relative supersaturation for the growth of large crystals. In homogeneous precipitation, the precipitant is slowly generated by a chemical reaction in the solution to avoid local reagent excesses. For example, urea decomposes gradually to...
Responses to Drought and Flooding02:41

Responses to Drought and Flooding

Water plays a significant role in the life cycle of plants. However, insufficient or excess of water can be detrimental and pose a serious threat to plants.
Elastic Collisions: Case Study01:15

Elastic Collisions: Case Study

Elastic collision of a system demands conservation of both momentum and kinetic energy. To solve problems involving one-dimensional elastic collisions between two objects, the equations for conservation of momentum and conservation of internal kinetic energy can be used. For the two objects, the sum of momentum before the collision equals the total momentum after the collision. An elastic collision conserves internal kinetic energy, and so the sum of kinetic energies before the collision equals...
RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Common Fund Data Ecosystem (CFDE).

bioRxiv : the preprint server for biology·2026
Same author

Learning Explainable Imaging-Genetics Associations Related to a Neurological Disorder.

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention·2026
Same author

Assembling unmapped reads reveals hidden variation in South Asian genomes.

Nature communications·2026
Same author

Perseus: Lineage-Aware Refinement of Kraken2 Taxonomic Classification for Long Read Metagenomes.

bioRxiv : the preprint server for biology·2026
Same author

A neofunctionalized flowering antagonist created an evolutionary contingency that channeled Solanaceae adaptation.

bioRxiv : the preprint server for biology·2026
Same author

AniAnn's: alignment-free annotation of tandem repeat arrays using fast average nucleotide identity estimates.

bioRxiv : the preprint server for biology·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Jun 24, 2026

Introductory Analysis and Validation of CUT&RUN Sequencing Data
04:58

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

CloudBurst: highly sensitive read mapping with MapReduce.

Michael C Schatz1

  • 1Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA. mschatz@umiacs.umd.edu

Bioinformatics (Oxford, England)
|April 10, 2009
PubMed
Summary
This summary is machine-generated.

CloudBurst is a novel parallel read-mapping algorithm designed for next-generation sequencing data. It significantly accelerates genomic analysis by leveraging MapReduce for efficient, scalable processing on multiple compute nodes.

More Related Videos

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry
11:14

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Published on: October 2, 2016

Related Experiment Videos

Last Updated: Jun 24, 2026

Introductory Analysis and Validation of CUT&RUN Sequencing Data
04:58

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry
11:14

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Published on: October 2, 2016

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Next-generation sequencing generates vast amounts of data, overwhelming traditional single-processor read-mapping algorithms.
  • Efficiently mapping short DNA reads to reference genomes is crucial for various biological analyses.

Purpose of the Study:

  • To develop a parallel read-mapping algorithm, CloudBurst, optimized for next-generation sequencing data.
  • To improve the speed and scalability of mapping short reads to large reference genomes like the human genome.

Main Methods:

  • CloudBurst is a parallel read-mapping algorithm modeled after RMAP.
  • It utilizes the open-source Hadoop implementation of MapReduce to parallelize execution across multiple compute nodes.
  • The algorithm reports all alignments or the best alignment for each read, allowing for adjustable mismatch tolerance.

Main Results:

  • CloudBurst exhibits linear scaling of running time with the number of reads mapped.
  • Near-linear speedup is achieved as the number of processors increases.
  • On a 96-core system, CloudBurst demonstrated over 100-fold performance improvement compared to single-core RMAP, reducing mapping time from hours to minutes for millions of reads.

Conclusions:

  • CloudBurst offers a significant performance enhancement for mapping next-generation sequencing reads.
  • Its parallel architecture and MapReduce implementation provide a scalable solution for large-scale genomic analyses.
  • The open-source availability of CloudBurst facilitates its adoption and serves as a model for parallelizing similar algorithms.