Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Precipitation and Co-precipitation

Precipitation and Co-precipitation

Precipitation and coprecipitation methods can be used to separate a mixture of ions in a solution. In qualitative inorganic analysis, ions that form sparingly soluble precipitates with the same reagent are separated based on the differences in solubility products. For example, consider the separation of Cu(II) and Fe(II) ions by precipitation as insoluble sulfides. First, copper(II) sulfide is precipitated by the addition of acidic H2S, where the dissociation of H2S is suppressed. Adding H2S...

Precipitation Processes

Precipitation Processes

The experimental conditions in a gravimetric analysis should be optimized to maximize the particle size and purity of the obtained precipitate. Ideally, the concentration of the precipitating reagent should be low with effective stirring to maintain low relative supersaturation for the growth of large crystals. In homogeneous precipitation, the precipitant is slowly generated by a chemical reaction in the solution to avoid local reagent excesses. For example, urea decomposes gradually to...

Responses to Drought and Flooding

Responses to Drought and Flooding

Water plays a significant role in the life cycle of plants. However, insufficient or excess of water can be detrimental and pose a serious threat to plants.

Elastic Collisions: Case Study

Elastic Collisions: Case Study

Elastic collision of a system demands conservation of both momentum and kinetic energy. To solve problems involving one-dimensional elastic collisions between two objects, the equations for conservation of momentum and conservation of internal kinetic energy can be used. For the two objects, the sum of momentum before the collision equals the total momentum after the collision. An elastic collision conserves internal kinetic energy, and so the sum of kinetic energies before the collision equals...

RNA-seq

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases.
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The Common Fund Data Ecosystem (CFDE).

bioRxiv : the preprint server for biology·2026

Same author

Learning Explainable Imaging-Genetics Associations Related to a Neurological Disorder.

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention·2026

Same author

Assembling unmapped reads reveals hidden variation in South Asian genomes.

Nature communications·2026

Same author

Perseus: Lineage-Aware Refinement of Kraken2 Taxonomic Classification for Long Read Metagenomes.

bioRxiv : the preprint server for biology·2026

Same author

A neofunctionalized flowering antagonist created an evolutionary contingency that channeled Solanaceae adaptation.

bioRxiv : the preprint server for biology·2026

Same author

AniAnn's: alignment-free annotation of tandem repeat arrays using fast average nucleotide identity estimates.

bioRxiv : the preprint server for biology·2026

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026

Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026

Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026

Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 24, 2026

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

CloudBurst: highly sensitive read mapping with MapReduce.

Michael C Schatz¹

¹Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA. mschatz@umiacs.umd.edu

Bioinformatics (Oxford, England)

|April 10, 2009

Summary

This summary is machine-generated.

CloudBurst is a novel parallel read-mapping algorithm designed for next-generation sequencing data. It significantly accelerates genomic analysis by leveraging MapReduce for efficient, scalable processing on multiple compute nodes.

More Related Videos

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Published on: October 2, 2016

Related Experiment Videos

Last Updated: Jun 24, 2026

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Introductory Analysis and Validation of CUT&RUN Sequencing Data

Published on: December 13, 2024

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Rapid High-throughput Species Identification of Botanical Material Using Direct Analysis in Real Time High Resolution Mass Spectrometry

Published on: October 2, 2016

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Next-generation sequencing generates vast amounts of data, overwhelming traditional single-processor read-mapping algorithms.
Efficiently mapping short DNA reads to reference genomes is crucial for various biological analyses.

Purpose of the Study:

To develop a parallel read-mapping algorithm, CloudBurst, optimized for next-generation sequencing data.
To improve the speed and scalability of mapping short reads to large reference genomes like the human genome.

Main Methods:

CloudBurst is a parallel read-mapping algorithm modeled after RMAP.
It utilizes the open-source Hadoop implementation of MapReduce to parallelize execution across multiple compute nodes.
The algorithm reports all alignments or the best alignment for each read, allowing for adjustable mismatch tolerance.

Main Results:

CloudBurst exhibits linear scaling of running time with the number of reads mapped.
Near-linear speedup is achieved as the number of processors increases.
On a 96-core system, CloudBurst demonstrated over 100-fold performance improvement compared to single-core RMAP, reducing mapping time from hours to minutes for millions of reads.

Conclusions:

CloudBurst offers a significant performance enhancement for mapping next-generation sequencing reads.
Its parallel architecture and MapReduce implementation provide a scalable solution for large-scale genomic analyses.
The open-source availability of CloudBurst facilitates its adoption and serves as a model for parallelizing similar algorithms.