Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Sep 13, 2025

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K

Enabling scalable single-cell transcriptomic analysis through distributed computing with Apache spark.

Asif Adil1,2,3, Namrata Bhattacharya4,5, Aadam6

  • 1Department of Computer Sciences, Baba Ghulam Shah Badshah University, Rajouri, India. asifadil@bgsbu.ac.in.

Scientific Reports
|July 29, 2025
PubMed
Summary

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Advanced deep learning enables prediction of allogeneic stem cell mobilization success.

Bone marrow transplantation·2026
Same author

Advancing automated cell type annotation with large language models and single-cell isoform sequencing.

Computational and structural biotechnology journal·2025
Same author

Network based simultaneous embedding of cells and marker genes from scRNA-seq studies.

Briefings in bioinformatics·2025
Same author

Artificial intelligence approaches for tumor phenotype stratification from single-cell transcriptomic data.

eLife·2025
Same author

Usefulness of delayed primary closure in unplanned caesarean section to reduce surgical site infection in a resource-poor high population country: a randomised controlled trial.

Journal of the Turkish German Gynecological Association·2025
Same author

Correlation of Follicle-stimulating Hormone, Anti-Mullerian Hormone, and Antral Follicle Count with Age in Ovarian Reserve Testing.

International journal of applied & basic medical research·2024
Same journal

Turbulent flow in a vortex separator with a directed pipe inlet.

Scientific reports·2026
Same journal

Systematic characteristic evaluation of clay-based cementitious material derived from calcium carbide residue and waste tile powder.

Scientific reports·2026
Same journal

Retraction Note: Improvement of a rapid diagnostic application of monoclonal antibodies against avian influenza H7 subtype virus using Europium nanoparticles.

Scientific reports·2026
Same journal

Applying large language models to spam detection in the Kazakh low-resource language setting.

Scientific reports·2026
Same journal

An open-source 3D printing system enabling in-situ freeze-thaw processing of hydrogels.

Scientific reports·2026
Same journal

An enhanced EfficientNet framework for automated waste classification using cosine annealing and label smoothing.

Scientific reports·2026
See all related articles
This summary is machine-generated.

A new framework, scSPARKL, uses Apache Spark for efficient big single-cell data science. This tool accelerates the analysis of large single-cell RNA sequencing datasets, overcoming current computational challenges.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Single-cell RNA sequencing (scRNA-seq) generates vast datasets, posing significant management and analysis challenges.
  • The emerging field of big single-cell data science aims to address these challenges.
  • Existing computational tools often focus on analysis, neglecting the scale of scRNA-seq data generation.

Purpose of the Study:

  • To present scSPARKL, a novel parallel analytical framework for efficient analysis of big single-cell data.
  • To leverage Apache Spark's capabilities for scalable and fault-tolerant processing of scRNA-seq data.
  • To introduce optimized algorithms for key single-cell data operations within the Spark environment.

Main Methods:

  • Development of scSPARKL, a parallel analytical framework utilizing Apache Spark.
Keywords:
Apache sparkBig dataNormalizationQuality controlScRNA-seq

More Related Videos

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing
10:00

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing

Published on: May 23, 2018

17.9K
Transcriptome Analysis of Single Cells
07:27

Transcriptome Analysis of Single Cells

Published on: April 25, 2011

30.1K

Related Experiment Videos

Last Updated: Sep 13, 2025

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.7K
An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing
10:00

An Ultrahigh-throughput Microfluidic Platform for Single-cell Genome Sequencing

Published on: May 23, 2018

17.9K
Transcriptome Analysis of Single Cells
07:27

Transcriptome Analysis of Single Cells

Published on: April 25, 2011

30.1K
  • Implementation of six staged algorithms for scRNA-seq data processing: reshaping, preprocessing, filtering, normalization, dimensionality reduction, and clustering.
  • Evaluation of scSPARKL's utility and performance on real-world scRNA-seq datasets.
  • Main Results:

    • scSPARKL enables rapid and accurate analysis of scRNA-seq datasets of any size.
    • The framework effectively handles the scale and complexity of big single-cell data.
    • Experiments demonstrate the practical utility and efficiency of scSPARKL and its algorithms.

    Conclusions:

    • scSPARKL is a powerful and flexible tool for analyzing single-cell transcriptomic data.
    • The framework addresses the data deluge challenge in single-cell genomics.
    • scSPARKL has broad applications in biology and medicine for advancing single-cell data analysis.