Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: May 30, 2026

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Strategies for aggregating gene expression data: the collapseRows R function.

Jeremy A Miller1, Chaochao Cai, Peter Langfelder

  • 1Interdepartmental Program for Neuroscience, UCLA, Los Angeles, California, USA.

BMC Bioinformatics
|August 6, 2011
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Psychiatric disorders converge on common pathways but diverge in cellular context, spatial distribution, and directionality of genetic effects.

medRxiv : the preprint server for health sciences·2025
Same author

Multi-ancestry genome-wide meta-analysis of 56,241 individuals identifies known and novel cross-population and ancestry-specific associations as novel risk loci for Alzheimer's disease.

Genome biology·2025
Same author

A Single-Cell Atlas of DNA Methylation in Autism Spectrum Disorder Reveals Distinct Regulatory and Aging Signatures.

bioRxiv : the preprint server for biology·2025
Same author

A single cell multi-omic analysis identifies molecular and gene-regulatory mechanisms dysregulated in the developing Down syndrome neocortex.

bioRxiv : the preprint server for biology·2025
Same author

Author Correction: Large-scale network analysis of the cerebrospinal fluid proteome identifies molecular signatures of frontotemporal lobar degeneration.

Nature aging·2025
Same author

Diverse Genomes, Shared Health: Insights from a Health System Biobank.

medRxiv : the preprint server for health sciences·2025
Same journal

SNPio: a Python interface for population genomic data processing.

BMC bioinformatics·2026
Same journal

SpaHNR: a spatial domain identification method via sparse attention-based hierarchical node representation and multi-view contrastive learning.

BMC bioinformatics·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
See all related articles

Summarizing high-dimensional genomic data is essential. The R function collapseRows offers robust methods for collapsing variables, improving gene expression meta-analysis and cell type deconvolution.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • High-dimensional genomic analyses require summarizing multiple related variables into a single representative.
  • Existing methods for collapsing, combining, reducing, or aggregating variables are limited.
  • Network-based approaches offer alternative strategies for variable summarization.

Purpose of the Study:

  • To introduce and evaluate the R function collapseRows for variable summarization in genomic data.
  • To compare standard statistical and network-based collapsing methods.
  • To demonstrate the utility of collapseRows in gene expression meta-analysis, co-expression module summarization, and cell type deconvolution.

Main Methods:

  • Implementation of multiple collapsing methods within the R function collapseRows.

More Related Videos

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Related Experiment Videos

Last Updated: May 30, 2026

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

  • Evaluation of performance in gene expression meta-analysis by collapsing probes for single genes.
  • Assessment of co-expression module summarization strategies.
  • Application to cell type deconvolution using marker gene aggregation.
  • Main Results:

    • Collapsing probes by highest average expression optimizes between-study consistency in microarray meta-analysis.
    • Optimal collapsing strategy for co-expression modules depends on the specific analysis goal.
    • Identifying the most highly connected 'hub' marker gene improves cell type abundance prediction accuracy.
    • The userListEnrichment function facilitates biological interpretation of collapsed gene lists.

    Conclusions:

    • The R function collapseRows provides robust and biologically relevant tools for genomic data summarization.
    • Both standard statistical and network-based collapsing methods are effective.
    • The function aids in gene expression meta-analysis, module analysis, and cell type deconvolution.