Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Using semantic search to find publicly available gene-expression datasets.

Bioinformatics (Oxford, England)·2026
Same author

Translating short-form Python exercises to other programming languages using diverse prompting strategies.

GigaScience·2025
Same author

Opportunities and considerations for using artificial intelligence in bioinformatics education.

Bioinformatics advances·2025
Same author

Using semantic search to find publicly available gene-expression datasets.

bioRxiv : the preprint server for biology·2025
Same author

CoMIT: a bioinformatic pipeline for risk-based prediction of COVID-19 test inclusivity.

BMC bioinformatics·2025
Same author

Comparison of Predictive Factors of Flu Vaccine Uptake Pre- and Post-COVID-19 Using the NIS-Teen Survey.

Vaccines·2024
Same journal

Updates and validation of the Compi RNA-seq pipeline with a case study in Alzheimer's disease.

Journal of integrative bioinformatics·2026
Same journal

Fragment-level FAIRness: annotating scientific data and its provenance using data fragment selectors.

Journal of integrative bioinformatics·2026
Same journal

Integrating cross-omics research through FAIR Digital Objects with DataPLANT.

Journal of integrative bioinformatics·2026
Same journal

Pheno-App 2.0 - a mobile app for collecting phenotypic data in plant research.

Journal of integrative bioinformatics·2026
Same journal

Evolving bioinformatics services - the journey of KPI metrics with Scorpion.

Journal of integrative bioinformatics·2026
Same journal

The community engagement and empowerment cycle: FAIRagro's framework to foster cultural change towards FAIR RDM practices in agrosystem science and beyond.

Journal of integrative bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jul 9, 2025

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

3.5K

TidyGEO: preparing analysis-ready datasets from Gene Expression Omnibus.

Avery Mecham1, Ashlie Stephenson1, Badi I Quinteros1

  • 1Department of Biology, Brigham Young University, Provo, UT, 84602, USA.

Journal of Integrative Bioinformatics
|December 4, 2023
PubMed
Summary
This summary is machine-generated.

TidyGEO simplifies Gene Expression Omnibus (GEO) data analysis by providing a web tool to clean and reformat sample annotations. This enables researchers to efficiently utilize vast biological datasets for secondary research.

Keywords:
data cleaningdata wranglinggene-expression analysisinteractive curationweb application

More Related Videos

Mapping the Structure-Function Relationships of Disordered Oncogenic Transcription Factors Using Transcriptomic Analysis
09:58

Mapping the Structure-Function Relationships of Disordered Oncogenic Transcription Factors Using Transcriptomic Analysis

Published on: June 27, 2020

2.8K
Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.6K

Related Experiment Videos

Last Updated: Jul 9, 2025

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

3.5K
Mapping the Structure-Function Relationships of Disordered Oncogenic Transcription Factors Using Transcriptomic Analysis
09:58

Mapping the Structure-Function Relationships of Disordered Oncogenic Transcription Factors Using Transcriptomic Analysis

Published on: June 27, 2020

2.8K
Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.6K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • The Gene Expression Omnibus (GEO) is a large public repository of high-throughput gene expression data.
  • GEO data requires significant cleaning and reformatting of sample-level annotations for effective secondary analysis.
  • Inconsistent annotation structures across GEO series hinder data usability.

Purpose of the Study:

  • To develop TidyGEO, a web-based tool to automate the tidying and reformatting of GEO sample annotations.
  • To facilitate secondary research by making GEO data more accessible and analyzable.
  • To address the challenges of manual data cleaning and the need for computational expertise.

Main Methods:

  • TidyGEO offers functionalities for selecting, renaming, splitting, merging, standardizing, and filtering annotation columns.
  • The tool supports integrating annotations with assay data and restructuring assay data.
  • TidyGEO generates reproducible code for data processing steps.

Main Results:

  • TidyGEO provides essential data-cleaning tasks for sample-level annotations from GEO.
  • Users can efficiently prepare GEO data for downstream analysis.
  • The tool enhances the reproducibility of data analysis.

Conclusions:

  • TidyGEO effectively addresses the challenges of processing GEO data for secondary research.
  • The tool empowers researchers with varying computational expertise to analyze complex biological datasets.
  • TidyGEO promotes efficient and reproducible use of public gene expression data.