Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Missing value imputation improves clustering and interpretation of gene expression microarray data.

Johannes Tuikkala1, Laura L Elo, Olli S Nevalainen

  • 1Department of Information Technology and TUCS, University of Turku, FI-20014 Turku, Finland. jotatu@utu.fi

BMC Bioinformatics
|April 22, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Predicting drug combination response surfaces.

npj drug discovery·2026
Same author

Standardized workflow enables reproducibility of drug synergism detection: Results from a multi-center in vitro ring test on complex drug combinations in pancreatic cancer models.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2026
Same author

Clarifying the scope and capabilities of ROTS in differential expression analysis.

Bioinformatics (Oxford, England)·2026
Same author

Joint modeling of longitudinal and time-to-event data for dynamic disease risk prediction using proteomics.

Protein science : a publication of the Protein Society·2026
Same author

A multi-center study on the consistency of drug sensitivity testing in patients with acute myeloid leukemia.

NPJ precision oncology·2026
Same author

REACTOR: REgulon Activity analysis and Comparison Tool for single-cell transcriptOmics Research.

Bioinformatics (Oxford, England)·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Missing values in gene expression microarray data complicate analysis. Advanced imputation methods, like Bayesian Principal Components Algorithm (BPCA), effectively reduce this impact on discovering gene groups, outperforming simpler methods.

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • Missing values in gene expression microarray data present significant analytical challenges.
  • Existing imputation methods vary in performance, lacking a consensus for selection.
  • This variability hinders reliable downstream analysis and interpretation of microarray datasets.

Purpose of the Study:

  • To evaluate and compare advanced missing value imputation methods for gene expression microarrays.
  • To assess the impact of different imputation strategies on the biological interpretation of gene expression data.
  • To determine the optimal approach for handling missing data in microarray experiments.

Main Methods:

  • Comparison of advanced imputation techniques on recent microarray datasets.

Related Experiment Videos

  • Evaluation of imputation accuracy at both measurement and biological interpretation levels.
  • Benchmarking against traditional methods like data deletion or mean imputation.
  • Main Results:

    • Discrepancies in measurement-level imputation accuracy across datasets become negligible when evaluating biological interpretation.
    • Advanced imputation methods consistently outperform ignoring missing data or using simple replacements (zeros, averages).
    • The Bayesian Principal Components Algorithm (BPCA) demonstrates effectiveness and efficiency.

    Conclusions:

    • Missing values complicate microarray analysis, but their impact on discovering biologically meaningful gene groups can be mitigated.
    • Readily available and fast imputation methods, such as BPCA, are crucial for reducing the negative effects of missing data.
    • Advanced imputation is essential for robust gene expression data analysis and biological discovery.