Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Gaussian mixture clustering and imputation of microarray data.

Ming Ouyang1, William J Welsh, Panos Georgopoulos

  • 1Environmental and Occupational Health Sciences Institute, UMDNJ-Robert Wood Johnson Medical School and Rutgers, The State University of New Jersey, 170 Frelinghuysen Road, Piscataway, NJ 08854, USA. ouyang@fidelio.rutgers.edu

Bioinformatics (Oxford, England)
|January 31, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comment on"Left ventricular systolic recovery after TAVI in severe aortic stenosis: A systematic review and meta-analysis".

International journal of cardiology. Cardiovascular risk and prevention·2026
Same author

A Helical Hydrogen-Bonded Aramide Macrocycle with a "Closed" Cavity: Dinuclear Ag(I) Coordination Assembly and Switchable Chirality Sensing.

Inorganic chemistry·2026
Same author

Comment on "The combination of glucagon-like peptide-1 receptor agonists and sodium-glucose cotransporter-2 inhibitors: a narrative review of existing meta-analysis".

Acta diabetologica·2026
Same author

Comment on "Beta-blocker therapy for acute myocardial infarction with preserved ejection fraction: A meta-analysis from randomized controlled trials".

Journal of cardiology·2026
Same author

Comment on "Postoperative Sleep Interventions and Cardiac Surgery: A Systematic Review and Meta-Analysis".

ANZ journal of surgery·2026
Same author

Modulating Co Species Charge Distribution in Co<sub>3</sub>O<sub>4</sub> for Boosted Low-Temperature Complete Oxidation of VOCs: Surface Co<sup>2+</sup>-O Sites Induced Reactant Adsorption and Activation.

Environmental science & technology·2026
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

This study compares missing value estimation methods for microarray data. Gaussian mixture clustering with model averaging imputation proved most accurate for both time-series and non-time series datasets.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Microarray experiments frequently encounter missing data due to chip imperfections.
  • Missing values affect over 90% of genes in large-scale studies, necessitating imputation or exclusion.
  • Standard analysis methods often require complete datasets, highlighting the need for effective missing value handling.

Purpose of the Study:

  • To compare various missing value estimation techniques in the context of microarray data analysis.
  • To evaluate the impact of different imputation methods on downstream analyses, particularly clustering.
  • To identify the most accurate and least biased imputation method for gene expression data.

Main Methods:

  • Utilized two key metrics for evaluating imputation accuracy: root mean squared error (RMSE) and the number of mis-clustered genes.

Related Experiment Videos

  • Assessed imputation performance on both time-series (correlated) and non-time series (uncorrelated) datasets.
  • Employed Gaussian mixture clustering with model averaging for imputation and comparison.
  • Main Results:

    • Gaussian mixture clustering with model averaging imputation demonstrated superior performance across both datasets.
    • This method showed minimal bias in clustering, as indicated by a lower number of mis-clustered genes.
    • It consistently outperformed other tested imputation techniques according to both RMSE and clustering accuracy metrics.

    Conclusions:

    • Model averaging imputation within a Gaussian mixture clustering framework is the recommended approach for handling missing values in microarray data.
    • This method provides accurate imputation and preserves the integrity of clustering results.
    • Effective missing value imputation is crucial for reliable gene expression data analysis.