Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Videos

Missing value estimation methods for DNA microarrays.

O Troyanskaya¹, M Cantor, G Sherlock

¹Stanford Medical Informatics Stanford University School of Medicine, Stanford, CA, USA.

Bioinformatics (Oxford, England)

|June 8, 2001

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Catch Data Can Unravel Elasmobranch Aggregation Dynamics and Group Behaviours.

Ecology and evolution·2025

Same author

Canadian Surgery Forum: Abstracts of presentations to the Annual Meetings of the Canadian Association of Bariatric Physicians and Surgeons, Canadian Association of General Surgeons, Canadian Association of Thoracic Surgeons, Canadian Hepato-Pancreato-Biliary Association, Canadian Society of Surgical Oncology, Canadian Society of Colon and Rectal Surgeons, Vancouver, BC, Sept. 17-21, 2013.

Canadian journal of surgery. Journal canadien de chirurgie·2025

Same author

Association of a green tea extract with serum immunoglobulin G status and neonatal vitality in newborn dairy calves.

Journal of dairy science·2022

Same author

Homophily around specialized foraging underlies dolphin social preferences.

Biology letters·2019

Same author

The structure of a bottlenose dolphin society is coupled to a unique foraging cooperation with artisanal fishermen.

Biology letters·2012

Same author

Microarray karyotyping of maltose-fermenting Saccharomyces yeasts with differing maltotriose utilization profiles reveals copy number variation in genes involved in maltose and maltotriose utilization.

Journal of applied microbiology·2010

Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026

Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026

Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026

Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026

Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

See all related articles

Missing values in gene expression data hinder analysis. Weighted K-nearest neighbors (KNNimpute) offers a robust solution for estimating missing data, outperforming Singular Value Decomposition (SVDimpute) and row average methods.

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Gene expression microarray experiments frequently produce datasets with missing values.
Many gene expression analysis algorithms require complete data matrices, limiting their application.
Missing data can reduce the effectiveness of clustering methods like hierarchical clustering and K-means.

Purpose of the Study:

To investigate and compare automated methods for estimating missing values in gene expression data.
To identify imputation methods that minimize the impact of incomplete datasets on downstream analyses.
To enhance the applicability of various gene expression analysis algorithms to datasets with missing values.

Main Methods:

Comparative evaluation of three missing value estimation methods: Singular Value Decomposition (SVDimpute), weighted K-nearest neighbors (KNNimpute), and row average.

Related Experiment Videos

Assessment of method performance across diverse real-world datasets and parameter settings.

Evaluation of imputation robustness with missing data ranging from 1% to 20%.

Main Results:

Weighted K-nearest neighbors (KNNimpute) demonstrated superior robustness and sensitivity for missing value estimation compared to SVDimpute.
Both SVDimpute and KNNimpute significantly outperformed the traditional row average method and zero-filling approaches.
The study provides a comprehensive comparison of imputation techniques under varying conditions of data incompleteness.

Conclusions:

KNNimpute is recommended as a reliable method for imputing missing values in gene expression microarray data.
The findings support the use of advanced imputation techniques to improve the accuracy and scope of gene expression data analysis.
Recommendations and tools are provided for accurate missing data estimation in microarray datasets.