Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Missing value estimation methods for DNA microarrays.

O Troyanskaya1, M Cantor, G Sherlock

  • 1Stanford Medical Informatics Stanford University School of Medicine, Stanford, CA, USA.

Bioinformatics (Oxford, England)
|June 8, 2001
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Catch Data Can Unravel Elasmobranch Aggregation Dynamics and Group Behaviours.

Ecology and evolution·2025
Same author

Canadian Surgery Forum: Abstracts of presentations to the Annual Meetings of the Canadian Association of Bariatric Physicians and Surgeons, Canadian Association of General Surgeons, Canadian Association of Thoracic Surgeons, Canadian Hepato-Pancreato-Biliary Association, Canadian Society of Surgical Oncology, Canadian Society of Colon and Rectal Surgeons, Vancouver, BC, Sept. 17-21, 2013.

Canadian journal of surgery. Journal canadien de chirurgie·2025
Same author

Association of a green tea extract with serum immunoglobulin G status and neonatal vitality in newborn dairy calves.

Journal of dairy science·2022
Same author

Homophily around specialized foraging underlies dolphin social preferences.

Biology letters·2019
Same author

The structure of a bottlenose dolphin society is coupled to a unique foraging cooperation with artisanal fishermen.

Biology letters·2012
Same author

Microarray karyotyping of maltose-fermenting Saccharomyces yeasts with differing maltotriose utilization profiles reveals copy number variation in genes involved in maltose and maltotriose utilization.

Journal of applied microbiology·2010
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
Same journal

KASSPer: Kinase Active Site Structure Prediction using Protein and Ligand Language Models and Its Application to Virtual Screening.

Bioinformatics (Oxford, England)·2026
Same journal

IDR searcher: a search engine solution for public image resources.

Bioinformatics (Oxford, England)·2026
Same journal

KCFtools: Rapid alignment-free method for introgression screening and GWAS using k-mer profiles.

Bioinformatics (Oxford, England)·2026
Same journal

Meta2DB: Curated shotgun metagenomic feature sets and metadata for health state prediction.

Bioinformatics (Oxford, England)·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
See all related articles

Missing values in gene expression data hinder analysis. Weighted K-nearest neighbors (KNNimpute) offers a robust solution for estimating missing data, outperforming Singular Value Decomposition (SVDimpute) and row average methods.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Gene expression microarray experiments frequently produce datasets with missing values.
  • Many gene expression analysis algorithms require complete data matrices, limiting their application.
  • Missing data can reduce the effectiveness of clustering methods like hierarchical clustering and K-means.

Purpose of the Study:

  • To investigate and compare automated methods for estimating missing values in gene expression data.
  • To identify imputation methods that minimize the impact of incomplete datasets on downstream analyses.
  • To enhance the applicability of various gene expression analysis algorithms to datasets with missing values.

Main Methods:

  • Comparative evaluation of three missing value estimation methods: Singular Value Decomposition (SVDimpute), weighted K-nearest neighbors (KNNimpute), and row average.

Related Experiment Videos

  • Assessment of method performance across diverse real-world datasets and parameter settings.
  • Evaluation of imputation robustness with missing data ranging from 1% to 20%.
  • Main Results:

    • Weighted K-nearest neighbors (KNNimpute) demonstrated superior robustness and sensitivity for missing value estimation compared to SVDimpute.
    • Both SVDimpute and KNNimpute significantly outperformed the traditional row average method and zero-filling approaches.
    • The study provides a comprehensive comparison of imputation techniques under varying conditions of data incompleteness.

    Conclusions:

    • KNNimpute is recommended as a reliable method for imputing missing values in gene expression microarray data.
    • The findings support the use of advanced imputation techniques to improve the accuracy and scope of gene expression data analysis.
    • Recommendations and tools are provided for accurate missing data estimation in microarray datasets.