Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pathway-based analyses of gene expression profiles at low doses of ionizing radiation.

Frontiers in bioinformatics·2024
Same author

Optimal decision-making in high-throughput virtual screening pipelines.

Patterns (New York, N.Y.)·2023
Same author

Knowledge-driven learning, optimization, and experimental design under uncertainty for materials discovery.

Patterns (New York, N.Y.)·2023
Same author

Sensitivity Analysis of Genome-Scale Metabolic Flux Prediction.

Journal of computational biology : a journal of computational molecular cell biology·2023
Same author

Probabilistic boolean networks predict transcription factor targets to induce transdifferentiation.

iScience·2022
Same author

Can Machine Learning classifiers be used to regulate nutrients using small training datasets for aquaponic irrigation?: A comparative analysis.

PloS one·2022
Same journal

Learning directed acyclic graphs from large-scale genomics data.

EURASIP journal on bioinformatics & systems biology·2017
Same journal

Bayesian inference for biomarker discovery in proteomics: an analytic solution.

EURASIP journal on bioinformatics & systems biology·2017
Same journal

Review of stochastic hybrid systems with applications in biological systems modeling and analysis.

EURASIP journal on bioinformatics & systems biology·2017
Same journal

Using multi-step proposal distribution for improved MCMC convergence in Bayesian network structure learning.

EURASIP journal on bioinformatics & systems biology·2017
Same journal

On biometric systems: electrocardiogram Gaussianity and data synthesis.

EURASIP journal on bioinformatics & systems biology·2017
Same journal

Biomedical informatics with optimization and machine learning.

EURASIP journal on bioinformatics & systems biology·2017
See all related articles

Related Experiment Video

Updated: Jun 15, 2026

In Vivo Modeling of the Morbid Human Genome using Danio rerio
12:31

In Vivo Modeling of the Morbid Human Genome using Danio rerio

Published on: August 24, 2013

Impact of missing value imputation on classification for DNA microarray gene expression data--a model-based study.

Youting Sun1, Ulisses Braga-Neto, Edward R Dougherty

  • 1Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX 77843, USA.

EURASIP Journal on Bioinformatics & Systems Biology
|March 13, 2010
PubMed
Summary
This summary is machine-generated.

Missing value imputation in microarray data can improve classification accuracy under certain conditions, especially with high noise or strong correlations. However, imputation is not recommended for large amounts of missing data.

More Related Videos

Genotyping and Quantification of In Situ Hybridization Staining in Zebrafish
05:41

Genotyping and Quantification of In Situ Hybridization Staining in Zebrafish

Published on: January 28, 2020

Lung microRNA Profiling Across the Estrous Cycle in Ozone-exposed Mice
07:07

Lung microRNA Profiling Across the Estrous Cycle in Ozone-exposed Mice

Published on: January 7, 2019

Related Experiment Videos

Last Updated: Jun 15, 2026

In Vivo Modeling of the Morbid Human Genome using Danio rerio
12:31

In Vivo Modeling of the Morbid Human Genome using Danio rerio

Published on: August 24, 2013

Genotyping and Quantification of In Situ Hybridization Staining in Zebrafish
05:41

Genotyping and Quantification of In Situ Hybridization Staining in Zebrafish

Published on: January 28, 2020

Lung microRNA Profiling Across the Estrous Cycle in Ozone-exposed Mice
07:07

Lung microRNA Profiling Across the Estrous Cycle in Ozone-exposed Mice

Published on: January 7, 2019

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Numerous missing-value (MV) imputation methods exist for microarray data.
  • Few studies have rigorously examined the impact of MV imputation on classification accuracy.
  • Previous research often suffers from flawed MV generation and error estimation.

Purpose of the Study:

  • To conduct a model-based investigation into the relationship between MV imputation and classification accuracy in microarray data.
  • To address fundamental issues identified in prior studies concerning MV handling and evaluation.

Main Methods:

  • Evaluation of six popular MV imputation algorithms.
  • Incorporation of two feature selection methods.
  • Assessment using three distinct classification rules.
  • Model-based simulation to control for MV generation and error estimation.

Main Results:

  • MV imputation is beneficial when noise levels are high, variance is low, or gene-cluster correlations are strong, particularly at small to moderate MV rates.
  • Data quality metrics can guide imputation by identifying poor-quality points for imputation.
  • Performance of imputation methods exhibits a peaking phenomenon, improving up to an optimal MV rate before deteriorating rapidly.

Conclusions:

  • MV imputation is recommended for specific data characteristics and moderate missingness rates.
  • Imputation methods are not advised for high missing value rates.
  • The optimal MV rate for imputation performance should be considered to avoid accuracy degradation.