Improving missing value estimation in microarray data with gene ontology.

Johannes Tuikkala¹, Laura Elo, Olli S Nevalainen

¹Department of Information Technology, University of Turku, Lemminkäisenkatu 14A, FIN-20520, Finland. jotatu@utu.fi

Bioinformatics (Oxford, England)

December 27, 2005

Summary

This summary is machine-generated.

This study enhances gene expression data analysis by using Gene Ontology (GO) annotations to improve missing value estimation. Incorporating GO data boosts the accuracy of imputation methods, especially with limited experimental conditions and high missing data rates.

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Gene expression microarray experiments frequently yield datasets with missing values, impacting downstream analysis.
Current missing value estimation methods rely solely on expression data, lacking external functional context.
Accurate imputation is crucial for statistical and machine learning techniques applied to microarray data.

Purpose of the Study:

To investigate the utility of functional similarity information from Gene Ontology (GO) annotations for improving missing value estimation in gene expression data.
To assess the impact of GO-based semantic similarity on the performance of imputation algorithms.

Main Methods:

Utilized Gene Ontology (GO) annotations to derive semantic similarity between genes.
Integrated GO information into the k-nearest neighbor (KNN) imputation algorithm.
Employed an adaptive weight selection procedure to automatically determine the contribution of different information sources.
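
The three steps above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not give the exact formula for combining expression distance with GO similarity, so the blending rule in `combined_distance`, the weight `alpha`, and the function names `knn_impute` and `select_alpha` are all assumptions made for illustration. The GO similarity matrix `go_sim` is assumed to be precomputed, with values in [0, 1].

```python
import numpy as np

def combined_distance(expr_dist, go_sim, alpha):
    # Hypothetical blending rule (an assumption, not the paper's formula):
    # alpha = 0 ignores GO; a larger alpha shrinks the distance between
    # genes that are functionally similar according to GO.
    return expr_dist * (1.0 - alpha * go_sim)

def knn_impute(X, go_sim, k=2, alpha=0.5):
    """Fill NaNs in X (genes x conditions) from the k nearest donor genes."""
    X = np.asarray(X, dtype=float)
    filled = X.copy()
    n_genes = X.shape[0]
    for i in range(n_genes):
        missing = np.isnan(X[i])
        if not missing.any():
            continue
        dists = np.full(n_genes, np.inf)
        for j in range(n_genes):
            if j == i:
                continue
            # Compare the two genes only on conditions observed in both.
            shared = ~np.isnan(X[i]) & ~np.isnan(X[j])
            if not shared.any():
                continue
            d = np.sqrt(np.mean((X[i, shared] - X[j, shared]) ** 2))
            dists[j] = combined_distance(d, go_sim[i, j], alpha)
        order = np.argsort(dists)
        for c in np.where(missing)[0]:
            # Donors must be comparable to gene i and observed in column c.
            donors = [j for j in order
                      if np.isfinite(dists[j]) and not np.isnan(X[j, c])][:k]
            if donors:
                w = 1.0 / (dists[donors] + 1e-9)  # inverse-distance weights
                filled[i, c] = np.sum(w * X[donors, c]) / np.sum(w)
    return filled

def select_alpha(X, go_sim, alphas=(0.0, 0.25, 0.5, 0.75, 1.0),
                 k=2, frac=0.1, seed=0):
    """Pick alpha by hiding known entries and scoring how well they are recovered."""
    rng = np.random.default_rng(seed)
    X = np.asarray(X, dtype=float)
    known = np.argwhere(~np.isnan(X))
    n_hide = max(1, int(frac * len(known)))
    hidden = known[rng.choice(len(known), size=n_hide, replace=False)]
    rows, cols = hidden[:, 0], hidden[:, 1]
    masked = X.copy()
    masked[rows, cols] = np.nan
    best_alpha, best_err = alphas[0], np.inf
    for a in alphas:
        est = knn_impute(masked, go_sim, k=k, alpha=a)
        diff = est[rows, cols] - X[rows, cols]
        if np.all(np.isnan(diff)):
            continue  # nothing could be imputed for this alpha
        err = np.sqrt(np.nanmean(diff ** 2))  # RMSE on the hidden entries
        if err < best_err:
            best_alpha, best_err = a, err
    return best_alpha
```

With `alpha=0` the sketch reduces to ordinary expression-only KNN imputation, which makes it easy to compare against the GO-weighted variant on the same data. Selecting `alpha` by masking known entries is one simple way to let the data decide how much the GO information should contribute; the original procedure may differ.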

Main Results:

Incorporating GO information significantly enhanced the performance of the KNN algorithm for missing value imputation in yeast cDNA microarray datasets.
Performance improvements were most pronounced under conditions with a small number of experimental variables and a high percentage of missing values.
The benefit of GO information was less pronounced with more complex imputation methods.

Conclusions:

Leveraging functional similarity from GO annotations is an effective strategy to improve missing value imputation in gene expression data.
Even a small proportion of annotated genes can lead to substantial improvements in data quality, aiding microarray experiment interpretation.
The developed approach offers a valuable tool for enhancing the reliability of genomic data analysis.