Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Improving missing value estimation in microarray data with gene ontology.

Johannes Tuikkala1, Laura Elo, Olli S Nevalainen

  • 1Department of Information Technology, University of Turku, Lemminkäisenkatu 14A, FIN-20520, Finland. jotatu@utu.fi

Bioinformatics (Oxford, England)
|December 27, 2005
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Predicting drug combination response surfaces.

npj drug discovery·2026
Same author

Standardized workflow enables reproducibility of drug synergism detection: Results from a multi-center in vitro ring test on complex drug combinations in pancreatic cancer models.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2026
Same author

A multi-center study on the consistency of drug sensitivity testing in patients with acute myeloid leukemia.

NPJ precision oncology·2026
Same author

Multimodal immunopharmacologic screens identify drugs rewiring the cancer-immune interface.

bioRxiv : the preprint server for biology·2026
Same author

Preclinical models of hepatosplenic γδ T-cell lymphoma with an activating STAT5B mutation display sensitivity to JAK inhibitor upadacitinib.

HemaSphere·2026
Same author

Prognostic biomarker discovery in pancreatic cancer through hybrid ensemble feature selection and multi-omics data.

BioData mining·2026
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

This study enhances gene expression data analysis by using Gene Ontology (GO) annotations to improve missing value estimation. Incorporating GO data boosts the accuracy of imputation methods, especially with limited experimental conditions and high missing data rates.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Gene expression microarray experiments frequently yield datasets with missing values, impacting downstream analysis.
  • Current missing value estimation methods rely solely on expression data, lacking external functional context.
  • Accurate imputation is crucial for statistical and machine learning techniques applied to microarray data.

Purpose of the Study:

  • To investigate the utility of functional similarity information from Gene Ontology (GO) annotations for improving missing value estimation in gene expression data.
  • To assess the impact of GO-based semantic similarity on the performance of imputation algorithms.

Main Methods:

  • Utilized Gene Ontology (GO) annotations to derive semantic similarity between genes.

Related Experiment Videos

  • Integrated GO information into the k-nearest neighbor (KNN) imputation algorithm.
  • Employed an adaptive weight selection procedure to automatically determine the contribution of different information sources.
  • Main Results:

    • Incorporating GO information significantly enhanced the performance of the KNN algorithm for missing value imputation in yeast cDNA microarray datasets.
    • Performance improvements were most pronounced under conditions with a small number of experimental variables and a high percentage of missing values.
    • The benefit of GO information was less pronounced with more complex imputation methods.

    Conclusions:

    • Leveraging functional similarity from GO annotations is an effective strategy to improve missing value imputation in gene expression data.
    • Even a small proportion of annotated genes can lead to substantial improvements in data quality, aiding microarray experiment interpretation.
    • The developed approach offers a valuable tool for enhancing the reliability of genomic data analysis.