Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jun 18, 2026

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies
14:56

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies

Published on: May 6, 2022

Dealing with missing values in large-scale studies: microarray data imputation and beyond.

Tero Aittokallio1

  • 1Biomathematics Research Group, Department of Mathematics, FI-20014 University of Turku, Finland. tero.aittokallio@utu.fi

Briefings in Bioinformatics
|December 8, 2009
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Predicting drug combination response surfaces.

npj drug discovery·2026
Same author

Standardized workflow enables reproducibility of drug synergism detection: Results from a multi-center in vitro ring test on complex drug combinations in pancreatic cancer models.

Biomedicine & pharmacotherapy = Biomedecine & pharmacotherapie·2026
Same author

A multi-center study on the consistency of drug sensitivity testing in patients with acute myeloid leukemia.

NPJ precision oncology·2026
Same author

Multimodal immunopharmacologic screens identify drugs rewiring the cancer-immune interface.

bioRxiv : the preprint server for biology·2026
Same author

Preclinical models of hepatosplenic γδ T-cell lymphoma with an activating STAT5B mutation display sensitivity to JAK inhibitor upadacitinib.

HemaSphere·2026
Same author

Prognostic biomarker discovery in pancreatic cancer through hybrid ensemble feature selection and multi-omics data.

BioData mining·2026
Same journal

STED: flexible cross-modal topic modeling infers cell-type-specific regulatory landscapes from bulk epigenomics.

Briefings in bioinformatics·2026
Same journal

A knowledge-guided deep learning framework for quantitative nucleic acid testing.

Briefings in bioinformatics·2026
Same journal

Optimal transport for label transfer in single-cell multi-omics integration.

Briefings in bioinformatics·2026
Same journal

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026
Same journal

Evaluating completeness, coherence, and consistency of genome-scale function annotations.

Briefings in bioinformatics·2026
Same journal

Transformers for single-cell RNA sequencing: a survey.

Briefings in bioinformatics·2026
See all related articles

Missing values in high-throughput data hinder analysis. This review details imputation strategies and evaluation methods, offering guidance for choosing tools and suggesting future research directions for missing data imputation.

Area of Science:

  • Biotechnology
  • Bioinformatics
  • Genomics
  • Proteomics

Background:

  • High-throughput biotechnologies like gene expression microarrays and proteomic assays frequently generate missing values due to experimental variability.
  • These missing data points can significantly impede downstream data analyses, necessitating robust handling strategies.
  • Missing value imputation has become a standard preprocessing step in large-scale biological data analysis.

Purpose of the Study:

  • To systematically review current missing value imputation strategies for high-throughput biological data.
  • To describe performance evaluation measures for imputation methods.
  • To provide practical guidance for selecting appropriate imputation tools and identify future research directions.

Main Methods:

Related Experiment Videos

Last Updated: Jun 18, 2026

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies
14:56

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies

Published on: May 6, 2022

  • Review of existing literature on missing value imputation techniques.
  • Categorization of imputation methods based on their principles and applications.
  • Discussion of performance metrics for evaluating imputation accuracy and effectiveness.

Main Results:

  • A comprehensive overview of imputation methods, initially focusing on gene expression microarray data, and extending to other large-scale datasets.
  • Description of various strategies for addressing missing data, including their strengths and weaknesses.
  • Identification of a gap in systematic evaluations of imputation methods across different data types and research questions.

Conclusions:

  • Effective imputation is crucial for reliable analysis of high-throughput biological data.
  • A systematic evaluation framework is needed to guide the selection of imputation methods.
  • Further research is required to develop and refine imputation methodologies for diverse biological datasets.