Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

DNA Microarrays02:34

DNA Microarrays

21.6K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
21.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dysregulated Intestinal Nutrient Absorption in Obesity Is Associated with Altered Chromatin Accessibility.

Organoids·2025
Same author

CDK12 Deficiency Induces BRD4S to Promote Cancer Metastasis by Enhancing Phase Separation and Chromatin Engagement.

Cancer research·2025
Same author

DNA2 protein destruction dictates DNA hyperexcision, cGAS-STING activation, and innate immune response in CDK12-deregulated cancers.

Proceedings of the National Academy of Sciences of the United States of America·2025
Same author

SNP rs9364554 Modulates Androgen Receptor Binding and Drug Response in Prostate Cancer.

Biomolecules·2025
Same author

Single-cell multiomics reveal divergent effects of DNMT3A- and TET2-mutant clonal hematopoiesis in inflammatory response.

Blood advances·2024
Same author

Identify Non-mutational p53 Functional Deficiency in Human Cancers.

Genomics, proteomics & bioinformatics·2024
Same journal

Region-aware bridge modeling enables interpretable mesoscale representation of spatial transcriptomic tissue sections.

Bioinformatics advances·2026
Same journal

Microbiome differential abundance methodologies to detect relevant taxa associated with chemotherapy toxicity rate in colorectal cancer.

Bioinformatics advances·2026
Same journal

maldipickr dereplicates microbial MALDI-TOF spectra to facilitate multiplexed isolation.

Bioinformatics advances·2026
Same journal

RAM-MSA: an anytime memory-bounded method for exact multiple sequence alignment using path finding.

Bioinformatics advances·2026
Same journal

Interpretable machine learning for low-sample multi-omics: a case study of ferret vaccine response.

Bioinformatics advances·2026
Same journal

DeepTaxa: a hybrid CNN-BERT framework for 16S rRNA taxonomic classification.

Bioinformatics advances·2026
See all related articles

Related Experiment Video

Updated: Mar 6, 2026

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies
14:56

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies

Published on: May 6, 2022

5.3K

Multi-output learning for systematic missing value imputation in DNA methylation arrays.

Tao Ma1, Jinfu Nie2, Jian Huang3

  • 1Division of Computational Biology, Department of Quantitative Health Sciences, Mayo Clinic College of Medicine & Science, Rochester, MN 55905, United States.

Bioinformatics Advances
|March 5, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel imputation framework to address missing DNA methylation data caused by Illumina array updates. The method effectively integrates diverse datasets and improves epigenetic age prediction models.

More Related Videos

Enhanced Reduced Representation Bisulfite Sequencing for Assessment of DNA Methylation at Base Pair Resolution
13:47

Enhanced Reduced Representation Bisulfite Sequencing for Assessment of DNA Methylation at Base Pair Resolution

Published on: February 24, 2015

26.5K
Comprehensive DNA Methylation Analysis Using a Methyl-CpG-binding Domain Capture-based Method in Chronic Lymphocytic Leukemia Patients
13:21

Comprehensive DNA Methylation Analysis Using a Methyl-CpG-binding Domain Capture-based Method in Chronic Lymphocytic Leukemia Patients

Published on: June 16, 2017

10.6K

Related Experiment Videos

Last Updated: Mar 6, 2026

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies
14:56

Sample Preparation to Bioinformatics Analysis of DNA Methylation: Association Strategy for Obesity and Related Trait Studies

Published on: May 6, 2022

5.3K
Enhanced Reduced Representation Bisulfite Sequencing for Assessment of DNA Methylation at Base Pair Resolution
13:47

Enhanced Reduced Representation Bisulfite Sequencing for Assessment of DNA Methylation at Base Pair Resolution

Published on: February 24, 2015

26.5K
Comprehensive DNA Methylation Analysis Using a Methyl-CpG-binding Domain Capture-based Method in Chronic Lymphocytic Leukemia Patients
13:21

Comprehensive DNA Methylation Analysis Using a Methyl-CpG-binding Domain Capture-based Method in Chronic Lymphocytic Leukemia Patients

Published on: June 16, 2017

10.6K

Area of Science:

  • Genomics
  • Epigenetics
  • Bioinformatics

Background:

  • Illumina DNA methylation arrays have undergone rapid evolution, leading to genomic coverage expansion but also backward incompatibilities.
  • The removal of CpG sites across array versions creates systematic missing values, hindering the integration and reuse of legacy datasets.

Purpose of the Study:

  • To develop and validate a robust imputation framework for addressing systematic missing DNA methylation values.
  • To enable seamless data integration across different Illumina array generations and other epigenomic data types.
  • To enhance the performance of downstream epigenetic analyses, such as epigenetic age prediction.

Main Methods:

  • A two-stage imputation framework was developed, initially addressing random missingness with standard techniques.
  • The second stage employs multi-output machine learning models (SVR, k-NN, Random Forest, DNN) to impute systematic missing values.
  • The framework was evaluated on real datasets with induced missingness and compared against conventional imputation methods.

Main Results:

  • The proposed framework consistently outperformed conventional imputation approaches in datasets with up to 50% missingness.
  • Accurate imputation of missing CpG sites was achieved between methylation arrays and reduced representation bisulfite sequencing data, facilitating cross-platform integration.
  • Analysis of brain tumor methylation datasets showed restoration of array-specific patterns and preservation of biological complexity.
  • Imputation of missing methylation sites significantly improved epigenetic age prediction model performance.

Conclusions:

  • The developed imputation framework effectively handles systematic missing DNA methylation data, enabling robust cross-platform and cross-array integration.
  • This approach preserves biological complexity and enhances the accuracy of epigenetic age prediction.
  • The 'ultra-impute' Python package provides a freely available tool for researchers to implement this imputation strategy.