Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

A comparison of cluster analysis methods using DNA methylation data.

Kimberly D Siegmund1, Peter W Laird, Ite A Laird-Offringa

  • 1Department of Preventive Medicine, Norris Cancer Center, Keck School of Medicine, University of Southern California, Los Angeles 90089, USA. kims@usc.edu

Bioinformatics (Oxford, England)
|March 27, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multidimensional exposure architecture shapes vaping-associated transcriptomic dysregulation in oral epithelium.

Frontiers in oncology·2026
Same author

The association of epigenetic age acceleration with internal smoking dose, risk of lung cancer, and all-cause mortality in cigarette smokers: the Multiethnic Cohort study.

Clinical epigenetics·2026
Same author

Interaction of genetic and lifestyle risk scores on colorectal cancer risk across five racial and ethnic populations.

Journal of the National Cancer Institute·2026
Same author

The effect of cigarette exposure on placental epigenetics: A systematic review.

Reproductive toxicology (Elmsford, N.Y.)·2026
Same author

Systematic review and meta-analysis corrected for history of smoking tobacco identifies type 1 diabetes as a possible risk factor for bladder cancer.

Diabetes research and clinical practice·2025
Same author

Genome-wide profiling of unmodified DNA using methyltransferase-directed tagging and enrichment.

Cell reports methods·2025
Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026
Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026
Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026
Same journal

Informative Relational Learning for Adverse Reaction Prediction with Enhanced Generalization to Novel Drugs.

Bioinformatics (Oxford, England)·2026
Same journal

An interpretable deep learning framework uncovers features governing CRISPR-Cas9 genome-editing efficiency.

Bioinformatics (Oxford, England)·2026
Same journal

3DICE: Interpretable 3D Cross-Modal Learning for Drug-Target Interaction Prediction and Large-Scale Drug Discovery.

Bioinformatics (Oxford, England)·2026
See all related articles

A new Bernoulli-lognormal mixture model accurately clusters DNA methylation data, outperforming existing methods for cancer subtype identification. This approach offers improved reliability for analyzing complex biological datasets.

Area of Science:

  • Genomics
  • Bioinformatics
  • Cancer Research

Background:

  • Aberrant DNA methylation is a hallmark of cancer, with distinct profiles aiding in tumor classification.
  • Quantitative DNA methylation data from MethyLight technology is often zero-inflated and non-normally distributed.
  • Existing cluster analysis tools are inadequate for this type of data.

Purpose of the Study:

  • To evaluate and identify reliable cluster analysis methods for quantitative DNA methylation data.
  • To introduce a novel statistical model for analyzing MethyLight data.

Main Methods:

  • Development and evaluation of a Bernoulli-lognormal mixture model.
  • Comparison with standard cluster analysis methods for continuous and discrete data.
  • Application to DNA methylation data from lung cancer cell lines.

Related Experiment Videos

Main Results:

  • The Bernoulli-lognormal mixture model demonstrated the lowest classification error in simulation studies.
  • The proposed model achieved lower cross-validation error than hierarchical clustering for lung cancer subtype detection.
  • The mixture model provides more certain subgroup assignments for DNA methylation data.

Conclusions:

  • The Bernoulli-lognormal mixture model is a reliable and accurate method for clustering DNA methylation data.
  • This approach enhances the diagnostic potential of DNA methylation profiling in cancer research.
  • The developed model offers improved uncertainty quantification for subgroup assignments.