Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Iterative class discovery and feature selection using Minimal Spanning Trees.

Sudhir Varma1, Richard Simon

  • 1Biometric Research Branch, National Cancer Institute, Rockville, USA. varmas@mail.nih.gov

BMC Bioinformatics
|September 10, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluation of periodontal ligament stem cell viability and proliferation on native collagen membranes: An <i>in vitro</i> study.

Journal of Indian Society of Periodontology·2026
Same author

2D Ultrasound Elasticity Imaging of Abdominal Aortic Aneurysms Using Deep Neural Networks.

IEEE transactions on computational imaging·2026
Same author

Toward Patient-Specific Partial Point Cloud to Surface Completion for Pre to Intra-operative Registration in Image-Guided Liver Interventions.

Medical Image Understanding and Analysis. Medical Image Understanding and Analysis (Conference)·2026
Same author

Evaluation of Intra-operative Patient-specific Methods for Point Cloud Completion for Minimally Invasive Liver Interventions.

Proceedings of SPIE--the International Society for Optical Engineering·2026
Same author

Investigating the Domain Adaptability of General-Purpose Foundation Models for Left Atrium Segmentation from MR Images.

Functional imaging and modeling of the heart : ... International Workshop, FIMH ..., proceedings. FIMH (Conference)·2026
Same author

Assessing the Performance of the DINOv2 Self-supervised Learning Vision Transformer Model for the Segmentation of the Left Atrium from MRI Images.

Proceedings of SPIE--the International Society for Optical Engineering·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

This study introduces an iterative clustering algorithm that improves gene expression data analysis by focusing on relevant gene subsets. It effectively identifies hidden sample clusters obscured by noisy data, enhancing biological discovery.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Microarray gene expression data analysis commonly employs clustering methods.
  • Existing sample clustering approaches often use all genes, masking biologically relevant signals within subsets due to noise from irrelevant genes.

Purpose of the Study:

  • To develop an algorithm for automatically detecting sample clusters discernible in specific gene subsets.
  • To address the challenge of noise from irrelevant genes obscuring true biological signals in gene expression data.

Main Methods:

  • An iterative approach combining Minimal Spanning Tree-based clustering and feature selection.
  • Step-wise removal of noise genes to enhance cluster detection and sharpen clustering.

Main Results:

Related Experiment Videos

  • The algorithm accurately resolves planted clusters in synthetic data, even with significant noise.
  • Demonstrates a low probability of detecting spurious clusters.
  • Identified known biological classes and novel clusters in real-world microarray datasets.

Conclusions:

  • The iterative clustering method significantly improves upon clustering with all genes.
  • Enables discovery of partitions whose biological significance can be elucidated through clinical correlates and gene annotations.
  • MATLAB programs for the algorithm are publicly available.