Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Simultaneous gene clustering and subset selection for sample classification via MDL.

Rebecka Jörnsten1, Bin Yu

  • 1Department of Statistics, Rutgers University, 501 Hill Center, Piscataway, NJ 08854, USA. rebecka@stat.rutgers.edu

Bioinformatics (Oxford, England)
|June 13, 2003
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exosomes secreted from GATA-4 overexpressing mesenchymal stem cells serve as a reservoir of anti-apoptotic microRNAs for cardioprotection.

International journal of cardiology·2015
Same author

[Etiology and pathogens of fungal endophthalmitis].

[Zhonghua yan ke za zhi] Chinese journal of ophthalmology·2015
Same author

Eucommia ulmoides Oliv. bark aqueous extract inhibits osteoarthritis in a rat model of osteoarthritis.

Journal of ethnopharmacology·2015
Same author

Aucubin prevents interleukin-1 beta induced inflammation and cartilage matrix degradation via inhibition of NF-κB signaling pathway in rat articular chondrocytes.

International immunopharmacology·2015
Same author

Treatment with recombinant lubricin attenuates osteoarthritis by positive feedback loop between articular cartilage and subchondral bone in ovariectomized rats.

Bone·2015
Same author

Tet1-mediated DNA demethylation regulates neuronal cell death induced by oxidative stress.

Scientific reports·2015
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

This study introduces a novel algorithm for simultaneous gene clustering and sample classification, improving data interpretability and classification accuracy. The method provides sparse, stable, and interpretable classification rules for gene expression data.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Microarray technology enables simultaneous monitoring of thousands of genes.
  • Gene expression data analysis often separates gene clustering and sample classification.
  • This separation can obscure informative data structures.

Purpose of the Study:

  • To develop an algorithm for simultaneous gene clustering and subset selection for sample classification.
  • To introduce a novel model selection criterion based on the minimum description length (MDL) principle.
  • To generate sparse and interpretable classification rules.

Main Methods:

  • Developed a new MDL-based model selection criterion for both genes and sample class labels.
  • Implemented an algorithm for simultaneous gene clustering and subset selection.

Related Experiment Videos

  • Applied the algorithm to publicly available gene expression datasets.
  • Main Results:

    • Achieved sparse and interpretable classification models based on cluster centroids.
    • Obtained competitive test error rates compared to existing methods.
    • Demonstrated stability and consistency of cluster centroids across cross-validation samples.

    Conclusions:

    • The simultaneous approach effectively integrates gene clustering and sample classification.
    • The algorithm yields robust, interpretable, and accurate classification rules.
    • The method enhances the discovery of class-informative structures within gene expression data.