Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Ensemble machine learning on gene expression data for cancer classification.

Aik Choon Tan1, David Gilbert

  • 1Bioinformatics Research Centre, Department of Computing Science, University of Glasgow, Glasgow, UK. actan@brc.dcs.gla.ac.uk

Applied Bioinformatics
|May 8, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

irAE.AI: AI-powered exploration of real-world immune-related adverse events.

JAMIA open·2026
Same author

Targeting of MEK and Autophagy in Pancreatic Adenocarcinoma and Analysis of Treatment Sensitivity in Preclinical and Clinical Models: MEKiAUTO.

JCO precision oncology·2026
Same author

Clinico-genomic nomograms for estimation of survival risk in metastatic castrate-resistant prostate cancer.

JNCI cancer spectrum·2026
Same author

Associations Between Two-Year Immune-Related Adverse Events and Psychological Distress and Subsequent Survival Outcomes Among ICI-Treated Survivors Diagnosed With Advanced Cancers in the US.

Psycho-oncology·2026
Same author

Breast Cancer in Transgender Patients Following Medical and Surgical Intervention.

Clinical breast cancer·2026
Same author

Impact of baseline medications on real-world overall survival in immune checkpoint inhibitor-treated patients with cancer in the RADIOHEAD cohort.

Med (New York, N.Y.)·2026
Same journal

Statistically consistent identification of differentially expressed genes in DNA chip data over the whole expression range: relative variance method.

Applied bioinformatics·2006
Same journal

A nonparametric likelihood ratio test to identify differentially expressed genes from microarray data.

Applied bioinformatics·2006
Same journal

Simulation study of ratio calculation formulae of two-colour cDNA microarray data.

Applied bioinformatics·2006
Same journal

Alternative mRNA polyadenylation can potentially affect detection of gene expression by affymetrix genechip arrays.

Applied bioinformatics·2006
Same journal

Comparisons of annotation predictions for affymetrix GeneChips.

Applied bioinformatics·2006
Same journal

Ontology annotation treebrowser : an interactive tool where the complementarity of medical subject headings and gene ontology improves the interpretation of gene lists.

Applied bioinformatics·2006
See all related articles

Ensemble machine learning methods, like bagged and boosted decision trees, show superior performance in classifying cancer gene expression profiles compared to single decision trees. This aids in identifying crucial cancer-related genes from microarray data.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Whole genome RNA expression studies, particularly microarray analysis, are vital for understanding cellular processes and disease states.
  • Identifying differentially expressed genes between normal and cancerous cells is a key challenge in cancer research.
  • Machine learning has shown promise in classifying complex biological data.

Purpose of the Study:

  • To compare the efficacy of three supervised machine learning techniques for cancer classification using microarray data.
  • To evaluate the performance of C4.5 decision trees, bagged decision trees, and boosted decision trees in distinguishing cancer gene expression profiles.

Main Methods:

  • Utilized seven publicly available cancerous microarray datasets.

Related Experiment Videos

  • Applied C4.5 decision tree, bagged decision trees, and boosted decision trees for classification tasks.
  • Compared the classification and prediction accuracy of the employed machine learning methods.
  • Main Results:

    • Ensemble learning methods (bagged and boosted decision trees) generally outperformed single decision trees (C4.5).
    • Bagged and boosted decision trees demonstrated robust performance in classifying cancerous gene expression profiles.
    • The study identified specific patterns in gene expression that differentiate tumor cells from normal cells.

    Conclusions:

    • Ensemble machine learning approaches are effective for cancer classification based on gene expression data.
    • Bagged and boosted decision trees offer improved accuracy for identifying cancer-specific gene signatures.
    • These findings support the application of ensemble methods in cancer diagnosis and therapeutic development.