Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

On the statistical assessment of classifiers using DNA microarray data.

N Ancona1, R Maglietta, A Piepoli

  • 1lstituto di Studi sui Sistemi Intelligenti per I'Automazione-CNR, Via Amendola 122/D-l, 70126 Bari, Italy. ancona@ba.issia.cnr.it

BMC Bioinformatics
|August 22, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Half-Life and Precision Shape Measurement of the 2νββ Decay of ^{130}Te.

Physical review letters·2025
Same author

Search for Fractionally Charged Particles with CUORE.

Physical review letters·2025
Same author

Optimal Operation of Cryogenic Calorimeters Through Deep Reinforcement Learning.

Computing and software for big science·2024
Same author

Human neural stem cells derived from fetal human brain communicate with each other and rescue ischemic neuronal cells through tunneling nanotubes.

Cell death & disease·2024
Same author

Aggressive systemic mastocytosis with the co-occurrence of PRKG2::PDGFRB, KAT6A::NCOA2, and RXRA::NOTCH1 fusion transcripts and a heterozygous RUNX1 frameshift mutation.

Cancer genetics·2024
Same author

A National audit of the care of patients with acute kidney injury in England and Wales in 2019 and the association with patient outcomes.

Clinical medicine (London, England)·2024
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Accurate colon cancer classification is achievable with minimal data. Statistical methods assess gene expression profiles, identifying key genes for reliable diagnosis and prognosis.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Presents a novel statistical method for assessing cancer predictors using gene expression profiles.
  • Utilizes a new dataset of microarray gene expression data from colon cancer patients (22 normal, 25 tumor specimens).

Purpose of the Study:

  • To determine if available data is sufficient for accurate cancer classifiers.
  • To assess the statistical significance of error rates in classification.
  • To identify the number of genes crucial for accurate colon cancer classification.

Main Methods:

  • Employs Leave-K-Out Cross Validation to estimate generalization error for three classification schemes.
  • Uses permutation tests to measure the statistical significance of error rates.

Related Experiment Videos

  • Analyzes gene frequencies and their correlation with colon cancer pathology.
  • Main Results:

    • Weighted Voting Algorithm (WVA) achieved 21% error with 25 training examples (p=0.045).
    • Regularized Least Squares (RLS) and Support Vector Machines (SVM) achieved 19% (p=0.035) and 18% (p=0.037) error with 15 training examples, respectively.
    • RLS and SVM reached optimal performance with 35 training examples (14% and 11% error, respectively) and identified ~6000 significant genes (p<0.05).

    Conclusions:

    • The proposed method offers statistically significant insights for cancer diagnosis and prognosis.
    • Effective colon cancer classifiers can be trained with as few as 15 examples.
    • The number of genes required for reliable classification is dependent on the desired accuracy.