Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Using the information embedded in the testing sample to break the limits caused by the small sample size in

Manli Zhu1, Aleix M Martinez

  • 1Department of Electrical and Computer Engineering, The Ohio State University, Columbus, OH 43210, USA. zhum.osu@gmail.com

BMC Bioinformatics
|June 17, 2008
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Current Status and Future Prospects of Research on Sepsis-Related Acute Kidney Injury.

International journal of molecular sciences·2026
Same author

Statistical modeling of immunoprecipitation efficiency of MeRIP-seq data enabled accurate detection and quantification of epitranscriptome.

Computational and structural biotechnology journal·2025
Same author

Evaluation of the nutritional, <i>in vitro</i> protein digestive and bioactive characteristics of a quinoa-based protein beverage.

Food chemistry: X·2025
Same author

Ultrahigh Exchange Bias Field/Coercive Field Ratio in In Situ Formed Two-Dimensional Magnetic Te-Cr<sub>2</sub>O<sub>3</sub>/Cr<sub>5</sub>Te<sub>6</sub> Heterostructures.

Advanced materials (Deerfield Beach, Fla.)·2025
Same author

Association of Circulating Homocysteine Level with the Risk of Nephropathy in Type 2 Diabetes Mellitus: A Meta-Analysis.

Hormone and metabolic research = Hormon- und Stoffwechselforschung = Hormones et metabolisme·2024
Same author

Predictive models of sepsis-associated acute kidney injury based on machine learning: a scoping review.

Renal failure·2024
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

This study introduces a novel tumor classification strategy that avoids pre-selecting genes. By leveraging information from testing samples, it improves classification accuracy and reduces overfitting in microarray data analysis.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Microarray analysis presents challenges in tumor classification due to a high gene-to-sample ratio.
  • Traditional statistical methods struggle to identify gene correlations with tumor types in small sample sets.
  • Pre-selecting gene subsets is a common but potentially limiting approach.

Purpose of the Study:

  • To develop a new classification strategy for microarray data that mitigates overfitting.
  • To eliminate the need for pre-selecting specific gene subsets.
  • To enhance tumor classification accuracy by utilizing testing sample information.

Main Methods:

  • A novel classification strategy is proposed that leverages information within testing samples.
  • The algorithm assumes the correct class for testing samples to optimize discrimination.

Related Experiment Videos

  • The approach is evaluated against established tumor classification methods using public datasets.
  • Main Results:

    • The proposed strategy effectively reduces the impact of overfitting in tumor classification.
    • It consistently achieves superior classification results compared to existing alternatives.
    • The method demonstrates robust performance across various publicly available datasets.

    Conclusions:

    • The challenge of limited samples in microarray analysis can be overcome.
    • Utilizing information embedded in testing samples provides a viable alternative to large sample requirements.
    • This approach offers a more efficient way to extract meaningful statistical information from microarray data.