Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Superior feature-set ranking for small samples using bolstered error estimation.

Chao Sima1, Ulisses Braga-Neto, Edward R Dougherty

  • 1Department of Electrical Engineering, Texas A&M University College Station, TX, USA.

Bioinformatics (Oxford, England)
|October 30, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Investigating the molecular mechanisms of resveratrol in treating diabetic foot ulcers: a comprehensive analysis of network pharmacology and experiment validation.

Frontiers in molecular biosciences·2025
Same author

AuNRs-PPARγmAb Induce Targeted Adipocyte Apoptosis Through Photothermal Effects for Effective Localized Fat Reduction.

International journal of nanomedicine·2025
Same author

Correction: facilitation of diabetic wound healing by far upstream element binding protein 1 through augmentation of dermal fibroblast activity.

Acta diabetologica·2025
Same author

Facilitation of diabetic wound healing by far upstream element binding protein 1 through augmentation of dermal fibroblast activity.

Acta diabetologica·2024
Same author

Pathway-based analyses of gene expression profiles at low doses of ionizing radiation.

Frontiers in bioinformatics·2024
Same author

Optimal decision-making in high-throughput virtual screening pipelines.

Patterns (New York, N.Y.)·2023
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Bolstered error estimation effectively ranks feature sets for classification with small samples, outperforming bootstrap and cross-validation. This computationally feasible method is ideal for large-scale feature selection in gene expression studies.

Area of Science:

  • Computational biology
  • Machine learning
  • Statistical learning

Background:

  • Feature set ranking is crucial for classification tasks, particularly in gene expression-based phenotype classification.
  • Error estimators used for ranking can be imprecise with small sample sizes, necessitating computationally feasible options.
  • Selecting an appropriate error estimator is vital for reliable feature-set ranking.

Purpose of the Study:

  • To evaluate the feature-ranking performance of various error estimators.
  • To compare these estimators across different classification rules and sample sizes.
  • To identify the most effective and computationally feasible error estimator for small-sample settings.

Main Methods:

  • Examined resubstitution, cross-validation, bootstrap, and bolstered error estimation.

Related Experiment Videos

  • Assessed performance using linear discriminant analysis, three-nearest-neighbor classification, and classification trees.
  • Utilized two performance measures: count of truly best feature sets and mean absolute error in ranks.
  • Main Results:

    • Bolstered error estimation demonstrated superior performance over bootstrap and cross-validation in identifying top feature sets for small samples.
    • Bootstrap outperformed cross-validation.
    • Bolstered error estimation is significantly faster than bootstrap and cross-validation, making it suitable for large feature sets.

    Conclusions:

    • Bolstered error estimation is the most effective and computationally efficient method for feature-set ranking in small-sample classification.
    • This approach is particularly advantageous when dealing with a very large number of feature sets.
    • The findings support the use of bolstered error estimation for robust feature selection in bioinformatics and machine learning.