Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Videos

Superior feature-set ranking for small samples using bolstered error estimation.

Chao Sima¹, Ulisses Braga-Neto, Edward R Dougherty

¹Department of Electrical Engineering, Texas A&M University College Station, TX, USA.

Bioinformatics (Oxford, England)

|October 30, 2004

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Investigating the molecular mechanisms of resveratrol in treating diabetic foot ulcers: a comprehensive analysis of network pharmacology and experiment validation.

Frontiers in molecular biosciences·2025

Same author

AuNRs-PPARγmAb Induce Targeted Adipocyte Apoptosis Through Photothermal Effects for Effective Localized Fat Reduction.

International journal of nanomedicine·2025

Same author

Correction: facilitation of diabetic wound healing by far upstream element binding protein 1 through augmentation of dermal fibroblast activity.

Acta diabetologica·2025

Same author

Facilitation of diabetic wound healing by far upstream element binding protein 1 through augmentation of dermal fibroblast activity.

Acta diabetologica·2024

Same author

Pathway-based analyses of gene expression profiles at low doses of ionizing radiation.

Frontiers in bioinformatics·2024

Same author

Optimal decision-making in high-throughput virtual screening pipelines.

Patterns (New York, N.Y.)·2023

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026

Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026

Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026

Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026

See all related articles

Bolstered error estimation effectively ranks feature sets for classification with small samples, outperforming bootstrap and cross-validation. This computationally feasible method is ideal for large-scale feature selection in gene expression studies.

Area of Science:

Computational biology
Machine learning
Statistical learning

Background:

Feature set ranking is crucial for classification tasks, particularly in gene expression-based phenotype classification.
Error estimators used for ranking can be imprecise with small sample sizes, necessitating computationally feasible options.
Selecting an appropriate error estimator is vital for reliable feature-set ranking.

Purpose of the Study:

To evaluate the feature-ranking performance of various error estimators.
To compare these estimators across different classification rules and sample sizes.
To identify the most effective and computationally feasible error estimator for small-sample settings.

Main Methods:

Examined resubstitution, cross-validation, bootstrap, and bolstered error estimation.

Related Experiment Videos

Assessed performance using linear discriminant analysis, three-nearest-neighbor classification, and classification trees.

Utilized two performance measures: count of truly best feature sets and mean absolute error in ranks.

Main Results:

Bolstered error estimation demonstrated superior performance over bootstrap and cross-validation in identifying top feature sets for small samples.
Bootstrap outperformed cross-validation.
Bolstered error estimation is significantly faster than bootstrap and cross-validation, making it suitable for large feature sets.

Conclusions:

Bolstered error estimation is the most effective and computationally efficient method for feature-set ranking in small-sample classification.
This approach is particularly advantageous when dealing with a very large number of feature sets.
The findings support the use of bolstered error estimation for robust feature selection in bioinformatics and machine learning.