Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Combined 5 x 2 cv F test for comparing supervised classification learning algorithms.

E Alpaydin1

  • 1IDIAP, CP 592 CH-1920 Martigny, Switzerland, and Department of Computer Engineering, Boğaziçi University, TR-80815 Istanbul, Turkey.

Neural Computation
|December 1, 1999
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Omnivariate decision trees.

IEEE transactions on neural networks·2008
Same author

Constructive feedforward ART clustering networks. I.

IEEE transactions on neural networks·2008
Same author

Constructive feedforward ART clustering networks. II.

IEEE transactions on neural networks·2008
Same author

Local linear perceptrons for classification.

IEEE transactions on neural networks·1996
Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026
Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026
Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026
Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026
Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026
See all related articles

The 5 x 2 cv t test for comparing classifier error rates can be unreliable. A new combined 5 x 2 cv F test offers a more robust alternative with improved accuracy and statistical power.

Area of Science:

  • Machine Learning
  • Statistical Modeling
  • Computational Statistics

Background:

  • Classifier performance evaluation is crucial in machine learning.
  • Existing statistical tests like the 5 x 2 cv t test have limitations in reliability.
  • Variability in test results can impact accurate classifier comparison.

Purpose of the Study:

  • To address the variability issues in the 5 x 2 cv t test.
  • To propose a more robust statistical test for comparing classifier error rates.
  • To enhance the reliability of statistical comparisons between machine learning classifiers.

Main Methods:

  • Review of existing statistical tests for classifier comparison.
  • Development of a variant test: the combined 5 x 2 cv F test.

Related Experiment Videos

  • Monte Carlo simulations to evaluate test performance.
  • Main Results:

    • The standard 5 x 2 cv t test demonstrated sensitivity to irrelevant factors.
    • The proposed combined 5 x 2 cv F test showed reduced Type I error rates.
    • The combined 5 x 2 cv F test exhibited increased statistical power compared to the standard test.

    Conclusions:

    • The combined 5 x 2 cv F test provides a more stable and reliable method for comparing classifier error rates.
    • This new test improves upon the limitations of the original 5 x 2 cv t test.
    • The findings contribute to more dependable performance evaluation in machine learning.