Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

Comparing the Survival Analysis of Two or More Groups

Comparing the Survival Analysis of Two or More Groups

Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...

Introduction to Nonparametric Statistics

Introduction to Nonparametric Statistics

Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Tension pneumoperitoneum combined with CO<sub>2</sub> gas embolism during peroral endoscopic myotomy: a case report and review of literature.

Frontiers in medicine·2026

Same author

Maintenance treatment and survival in patients with newly diagnosed diffuse large B cell lymphoma in the immunochemotherapy era: a systematic review and network meta-analysis.

Clinical & translational oncology : official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico·2026

Same author

Hyperpolarized Molecular Nuclear Spins Achieve Magnetic Amplification.

Physical review letters·2026

Same author

A deep learning PET/CT biomarker for early progression (POD24) and survival stratification in follicular lymphoma: a multicenter study.

European journal of nuclear medicine and molecular imaging·2026

Same author

Zero- to ultralow-field J-spectroscopy with a diamond magnetometer.

Communications chemistry·2026

Same author

Association between body mass index and outcomes in lymphoma-associated haemophagocytic lymphohistiocytosis: A retrospective multicentre cohort study of Jiangsu Cooperative Lymphoma Group (JCLG).

British journal of haematology·2026

Same journal

CrunchLLM: Multitask LLMs for Structured Business Reasoning and Outcome Prediction.

Neurocomputing·2026

Same journal

Deep Learning for analyzing chaotic dynamics in biological time series: Insights from frog heart signals.

Neurocomputing·2026

Same journal

SymRefine: A symbolic regression approach for refining and compressing neural networks.

Neurocomputing·2026

Same journal

Artificial intelligence without restriction surpassing human intelligence with probability one: Theoretical insight into secrets of the brain with AI twins of the brain.

Neurocomputing·2025

Same journal

ShaderNN: A Lightweight and Efficient Inference Engine for Real-time Applications on Mobile GPUs.

Neurocomputing·2025

Same journal

Improving Adversarial Robustness of Deep Neural Networks via Adaptive Margin Evolution.

Neurocomputing·2023

See all related articles

Search research articles

Home
Comparing Multi-class Classifier Performance By Multi-class Roc Analysis: A Nonparametric Approach.

Home
Comparing Multi-class Classifier Performance By Multi-class Roc Analysis: A Nonparametric Approach.

Related Experiment Video

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

Comparing multi-class classifier performance by multi-class ROC analysis: A nonparametric approach.

¹Department of Radiology, Johns Hopkins University, MD, USA.

|April 22, 2024

View abstract on PubMed

Summary

This summary is machine-generated.

This study introduces a new method to estimate the variance of multi-class area under the ROC curve (MAUC) for machine learning classifiers. The approach accurately quantifies classifier performance and aids in comparing multiple models.

Keywords:

Ustatistics area under the ROC curve (AUC)jackknife multi-class AUC multi-class classification receiver operating characteristic (ROC)

More Related Videos

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Related Experiment Videos

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Detection of Architectural Distortion in Prior Mammograms via Analysis of Oriented Patterns

Published on: August 30, 2013

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Area of Science:

Machine Learning and Statistical Analysis
Computational Statistics
Pattern Recognition

Background:

The area under the ROC curve (AUC) is a standard metric for binary classification performance.
Real-world applications frequently involve multi-class classification, necessitating metrics beyond binary AUC.
Existing multi-class AUC (MAUC) variance estimation often relies on computationally intensive resampling techniques due to complex correlation patterns.

Purpose of the Study:

To generalize DeLong's non-parametric approach for binary AUC variance estimation to multi-class AUC (MAUC).
To develop an accurate and efficient method for estimating the variance of MAUC and the covariance of correlated MAUCs.
To provide a computationally tractable solution for comparing multi-class classifiers.

Main Methods:

Derived a closed-form expression for the covariance matrix of pairwise AUCs within a single MAUC.
Obtained an approximate covariance matrix with a compact, matrix factorization form by dropping higher-order terms.
Extended the approach to estimate the covariance of correlated MAUCs from competing multi-class classifiers.

Main Results:

The proposed method provides accurate variance and covariance estimates for MAUC, confirmed by numerical studies.
The derived covariance matrix offers a computationally efficient basis for MAUC variance estimation.
For binary correlated AUCs, the results align with DeLong's established method, validating the generalization.

Conclusions:

The developed method offers a statistically sound and computationally efficient alternative to resampling for MAUC variance estimation.
This work facilitates more reliable quantification and comparison of multi-class classifiers in machine learning and statistical analysis.
Source code is available on GitHub for broad adoption and implementation.