Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...

Comparing the Survival Analysis of Two or More Groups

Comparing the Survival Analysis of Two or More Groups

Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...

Bonferroni Test

Bonferroni Test

The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...

Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs

Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs

Bioequivalence experimental study designs play a pivotal role in testing the effectiveness of various treatments. Key among these are the repeated measures, cross-over, carry-over, and Latin square designs. In the repeated measures design, each subject receives all treatments, allowing for temporal comparisons. This type of design is useful in reducing variability but requires careful planning to avoid bias.The cross-over design, an economical method, involves sequential administration of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Integrative learning of individualized treatment rules from multiple studies with partially overlapping treatments.

Biometrics·2026

Same author

SEMIPARAMETRIC ANALYSIS OF INTERVAL-CENSORED DATA SUBJECT TO INACCURATE DIAGNOSES WITH A TERMINAL EVENT.

The annals of applied statistics·2026

Same author

DYNAMIC CLASSIFICATION OF LATENT DISEASE PROGRESSION WITH AUXILIARY SURROGATE LABELS.

The annals of applied statistics·2026

Same author

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection.

Journal of machine learning research : JMLR·2026

Same author

Data fusion methods for the heterogeneity of treatment effect and confounding function.

Bernoulli : official journal of the Bernoulli Society for Mathematical Statistics and Probability·2026

Same author

Leveraging precision medicine analytics to optimize inflammation reduction and enhance physical function in older adults.

The journals of gerontology. Series A, Biological sciences and medical sciences·2026

Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026

Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026

Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026

Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026

Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026

Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 21, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Power calculation for comparing diagnostic accuracies in a multi-reader, multi-test design.

Eunhee Kim¹, Zheng Zhang, Youdan Wang

¹Department of Biostatistics and Center for Statistical Sciences, Brown University, Providence, Rhode Island, U.S.A.

|October 31, 2014

Summary

This summary is machine-generated.

This study introduces a new power formula for comparing correlated areas under the ROC curve (AUC) in multi-reader, multi-test diagnostic accuracy studies. The method enhances sample size and power calculations for these complex research designs.

Keywords:

Multi-reader Multi-test design Power Receiver operating characteristic curve Sample size U-statistics

More Related Videos

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Published on: March 22, 2022

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

Related Experiment Videos

Last Updated: Apr 21, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Published on: March 22, 2022

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

Area of Science:

Biostatistics
Medical Diagnostics
Statistical Modeling

Background:

Receiver operating characteristic (ROC) analysis is crucial for evaluating diagnostic test performance.
The multi-reader, multi-test design is common for assessing diagnostic accuracy but lacks robust sample size and power methodologies.
Existing analytical approaches for this design have limitations regarding power and sample size considerations.

Purpose of the Study:

To develop a power formula for comparing correlated areas under the ROC curve (AUC) within a multi-reader, multi-test framework.
To provide a method for accurate sample size and power estimations in diagnostic accuracy research.
To extend existing nonparametric approaches for analyzing correlated AUCs.

Main Methods:

Developed a power formula based on the asymptotic distribution of nonparametric AUCs.
Extended DeLong et al.'s approach for estimating and comparing correlated AUCs.
Utilized simulation studies to validate the proposed power formula's performance.

Main Results:

The proposed power formula accurately estimates sample size and power for multi-reader, multi-test designs.
The nonparametric approach effectively compares correlated AUCs.
Simulation results demonstrate the reliability of the developed power formula.

Conclusions:

The new power formula provides a valuable tool for researchers conducting diagnostic accuracy studies using the multi-reader, multi-test design.
This methodology improves the planning and statistical rigor of studies evaluating diagnostic tests.
The findings facilitate more efficient and reliable assessment of diagnostic test performance.