Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

561
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
561
Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

1.8K
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
1.8K
Accuracy and Errors in Hypothesis Testing01:13

Accuracy and Errors in Hypothesis Testing

708
Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...
708
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

702
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
702
Bonferroni Test01:10

Bonferroni Test

2.5K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
2.5K
Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs01:15

Bioequivalence Experimental Study Designs: Repeated Measures, Cross-Over, Carry-Over, and Latin Square Designs

412
Bioequivalence experimental study designs play a pivotal role in testing the effectiveness of various treatments. Key among these are the repeated measures, cross-over, carry-over, and Latin square designs. In the repeated measures design, each subject receives all treatments, allowing for temporal comparisons. This type of design is useful in reducing variability but requires careful planning to avoid bias.The cross-over design, an economical method, involves sequential administration of...
412

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrative learning of individualized treatment rules from multiple studies with partially overlapping treatments.

Biometrics·2026
Same author

SEMIPARAMETRIC ANALYSIS OF INTERVAL-CENSORED DATA SUBJECT TO INACCURATE DIAGNOSES WITH A TERMINAL EVENT.

The annals of applied statistics·2026
Same author

DYNAMIC CLASSIFICATION OF LATENT DISEASE PROGRESSION WITH AUXILIARY SURROGATE LABELS.

The annals of applied statistics·2026
Same author

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection.

Journal of machine learning research : JMLR·2026
Same author

Data fusion methods for the heterogeneity of treatment effect and confounding function.

Bernoulli : official journal of the Bernoulli Society for Mathematical Statistics and Probability·2026
Same author

Leveraging precision medicine analytics to optimize inflammation reduction and enhance physical function in older adults.

The journals of gerontology. Series A, Biological sciences and medical sciences·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Apr 21, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K

Power calculation for comparing diagnostic accuracies in a multi-reader, multi-test design.

Eunhee Kim1, Zheng Zhang, Youdan Wang

  • 1Department of Biostatistics and Center for Statistical Sciences, Brown University, Providence, Rhode Island, U.S.A.

Biometrics
|October 31, 2014
PubMed
Summary
This summary is machine-generated.

This study introduces a new power formula for comparing correlated areas under the ROC curve (AUC) in multi-reader, multi-test diagnostic accuracy studies. The method enhances sample size and power calculations for these complex research designs.

Keywords:
Multi-readerMulti-test designPowerReceiver operating characteristic curveSample sizeU-statistics

More Related Videos

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes
05:58

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Published on: March 22, 2022

3.8K
Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease
06:16

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

1.2K

Related Experiment Videos

Last Updated: Apr 21, 2026

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education
09:00

Author Spotlight: Validation of SICOLE-R for Assessing Cognitive and Reading Skills in Spanish-Speaking Children and Its Role in Personalized Education

Published on: August 16, 2024

1.3K
Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes
05:58

Evaluation of a Point-of-Care Testing Analyzer for Measuring Peripheral Blood Leukocytes

Published on: March 22, 2022

3.8K
Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease
06:16

Signal Acquisition, Score Interpretation, and Economics of a Non-Invasive Point-of-Care Test for Coronary Artery Disease

Published on: August 9, 2024

1.2K

Area of Science:

  • Biostatistics
  • Medical Diagnostics
  • Statistical Modeling

Background:

  • Receiver operating characteristic (ROC) analysis is crucial for evaluating diagnostic test performance.
  • The multi-reader, multi-test design is common for assessing diagnostic accuracy but lacks robust sample size and power methodologies.
  • Existing analytical approaches for this design have limitations regarding power and sample size considerations.

Purpose of the Study:

  • To develop a power formula for comparing correlated areas under the ROC curve (AUC) within a multi-reader, multi-test framework.
  • To provide a method for accurate sample size and power estimations in diagnostic accuracy research.
  • To extend existing nonparametric approaches for analyzing correlated AUCs.

Main Methods:

  • Developed a power formula based on the asymptotic distribution of nonparametric AUCs.
  • Extended DeLong et al.'s approach for estimating and comparing correlated AUCs.
  • Utilized simulation studies to validate the proposed power formula's performance.

Main Results:

  • The proposed power formula accurately estimates sample size and power for multi-reader, multi-test designs.
  • The nonparametric approach effectively compares correlated AUCs.
  • Simulation results demonstrate the reliability of the developed power formula.

Conclusions:

  • The new power formula provides a valuable tool for researchers conducting diagnostic accuracy studies using the multi-reader, multi-test design.
  • This methodology improves the planning and statistical rigor of studies evaluating diagnostic tests.
  • The findings facilitate more efficient and reliable assessment of diagnostic test performance.