Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

Contingency Table

Contingency Table

A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...

McNemar's Test

McNemar's Test

McNemar's Test is a nonparametric statistical test used to determine if there is a significant difference in proportions between two related groups when the outcome is binary (e.g., yes/no, success/failure). It is beneficial when we have paired data, such as pre-test/post-test designs, where the same subjects are measured under two different conditions. The test is named after the statistician Quinn McNemar, who introduced it in 1947. It is commonly used in situations where subjects are...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Comparing the Survival Analysis of Two or More Groups

Comparing the Survival Analysis of Two or More Groups

Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Sample size calculation for comparing two screening tests when the gold standard is missing at random.

Statistics in medicine·2024

Same author

Weighted generalized score test for comparing predictive values in the presence of verification bias.

Statistics in medicine·2022

Same author

Joint comparison of the predictive values of multiple binary diagnostic tests: an extension of McNemar's test.

Journal of biopharmaceutical statistics·2022

Same author

Weighted McNemar's test for the comparison of two screening tests in the presence of verification bias.

Statistics in medicine·2022

Same author

Nonparametric inference of the area under ROC curve under two-phase cluster sampling.

Journal of biopharmaceutical statistics·2021

Same author

Simple confidence interval and region formulas for comparing diagnostic likelihood ratios under a paired design.

Biometrical journal. Biometrische Zeitschrift·2021

Same journal

Correction.

Journal of biopharmaceutical statistics·2026

Same journal

Leveraging external controls in clinical trials: estimands, estimation, assumptions.

Journal of biopharmaceutical statistics·2026

Same journal

Special issue of nonclinical statistics in regulatory applications guest editors' notes.

Journal of biopharmaceutical statistics·2026

Same journal

Comparison of flexible parametric modeling and nonparametric methods to estimate restricted mean survival time: A simulation study.

Journal of biopharmaceutical statistics·2026

Same journal

Simulated treatment comparisons with jackknife pseudo values for estimating population-adjusted marginal treatment effects.

Journal of biopharmaceutical statistics·2026

Same journal

Sample sizes for randomized controlled trials utilizing Bayesian response adaptive randomization for continuous outcomes.

Journal of biopharmaceutical statistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 19, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Assess predictive values of a binary diagnostic test under a nested case-control design.

¹Department of Epidemiology and Biostatistics, College of Public Health, University of South Florida, Tampa, Florida, USA.

Journal of Biopharmaceutical Statistics

|September 21, 2021

Summary

This summary is machine-generated.

For rare diseases, a nested case-control design offers greater efficiency than random sampling for evaluating diagnostic tests. This study provides variance formulas for inverse probability weighting (IPW) estimators, showing significant variance reduction.

Keywords:

Predictive values inverse probability weighting nested case–control design relative efficiency weighted logistic regression

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

Related Experiment Videos

Last Updated: Oct 19, 2025

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

Area of Science:

Biostatistics
Epidemiology
Diagnostic Test Evaluation

Background:

Diagnostic test predictive values are typically assessed using random sampling.
Rare diseases necessitate more efficient designs like nested case-control studies, which oversample cases.
Direct estimators for predictive values in nested case-control designs are biased due to case oversampling.

Purpose of the Study:

To derive asymptotic variance formulas for inverse probability weighting (IPW) estimators of predictive values.
To quantify the efficiency gains of nested case-control designs over simple random sampling for rare diseases.
To address the lack of variance expressions for IPW estimators in this context.

Main Methods:

Developed asymptotic variance formulas for IPW estimators of predictive values.
Incorporated the variance of estimated weights into the variance formulas.
Compared efficiency gains between nested case-control and random sampling designs.
Validated formulas via simulation and a real-world D-dimer test accuracy example.

Main Results:

The proposed variance formulas accurately estimate the precision of IPW estimators.
Nested case-control designs demonstrate substantial variance reduction for rare diseases compared to random sampling.
The variance formulas account for the uncertainty introduced by estimated weights.

Conclusions:

The derived variance formulas are crucial for accurate assessment of predictive values using IPW in nested case-control studies.
Nested case-control designs are highly efficient for evaluating diagnostic tests in rare disease populations.
This work provides essential tools for biostatisticians and epidemiologists studying diagnostic accuracy.