Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

814
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
814
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

354
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
354
Contingency Table01:29

Contingency Table

2.8K
A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...
2.8K
McNemar's Test01:23

McNemar's Test

509
McNemar's Test is a nonparametric statistical test used to determine if there is a significant difference in proportions between two related groups when the outcome is binary (e.g., yes/no, success/failure). It is beneficial when we have paired data, such as pre-test/post-test designs, where the same subjects are measured under two different conditions. The test is named after the statistician Quinn McNemar, who introduced it in 1947. It is commonly used in situations where subjects are...
509
Survival Tree01:19

Survival Tree

183
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
183
Comparing the Survival Analysis of Two or More Groups01:20

Comparing the Survival Analysis of Two or More Groups

350
Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and...
350

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Sample size calculation for comparing two screening tests when the gold standard is missing at random.

Statistics in medicine·2024
Same author

Weighted generalized score test for comparing predictive values in the presence of verification bias.

Statistics in medicine·2022
Same author

Joint comparison of the predictive values of multiple binary diagnostic tests: an extension of McNemar's test.

Journal of biopharmaceutical statistics·2022
Same author

Weighted McNemar's test for the comparison of two screening tests in the presence of verification bias.

Statistics in medicine·2022
Same author

Nonparametric inference of the area under ROC curve under two-phase cluster sampling.

Journal of biopharmaceutical statistics·2021
Same author

Simple confidence interval and region formulas for comparing diagnostic likelihood ratios under a paired design.

Biometrical journal. Biometrische Zeitschrift·2021
Same journal

Correction.

Journal of biopharmaceutical statistics·2026
Same journal

Leveraging external controls in clinical trials: estimands, estimation, assumptions.

Journal of biopharmaceutical statistics·2026
Same journal

Special issue of nonclinical statistics in regulatory applications guest editors' notes.

Journal of biopharmaceutical statistics·2026
Same journal

Comparison of flexible parametric modeling and nonparametric methods to estimate restricted mean survival time: A simulation study.

Journal of biopharmaceutical statistics·2026
Same journal

Simulated treatment comparisons with jackknife pseudo values for estimating population-adjusted marginal treatment effects.

Journal of biopharmaceutical statistics·2026
Same journal

Sample sizes for randomized controlled trials utilizing Bayesian response adaptive randomization for continuous outcomes.

Journal of biopharmaceutical statistics·2026
See all related articles

Related Experiment Video

Updated: Oct 19, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K

Assess predictive values of a binary diagnostic test under a nested case-control design.

Yougui Wu1

  • 1Department of Epidemiology and Biostatistics, College of Public Health, University of South Florida, Tampa, Florida, USA.

Journal of Biopharmaceutical Statistics
|September 21, 2021
PubMed
Summary
This summary is machine-generated.

For rare diseases, a nested case-control design offers greater efficiency than random sampling for evaluating diagnostic tests. This study provides variance formulas for inverse probability weighting (IPW) estimators, showing significant variance reduction.

Keywords:
Predictive valuesinverse probability weightingnested case–control designrelative efficiencyweighted logistic regression

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K
Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

271

Related Experiment Videos

Last Updated: Oct 19, 2025

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.3K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.7K
Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

271

Area of Science:

  • Biostatistics
  • Epidemiology
  • Diagnostic Test Evaluation

Background:

  • Diagnostic test predictive values are typically assessed using random sampling.
  • Rare diseases necessitate more efficient designs like nested case-control studies, which oversample cases.
  • Direct estimators for predictive values in nested case-control designs are biased due to case oversampling.

Purpose of the Study:

  • To derive asymptotic variance formulas for inverse probability weighting (IPW) estimators of predictive values.
  • To quantify the efficiency gains of nested case-control designs over simple random sampling for rare diseases.
  • To address the lack of variance expressions for IPW estimators in this context.

Main Methods:

  • Developed asymptotic variance formulas for IPW estimators of predictive values.
  • Incorporated the variance of estimated weights into the variance formulas.
  • Compared efficiency gains between nested case-control and random sampling designs.
  • Validated formulas via simulation and a real-world D-dimer test accuracy example.

Main Results:

  • The proposed variance formulas accurately estimate the precision of IPW estimators.
  • Nested case-control designs demonstrate substantial variance reduction for rare diseases compared to random sampling.
  • The variance formulas account for the uncertainty introduced by estimated weights.

Conclusions:

  • The derived variance formulas are crucial for accurate assessment of predictive values using IPW in nested case-control studies.
  • Nested case-control designs are highly efficient for evaluating diagnostic tests in rare disease populations.
  • This work provides essential tools for biostatisticians and epidemiologists studying diagnostic accuracy.