Updated: Apr 18, 2026

Clustered flexible calibration plots for binary outcomes using random effects modeling.

Lasai Barreñada¹,², Bavo De Cock Campo³, Laure Wynants¹,²,⁴

¹Department of Development and Regeneration, KU Leuven (https://ror.org/05f950310), Belgium.

Research Synthesis Methods | April 17, 2026

Summary

This summary is machine-generated.

Clinical prediction models validated on clustered data (for example, several centers or datasets) need calibration assessments that account for the clustering. This study introduces three methods for doing so: clustered group calibration, two-stage meta-analysis calibration, and mixed model calibration.

Keywords:

Binary outcomes; Calibration; Clustered data; Meta-analysis; Model validation; Prediction models

Area of Science:

Biostatistics
Clinical Epidemiology
Health Informatics

Background:

Clinical prediction models are increasingly evaluated across multiple clusters (centers or datasets).
Model calibration, the agreement between predicted risks and observed outcomes, is crucial for clinical decision-making.
Calibration performance often varies substantially between clusters.
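
To make the notion of cluster-level calibration concrete, the following is a minimal sketch of a grouped calibration check run separately within each cluster. It is not the authors' CG-C method; the function names (`grouped_calibration`, `calibration_by_cluster`), the quantile grouping, and the simulated two-center data are all assumptions chosen for illustration.

```python
import numpy as np

def grouped_calibration(pred, outcome, n_groups=5):
    """Mean predicted risk and observed event rate per risk-ordered group.

    Hypothetical helper: patients are sorted by predicted risk and split
    into n_groups groups of (nearly) equal size. n_groups should not
    exceed the number of patients, otherwise empty groups yield NaN.
    """
    pred = np.asarray(pred, dtype=float)
    outcome = np.asarray(outcome, dtype=float)
    order = np.argsort(pred)
    groups = np.array_split(order, n_groups)
    mean_pred = np.array([pred[g].mean() for g in groups])
    obs_rate = np.array([outcome[g].mean() for g in groups])
    return mean_pred, obs_rate

def calibration_by_cluster(pred, outcome, cluster, n_groups=5):
    """Apply grouped_calibration separately within each cluster label."""
    pred, outcome, cluster = map(np.asarray, (pred, outcome, cluster))
    return {
        c.item(): grouped_calibration(
            pred[cluster == c], outcome[cluster == c], n_groups
        )
        for c in np.unique(cluster)
    }

# Simulated example (assumed data): center "A" is well calibrated,
# center "B" has true risks that are systematically lower than predicted.
rng = np.random.default_rng(1)
pred = rng.uniform(0.05, 0.95, size=1000)
cluster = np.where(np.arange(1000) < 500, "A", "B")
true_risk = np.where(cluster == "A", pred, 0.6 * pred)
outcome = rng.binomial(1, true_risk)

for name, (mean_pred, obs_rate) in calibration_by_cluster(pred, outcome, cluster).items():
    print(name, np.round(mean_pred, 2), np.round(obs_rate, 2))
```

Pooling both centers into a single curve would hide the fact that only one of them is miscalibrated, which is the heterogeneity the study sets out to capture.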

Purpose of the Study:

To present and evaluate three novel approaches for assessing calibration that explicitly account for clustered data.
To compare the performance of these methods using a case study, simulation, and synthetic data.
To provide practical recommendations and tools for robust calibration assessment in multi-center studies.

Main Methods:

Three methods were developed: clustered group calibration (CG-C), two-stage meta-analysis calibration (2MA-C), and mixed model calibration (MIX-C).
Methods were evaluated using an external validation of an ovarian tumor malignancy risk model (N=2489).
Simulation and synthetic data studies were conducted to assess performance under known data structures.
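
The two-stage idea can be sketched in a deliberately simplified form. The summary does not give the details of 2MA-C, which uses splines to pool flexible curves; the sketch below instead pools only a linear calibration slope, fitting a logistic recalibration model per cluster (stage 1) and combining the slopes with a DerSimonian-Laird random-effects estimate (stage 2). All function names and the simulated data are assumptions.

```python
import numpy as np

def logit(p):
    """Log-odds transform, clipped to avoid infinities at 0 and 1."""
    p = np.clip(np.asarray(p, dtype=float), 1e-6, 1 - 1e-6)
    return np.log(p / (1 - p))

def fit_logistic_recalibration(pred, outcome, n_iter=25):
    """Stage 1: Newton-Raphson fit of logit(P(y=1)) = a + b * logit(pred).

    Returns the coefficients [a, b] and their estimated covariance matrix.
    """
    X = np.column_stack([np.ones(len(pred)), logit(pred)])
    y = np.asarray(outcome, dtype=float)
    beta = np.zeros(2)
    for _ in range(n_iter):
        mu = 1 / (1 + np.exp(-X @ beta))
        info = X.T @ (X * (mu * (1 - mu))[:, None])
        beta = beta + np.linalg.solve(info, X.T @ (y - mu))
    mu = 1 / (1 + np.exp(-X @ beta))
    info = X.T @ (X * (mu * (1 - mu))[:, None])
    return beta, np.linalg.inv(info)

def dersimonian_laird(est, var):
    """Stage 2: random-effects pooling of per-cluster estimates.

    Returns the pooled estimate, its standard error, and the
    between-cluster variance tau^2 (truncated at zero).
    """
    est, var = np.asarray(est, dtype=float), np.asarray(var, dtype=float)
    w = 1 / var
    fixed = np.sum(w * est) / np.sum(w)
    q = np.sum(w * (est - fixed) ** 2)
    c = np.sum(w) - np.sum(w ** 2) / np.sum(w)
    tau2 = max(0.0, (q - (len(est) - 1)) / c)
    w_re = 1 / (var + tau2)
    pooled = np.sum(w_re * est) / np.sum(w_re)
    se = np.sqrt(1 / np.sum(w_re))
    return pooled, se, tau2

# Simulated example (assumed data): six clusters whose true calibration
# slopes vary around 1.
rng = np.random.default_rng(0)
slopes, variances = [], []
for _ in range(6):
    pred = rng.uniform(0.05, 0.95, size=400)
    true_slope = 1.0 + rng.normal(0, 0.15)
    y = rng.binomial(1, 1 / (1 + np.exp(-true_slope * logit(pred))))
    beta, cov = fit_logistic_recalibration(pred, y)
    slopes.append(beta[1])
    variances.append(cov[1, 1])

pooled, se, tau2 = dersimonian_laird(slopes, variances)
print(f"pooled slope {pooled:.2f} (SE {se:.2f}), tau^2 {tau2:.3f}")
```

A mixed-model approach such as MIX-C would instead fit all clusters in one model with cluster-level random effects; that is not shown here.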

Main Results:

Mixed model calibration (MIX-C) and two-stage meta-analysis calibration (2MA-C) with splines produced overall curves closest to the true curve in simulations.
MIX-C generated cluster-specific curves closest to the truth in the synthetic data study.
Two-stage meta-analysis calibration (2MA-C) with splines demonstrated the best prediction interval coverage.
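
For orientation, a prediction interval in a random-effects meta-analysis is commonly approximated as below. The summary does not state how the study constructs its intervals, so this is the standard textbook form rather than the authors' formula; the symbols are assumptions: $\hat{\mu}$ is the pooled estimate, $\widehat{\mathrm{SE}}(\hat{\mu})$ its standard error, $\hat{\tau}^2$ the estimated between-cluster variance, and $K$ the number of clusters.

```latex
\hat{\mu} \;\pm\; t_{K-2,\,1-\alpha/2}\,
\sqrt{\hat{\tau}^{2} + \widehat{\mathrm{SE}}(\hat{\mu})^{2}}
```

Unlike a confidence interval for $\hat{\mu}$, this interval widens with $\hat{\tau}^2$, so it describes the range in which the calibration of a new cluster is expected to fall. Coverage is the proportion of such intervals that contain the true value.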

Conclusions:

The authors recommend 2MA-C with splines for overall curve estimation and prediction intervals, and MIX-C for cluster-specific curves, especially when per-cluster sample sizes are limited.
These methods facilitate flexible calibration plots with confidence and prediction intervals to assess calibration heterogeneity.
Ready-to-use code is provided to aid in the construction of summary flexible calibration curves.