Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance

Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...

Sign Test for Matched Pairs

Sign Test for Matched Pairs

The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...

Reliability and Validity

Reliability and Validity

Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.

Multiple Comparison Tests

Multiple Comparison Tests

Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...

Receiver Operating Characteristic Plot

Receiver Operating Characteristic Plot

A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...

Bonferroni Test

Bonferroni Test

The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

From Turing to ChatGPT: The origins and transformations of medical artificial intelligence

Nephrologie & therapeutique·2026

Same author

Cultural bias in large language models' ability to follow neuroradiology guidelines.

European radiology·2026

Same author

Sante publique (Vandoeuvre-les-Nancy, France)·2026

Same author

[The role of nurses in the conservative management of advanced chronic kidney disease].

Revue de l'infirmiere·2026

Same author

Living alone and risk of dementia, cognitive decline, and institutionalization in the MEMENTO cohort.

Frontiers in aging·2026

Same author

Discovery of gene-alcohol interaction loci influencing blood pressure in 1.1 million individuals from multiple populations.

Research square·2026

Same journal

[Abdominal pain, fever and arthralgia in a 49-year-old woman].

La Revue de medecine interne·2026

Same journal

[Cardiorespiratory functional disorders: A transnosologic approach].

La Revue de medecine interne·2026

Same journal

[Diagnostic evaluation for suspected polycythemia].

La Revue de medecine interne·2026

Same journal

Heart involvements in systemic sclerosis beyond pulmonary hypertension: From conduction, rhythm and function defects to coronary artery disease.

La Revue de medecine interne·2026

Same journal

[Acute intermittent porphyria: When diagnostic errance jeopardizes patient health].

La Revue de medecine interne·2026

Same journal

Autosomal dominant polycystic kidney disease: Current perspectives in 2026.

La Revue de medecine interne·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 21, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

[The expert panel for Script Concordance Tests: A truly adequate reference?]

Luc Dauchet¹, Raphaël Bentegeac¹, Haress Ghauss¹

¹Service de santé publique, épidémiologie, économie de la santé et prévention, CHU de Lille, 59000 Lille, France; UMR1167 RID-AGE, Institut Pasteur de Lille, Inserm, université de Lille, CHU de Lille, 59000 Lille, France.

La Revue De Medecine Interne

|July 10, 2024

Summary

This summary is machine-generated.

Script Concordance Tests (SCTs) evaluate medical students' clinical reasoning in uncertainty. However, physician response biases may impact the reliability of SCT scoring scales.

Keywords:

Bayesian reasoning Clinical reasoning Expert panel Panel d’experts Probabilistic reasoning Raisonnement bayésien Raisonnement clinique Raisonnement probabiliste Script concordance test TCS Tests de concordance de script

More Related Videos

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Related Experiment Videos

Last Updated: Jun 21, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Area of Science:

Medical Education
Cognitive Psychology
Clinical Reasoning Assessment

Context:

The French National Ranking Exam for medical students introduced Script Concordance Tests (SCTs) in 2024.
SCTs aim to assess clinical reasoning in uncertain medical scenarios.

Purpose:

To evaluate the impact of new information on diagnostic hypothesis probability.
To model clinical reasoning using probabilistic (Bayesian) principles.

Summary:

SCTs present authentic clinical scenarios and assess how students adjust probability estimates with new data.
Scoring relies on response distributions from expert physicians, not a fixed correct answer.
Human probabilistic reasoning is prone to biases, potentially affecting expert-based scoring.

Impact:

Raises concerns about the validity of expert panels for scoring SCTs due to cognitive biases.
Highlights the need to investigate alternative or bias-corrected scoring methods for clinical reasoning assessments.
Suggests a potential limitation in using SCTs as a definitive measure of unbiased clinical judgment.