Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Kendall's Coefficient of Concordance01:20

Kendall's Coefficient of Concordance

310
Kendall's Coefficient of Concordance (W), also known as Kendall's W, is a non-parametric statistical measure used to assess the agreement or concordance between multiple raters or judges when they rank a set of items. It is often used when you have ordinal data (ranks) and you want to see if there is consistency or consensus among the raters. It is widely applied in research areas such as psychology, medicine, and social sciences, where multiple judges are asked to rank or rate subjects...
310
Sign Test for Matched Pairs01:17

Sign Test for Matched Pairs

124
The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...
124
Reliability and Validity01:29

Reliability and Validity

12.7K
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
12.7K
Multiple Comparison Tests01:13

Multiple Comparison Tests

3.9K
Multiple comparison test, abbreviated as MCT, is a post hoc analysis generally performed after comparing multiple samples with one or more tests. An MCT will help identify a significantly different sample among multiple samples or a factor among multiple factors.
It would be easy to compare two samples using a significance alpha level of 0.05. In other words, there is only one sample pair to be compared. However, it would be difficult to identify a significantly different sample if the number...
3.9K
Receiver Operating Characteristic Plot01:15

Receiver Operating Characteristic Plot

127
A ROC (Receiver Operating Characteristic) plot is a graphical tool used to assess the performance of a binary classification model by illustrating the trade-off between sensitivity (true positive rate) and specificity (false positive rate). By plotting sensitivity against 1 - specificity across various threshold settings, the ROC curve shows how well the model distinguishes between classes, with a curve closer to the top-left corner indicating a more accurate model. The area under the ROC curve...
127
Bonferroni Test01:10

Bonferroni Test

2.7K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

From Turing to ChatGPT: The origins and transformations of medical artificial intelligence

Nephrologie & therapeutique·2026
Same author

Cultural bias in large language models' ability to follow neuroradiology guidelines.

European radiology·2026
Same author

Sante publique (Vandoeuvre-les-Nancy, France)·2026
Same author

[The role of nurses in the conservative management of advanced chronic kidney disease].

Revue de l'infirmiere·2026
Same author

Living alone and risk of dementia, cognitive decline, and institutionalization in the MEMENTO cohort.

Frontiers in aging·2026
Same author

Discovery of gene-alcohol interaction loci influencing blood pressure in 1.1 million individuals from multiple populations.

Research square·2026
Same journal

[Abdominal pain, fever and arthralgia in a 49-year-old woman].

La Revue de medecine interne·2026
Same journal

[Cardiorespiratory functional disorders: A transnosologic approach].

La Revue de medecine interne·2026
Same journal

[Diagnostic evaluation for suspected polycythemia].

La Revue de medecine interne·2026
Same journal

Heart involvements in systemic sclerosis beyond pulmonary hypertension: From conduction, rhythm and function defects to coronary artery disease.

La Revue de medecine interne·2026
Same journal

[Acute intermittent porphyria: When diagnostic errance jeopardizes patient health].

La Revue de medecine interne·2026
Same journal

Autosomal dominant polycystic kidney disease: Current perspectives in 2026.

La Revue de medecine interne·2026
See all related articles

Related Experiment Video

Updated: Jun 21, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

541

[The expert panel for Script Concordance Tests: A truly adequate reference?]

Luc Dauchet1, Raphaël Bentegeac1, Haress Ghauss1

  • 1Service de santé publique, épidémiologie, économie de la santé et prévention, CHU de Lille, 59000 Lille, France; UMR1167 RID-AGE, Institut Pasteur de Lille, Inserm, université de Lille, CHU de Lille, 59000 Lille, France.

La Revue De Medecine Interne
|July 10, 2024
PubMed
Summary
This summary is machine-generated.

Script Concordance Tests (SCTs) evaluate medical students' clinical reasoning in uncertainty. However, physician response biases may impact the reliability of SCT scoring scales.

Keywords:
Bayesian reasoningClinical reasoningExpert panelPanel d’expertsProbabilistic reasoningRaisonnement bayésienRaisonnement cliniqueRaisonnement probabilisteScript concordance testTCSTests de concordance de script

More Related Videos

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm
09:49

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

14.1K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

440

Related Experiment Videos

Last Updated: Jun 21, 2025

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing
15:00

A Tablet-Based Curriculum-Based Measurement Protocol for Kindergarten Writing

Published on: February 7, 2025

541
Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm
09:49

Holistic Facial Composite Creation and Subsequent Video Line-up Eyewitness Identification Paradigm

Published on: December 24, 2015

14.1K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

440

Area of Science:

  • Medical Education
  • Cognitive Psychology
  • Clinical Reasoning Assessment

Context:

  • The French National Ranking Exam for medical students introduced Script Concordance Tests (SCTs) in 2024.
  • SCTs aim to assess clinical reasoning in uncertain medical scenarios.

Purpose:

  • To evaluate the impact of new information on diagnostic hypothesis probability.
  • To model clinical reasoning using probabilistic (Bayesian) principles.

Summary:

  • SCTs present authentic clinical scenarios and assess how students adjust probability estimates with new data.
  • Scoring relies on response distributions from expert physicians, not a fixed correct answer.
  • Human probabilistic reasoning is prone to biases, potentially affecting expert-based scoring.

Impact:

  • Raises concerns about the validity of expert panels for scoring SCTs due to cognitive biases.
  • Highlights the need to investigate alternative or bias-corrected scoring methods for clinical reasoning assessments.
  • Suggests a potential limitation in using SCTs as a definitive measure of unbiased clinical judgment.