Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Response Letter to the Editor Regarding Article "Orthopaedic Trauma Association Annual Meetings Consistently Provide Academic Value for Attendees".

Journal of orthopaedic trauma·2026

Same author

Redefining Orthopaedic Expectations of Pediatric Fracture Reductions Performed in Hybrid Emergency Departments.

Journal of pediatric orthopedics·2026

Same author

Computed Tomography Overestimates Roentgenographic Posterior Tilt in Geriatric Garden I/II Femoral Neck Fractures.

The Journal of the American Academy of Orthopaedic Surgeons·2026

Same author

Buckle Up! Formal Restrictions Are Not Required After Pediatric Distal Radius Buckle Fractures.

Journal of pediatric orthopedics·2026

Same author

Orthopaedic Trauma Association Annual Meetings Consistently Provide Academic Value for Attendees.

Journal of orthopaedic trauma·2025

Same author

The Burden of Surgical Site Infections With Pathogens Presumably Resistant to Perioperative Prophylaxis in Orthopedic Tumor Surgery: Secondary Analysis of the Prophylactic Antibiotic Regimens in Tumor Surgery (PARITY) Trial.

The Journal of infectious diseases·2025

Same journal

Reframing Fixation Strategy in Total Knee Arthroplasty With Tibial Bone Density as a Central Criterion.

JB & JS open access·2026

Same journal

Radiographic Progression of Medial Ankle Osteoarthritis: The Influence of Global Coronal Alignment in a 7-Year Longitudinal Cohort.

JB & JS open access·2026

Same journal

5-Year Results of an Implantable Shock Absorber Demonstrate Durable Outcomes in Patients with Medial Knee Osteoarthritis.

JB & JS open access·2026

Same journal

Marked Increase in Anterior Ulnar Nerve Displacement at ≥90° of Elbow Flexion in Healthy Children.

JB & JS open access·2026

Same journal

Passive Laxity in Patients With Subjective Instability Following a Cruciate-Retaining Total Knee Arthroplasty: A Pilot Study.

JB & JS open access·2026

Same journal

Total Ankle Arthroplasty: A Comparative Review of Surgical Approaches and Outcomes.

JB & JS open access·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 18, 2026

The Transition to an Anterior-Based Muscle Sparing Approach Improves Early Postoperative Function but is Associated with a Learning Curve

The Transition to an Anterior-Based Muscle Sparing Approach Improves Early Postoperative Function but is Associated with a Learning Curve

Published on: September 7, 2022

ChatGPT-4o is Not a Reliable Study Source for Orthopaedic Surgery Residents.

Neil Jain¹, Caleb Gottlich², John Fisher²

¹Department of Orthopaedic Surgery, St. Luke's University Health Network, Bethlehem, Pennsylvania.

JB & JS Open Access

|September 10, 2025

Summary

This summary is machine-generated.

Artificial intelligence (AI) tool ChatGPT-4o shows inconsistent performance on orthopaedic surgery exams, scoring similarly to residents but with flawed explanations. Its use for medical education requires further validation.

More Related Videos

An Anatomical Study of Nerves at Risk During Minimally Invasive Hallux Valgus Surgery

An Anatomical Study of Nerves at Risk During Minimally Invasive Hallux Valgus Surgery

Published on: February 17, 2018

Three-Dimensional Preoperative Virtual Planning in Derotational Proximal Femoral Osteotomy

Three-Dimensional Preoperative Virtual Planning in Derotational Proximal Femoral Osteotomy

Published on: February 17, 2023

Related Experiment Videos

Last Updated: Jan 18, 2026

The Transition to an Anterior-Based Muscle Sparing Approach Improves Early Postoperative Function but is Associated with a Learning Curve

The Transition to an Anterior-Based Muscle Sparing Approach Improves Early Postoperative Function but is Associated with a Learning Curve

Published on: September 7, 2022

An Anatomical Study of Nerves at Risk During Minimally Invasive Hallux Valgus Surgery

An Anatomical Study of Nerves at Risk During Minimally Invasive Hallux Valgus Surgery

Published on: February 17, 2018

Three-Dimensional Preoperative Virtual Planning in Derotational Proximal Femoral Osteotomy

Three-Dimensional Preoperative Virtual Planning in Derotational Proximal Femoral Osteotomy

Published on: February 17, 2023

Area of Science:

Medical Education
Artificial Intelligence in Medicine
Orthopaedic Surgery

Background:

The adoption of AI platforms like ChatGPT is rising among medical residents for educational purposes.
Previous ChatGPT versions underperformed compared to orthopaedic surgery residents on exams, particularly with image-based questions.
ChatGPT-4o, a newer model, aims to address these limitations but requires evaluation.

Purpose of the Study:

To assess ChatGPT-4o's accuracy in answering Orthopaedic In-Training Examination (OITE) questions.
To evaluate the educational quality of ChatGPT-4o's explanations for orthopaedic surgery trainees.

Main Methods:

ChatGPT-4o processed OITE questions from 2020-2022.
The AI's raw scores were compared against ACGME-accredited orthopaedic resident performance.
Answer explanations were analyzed for consistency with AAOS expert content, categorizing responses as ideal, inadequate, or unacceptable.

Main Results:

ChatGPT-4o achieved scores of 68.8% (2020), 63.4% (2021), and 70.1% (2022), comparable to PGY-5, PGY2-3, and PGY-4 residents, respectively.
Ideal response quality (correct answer with consistent explanation) was achieved for 58.7% of questions.
Performance on media-based questions (60.0%) was significantly lower than non-media questions (73.1%).

Conclusions:

ChatGPT-4o demonstrates inconsistent performance on the OITE, with a significant portion of responses being unacceptable or lacking adequate explanations.
The AI's limitations in handling media-based orthopaedic surgery questions persist.
The efficacy of using ChatGPT for orthopaedic surgery resident education remains unvalidated.