Zero-Shot PI-RADS Version 2.1 Scoring with ChatGPT-4 Turbo and Llama 3: Diagnostic Performance and Agreement with Abdominal Radiologists
Summary
This summary is machine-generated. Large language models (LLMs) such as ChatGPT-4 Turbo and Llama 3 show high agreement with radiologists when assigning Prostate Imaging Reporting and Data System (PI-RADS) scores from prostate MRI reports, which may support prostate cancer diagnosis.
Area Of Science
- Radiology and Medical Imaging
- Artificial Intelligence in Healthcare
- Oncology and Prostate Cancer Diagnostics
Background
- Prostate MRI interpretation relies on standardized scoring systems like the Prostate Imaging Reporting and Data System (PI-RADS).
- Accurate PI-RADS scoring is crucial for diagnosing prostate cancer and guiding treatment decisions.
- Evaluating the role of emerging AI technologies, specifically large language models (LLMs), in clinical reporting is essential.
Purpose Of The Study
- To assess the diagnostic performance and inter-rater agreement of ChatGPT-4 Turbo and Llama 3 in assigning PI-RADS scores to prostate MRI reports.
- To compare the performance of these LLMs against experienced abdominal radiologists.
- To determine the potential of LLMs to enhance consistency and accuracy in prostate MRI reporting.
Main Methods
- A retrospective analysis of 500 structured prostate MRI reports was conducted, with original PI-RADS scores anonymized.
- Two LLMs (ChatGPT-4 Turbo and Llama 3) were employed using a standardized prompt to extract PI-RADS version 2.1 scores.
- Two abdominal radiologists independently assigned PI-RADS scores, with discrepancies resolved by a third radiologist; prostate biopsy results served as the reference standard.
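The two-reader design with third-reader adjudication can be sketched as a small helper. This is a minimal illustration, not the study's code: the function name `consensus_score` and its arguments are hypothetical, and it assumes PI-RADS categories are integers from 1 to 5.

```python
from typing import Optional

VALID_CATEGORIES = range(1, 6)  # PI-RADS assessment categories 1-5


def consensus_score(reader_a: int, reader_b: int,
                    adjudicator: Optional[int] = None) -> int:
    """Return a consensus PI-RADS category for one report.

    If the two independent readers agree, their shared score is used.
    If they disagree, the third reader's score decides (hypothetical
    reading of "discrepancies resolved by a third radiologist").
    """
    for score in (reader_a, reader_b):
        if score not in VALID_CATEGORIES:
            raise ValueError(f"invalid PI-RADS category: {score}")
    if reader_a == reader_b:
        return reader_a
    if adjudicator is None:
        raise ValueError("readers disagree; an adjudicator score is required")
    if adjudicator not in VALID_CATEGORIES:
        raise ValueError(f"invalid PI-RADS category: {adjudicator}")
    return adjudicator
```

For example, `consensus_score(3, 4, adjudicator=4)` resolves a 3-versus-4 disagreement in favor of the third reader's score.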
Main Results
- Both LLMs demonstrated high agreement with radiologists (94.7%-95.7% agreement, κ = 0.89-0.91).
- ChatGPT-4 Turbo tended to assign higher PI-RADS scores than the radiologists, a statistically significant difference (P < .005).
- Areas under the receiver operating characteristic curve (AUCs) for predicting prostate cancer were comparable: 0.79 for ChatGPT-4 Turbo and the original reports, and 0.78 for the radiologists and Llama 3.
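The three metrics reported above (percent agreement, Cohen's κ, and AUC) can be computed as in the following sketch. It rests on assumptions: the summary does not say whether κ was weighted, so unweighted Cohen's κ is used here, and the AUC is computed with the rank-based (Mann-Whitney) formulation that treats the PI-RADS score as an ordinal predictor. The score lists are invented toy data, not study data.

```python
from collections import Counter


def percent_agreement(a, b):
    """Fraction of reports on which two raters assign the same score."""
    return sum(x == y for x, y in zip(a, b)) / len(a)


def cohen_kappa(a, b):
    """Unweighted Cohen's kappa: agreement corrected for chance."""
    n = len(a)
    p_observed = percent_agreement(a, b)
    counts_a, counts_b = Counter(a), Counter(b)
    # Chance agreement: product of each rater's marginal proportions.
    p_expected = sum(counts_a[k] * counts_b[k] for k in counts_a) / (n * n)
    return (p_observed - p_expected) / (1 - p_expected)


def auc(scores, labels):
    """AUC as the probability that a cancer case outscores a non-cancer
    case, counting ties as one half (Mann-Whitney formulation)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Toy data (illustrative only): PI-RADS scores and biopsy outcomes.
radiologist = [2, 3, 4, 5, 4, 3, 2, 5]
llm = [2, 3, 4, 5, 5, 3, 2, 5]
biopsy_positive = [0, 1, 1, 1, 0, 0, 0, 1]

print(f"agreement = {percent_agreement(radiologist, llm):.3f}")
print(f"kappa     = {cohen_kappa(radiologist, llm):.3f}")
print(f"AUC (LLM) = {auc(llm, biopsy_positive):.3f}")
```

κ is lower than raw agreement because it discounts the matches two raters would produce by chance given how often each uses every category.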
Conclusions
- Large language models exhibit strong agreement with expert radiologists in PI-RADS scoring for prostate MRI.
- LLMs show potential to improve the accuracy and consistency of prostate MRI reporting, supporting oncological diagnosis.
- Further integration of LLMs could streamline radiology workflows and enhance diagnostic capabilities in prostate cancer detection.

