Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Guidelines for Writing Outcome01:11

Guidelines for Writing Outcome

4.0K
When developing expected outcomes for a patient care plan, the nurse should adhere to the following recommendations:
Patient outcomes reflect the patient's response to the goal rather than what the nurse aims to achieve. Terminology should be observable and measurable to avoid the reader's interpretation. The desired outcome should be realistic and achievable in the designated care timeframe. Expected outcomes should align with adjunctive therapies. The outcome should enhance care...
4.0K
Guidelines for Nursing Documentation I01:30

Guidelines for Nursing Documentation I

2.0K
Quality documentation and reporting share essential characteristics that ensure they are practical and valuable resources for those who use them. These characteristics are:
Factual:  
The following points emphasize the significance of upholding accurate and unbiased documentation in healthcare.
2.0K
Guidelines for Sketching a Curve01:23

Guidelines for Sketching a Curve

144
Curve sketching is a systematic method for understanding the overall behavior of a function by analyzing its key mathematical features. A function defines a curve on the coordinate plane, where the horizontal axis represents the input variable and the vertical axis represents the output. The process begins by determining the domain, which specifies the set of input values for which the function is defined and establishes the horizontal extent of the graph.Intercepts with the horizontal and...
144
Guidelines for Nursing Documentation II01:26

Guidelines for Nursing Documentation II

1.8K
Effective documentation is an integral part of nursing practice. Here are some essential guidelines to follow when documenting patient care:
Timely documentation is crucial to ensure continuity of care for patients. Any delays in recording or reporting medical information can result in medical errors and even adverse patient outcomes. From medication administration to diagnostic test results, every detail must be accurately and promptly documented to provide the best possible care for patients.
1.8K
Legal Guidelines for Documentation01:06

Legal Guidelines for Documentation

2.3K
The legal guidelines for nursing documentation are essential for ensuring accurate, professional, and ethical recording of patient care. The guidelines are discussed here:
2.3K
Guidelines and Strategies for Safe Computer Charting01:18

Guidelines and Strategies for Safe Computer Charting

2.7K
The guidelines and strategies provided by the American Nurses Association (ANA) and the Canadian Nurses Association (CNA) offer essential principles for ensuring safe and secure computer charting systems in healthcare settings. Let's break down each recommendation:
Maintain Confidentiality and Security:
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Response to "Evaluating Multimodal Large Language Models in Neuroradiology: Methodological Considerations".

Korean journal of radiology·2026
Same author

Evaluating the Effect of Sorafenib on Gd-EOB-DTPA-mediated Contrast Enhancement: An Experimental Study using DCE-MRI.

Molecular imaging and biology·2026
Same author

Erratum: Evaluating the Accuracy and Diagnostic Reasoning of Multimodal Large Language Models in Interpreting Neuroradiology Cases From <i>RadioGraphics</i>.

Korean journal of radiology·2026
Same author

Brain morphological changes in acquired hearing loss: A surface-based morphometry study.

PloS one·2026
Same author

Insufficient reporting quality in large language model studies in the field of radiology.

Insights into imaging·2026
Same author

Sex-Based Differences in Disease Burden and Phenotype in CADASIL: A Multicenter Study of 368 Korean Patients.

Neurology. Genetics·2026
Same journal

Comment on "White matter microstructural alterations in obstructive sleep apnea assessed by time-dependent diffusion MRI".

Japanese journal of radiology·2026
Same journal

Early prediction of joint space narrowing in rheumatoid arthritis using AI-quantified bilateral joint space asymmetry on hand radiography.

Japanese journal of radiology·2026
Same journal

Moving beyond SUV<sub>max</sub>: volumetric and heterogeneous dose evaluation in <sup>18</sup>F-BPA PET-guided BNCT.

Japanese journal of radiology·2026
Same journal

From shadow AI to accountable assistance: a framework for level 2 AI-supported peer review in medical journals.

Japanese journal of radiology·2026
Same journal

Whole-body CT characteristics of patients with severe fever with thrombocytopenia syndrome.

Japanese journal of radiology·2026
Same journal

Noninvasive assessment of multistate intracranial and cervical aneurysms using PCCTA.

Japanese journal of radiology·2026
See all related articles

Related Experiment Video

Updated: Feb 7, 2026

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible
10:42

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

Published on: January 28, 2020

7.0K

Evaluating guideline adherence in LLM studies using LLMs.

Ji Su Ko1, Hwon Heo2, Chong Hyun Suh3

  • 1Department of Radiology, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.

Japanese Journal of Radiology
|February 5, 2026
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) show promise in evaluating medical research reporting quality, accurately extracting explicit details. However, they struggle with context-dependent information, indicating areas for future LLM development in scientific analysis.

Keywords:
Artificial intelligenceChecklistComputer-assistedDeep learningImage interpretation

More Related Videos

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens
13:47

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Published on: May 19, 2020

3.9K
In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells
09:20

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

Published on: July 23, 2010

16.4K

Related Experiment Videos

Last Updated: Feb 7, 2026

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible
10:42

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

Published on: January 28, 2020

7.0K
Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens
13:47

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Published on: May 19, 2020

3.9K
In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells
09:20

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

Published on: July 23, 2010

16.4K

Area of Science:

  • Medical research reporting standards
  • Artificial intelligence in healthcare
  • Natural Language Processing (NLP) applications

Background:

  • The MI-CLEAR-LLM checklist aims to standardize reporting quality for medical research involving large language models (LLMs).
  • Evaluating LLM adherence to reporting guidelines is crucial for transparency and reproducibility in medical studies.
  • Previous methods for checklist assessment were manual and time-consuming.

Purpose of the Study:

  • To assess the capability of advanced LLMs, specifically GPT-4o and o1, in automatically evaluating adherence to the MI-CLEAR-LLM checklist.
  • To compare the performance of text-based versus image-based LLM modalities in this assessment task.
  • To determine the consistency and accuracy of LLM-driven checklist evaluations.

Main Methods:

  • Analysis of 159 medical research articles focusing on LLM applications.
  • Testing GPT-4o and o1 models in both text and image modalities using structured prompts with reasoning strategies.
  • Utilizing human evaluations as a reference standard and conducting three independent trials per model for consistency assessment.

Main Results:

  • Both GPT-4o and o1 achieved high accuracy (85.9-100%) for explicit LLM specifications and good accuracy (63.6-95%) for stochasticity parameters.
  • Performance decreased for context-dependent items like prompt session handling (51.5-70.7%) and test data independence (59.6-76.8%).
  • Text-based models demonstrated superior inter-trial consistency (GPT-4o-text: κ=0.926), while image-based models showed greater variability (κ=0.402-0.772).

Conclusions:

  • LLMs possess significant potential for automating the assessment of reporting quality in medical research, especially for structured information.
  • Challenges remain in LLM performance for extracting context-dependent or inferential reporting details.
  • Further refinement of LLMs is needed to improve their ability to critically evaluate complex research reporting elements.