Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Guidelines for Writing Outcome

Guidelines for Writing Outcome

When developing expected outcomes for a patient care plan, the nurse should adhere to the following recommendations:
Patient outcomes reflect the patient's response to the goal rather than what the nurse aims to achieve. Terminology should be observable and measurable to avoid the reader's interpretation. The desired outcome should be realistic and achievable in the designated care timeframe. Expected outcomes should align with adjunctive therapies. The outcome should enhance care...

Guidelines for Nursing Documentation I

Guidelines for Nursing Documentation I

Quality documentation and reporting share essential characteristics that ensure they are practical and valuable resources for those who use them. These characteristics are:
Factual:
The following points emphasize the significance of upholding accurate and unbiased documentation in healthcare.

Guidelines for Sketching a Curve

Guidelines for Sketching a Curve

Curve sketching is a systematic method for understanding the overall behavior of a function by analyzing its key mathematical features. A function defines a curve on the coordinate plane, where the horizontal axis represents the input variable and the vertical axis represents the output. The process begins by determining the domain, which specifies the set of input values for which the function is defined and establishes the horizontal extent of the graph.Intercepts with the horizontal and...

Guidelines for Nursing Documentation II

Guidelines for Nursing Documentation II

Effective documentation is an integral part of nursing practice. Here are some essential guidelines to follow when documenting patient care:
Timely documentation is crucial to ensure continuity of care for patients. Any delays in recording or reporting medical information can result in medical errors and even adverse patient outcomes. From medication administration to diagnostic test results, every detail must be accurately and promptly documented to provide the best possible care for patients.

Legal Guidelines for Documentation

Legal Guidelines for Documentation

The legal guidelines for nursing documentation are essential for ensuring accurate, professional, and ethical recording of patient care. The guidelines are discussed here:

Guidelines and Strategies for Safe Computer Charting

Guidelines and Strategies for Safe Computer Charting

The guidelines and strategies provided by the American Nurses Association (ANA) and the Canadian Nurses Association (CNA) offer essential principles for ensuring safe and secure computer charting systems in healthcare settings. Let's break down each recommendation:
Maintain Confidentiality and Security:

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Response to "Evaluating Multimodal Large Language Models in Neuroradiology: Methodological Considerations".

Korean journal of radiology·2026

Same author

Evaluating the Effect of Sorafenib on Gd-EOB-DTPA-mediated Contrast Enhancement: An Experimental Study using DCE-MRI.

Molecular imaging and biology·2026

Same author

Erratum: Evaluating the Accuracy and Diagnostic Reasoning of Multimodal Large Language Models in Interpreting Neuroradiology Cases From <i>RadioGraphics</i>.

Korean journal of radiology·2026

Same author

Brain morphological changes in acquired hearing loss: A surface-based morphometry study.

PloS one·2026

Same author

Insufficient reporting quality in large language model studies in the field of radiology.

Insights into imaging·2026

Same author

Sex-Based Differences in Disease Burden and Phenotype in CADASIL: A Multicenter Study of 368 Korean Patients.

Neurology. Genetics·2026

Same journal

Comment on "White matter microstructural alterations in obstructive sleep apnea assessed by time-dependent diffusion MRI".

Japanese journal of radiology·2026

Same journal

Early prediction of joint space narrowing in rheumatoid arthritis using AI-quantified bilateral joint space asymmetry on hand radiography.

Japanese journal of radiology·2026

Same journal

Moving beyond SUV<sub>max</sub>: volumetric and heterogeneous dose evaluation in <sup>18</sup>F-BPA PET-guided BNCT.

Japanese journal of radiology·2026

Same journal

From shadow AI to accountable assistance: a framework for level 2 AI-supported peer review in medical journals.

Japanese journal of radiology·2026

Same journal

Whole-body CT characteristics of patients with severe fever with thrombocytopenia syndrome.

Japanese journal of radiology·2026

Same journal

Noninvasive assessment of multistate intracranial and cervical aneurysms using PCCTA.

Japanese journal of radiology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 7, 2026

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

Published on: January 28, 2020

Evaluating guideline adherence in LLM studies using LLMs.

Ji Su Ko¹, Hwon Heo², Chong Hyun Suh³

¹Department of Radiology, Kangbuk Samsung Hospital, Sungkyunkwan University School of Medicine, Seoul, Republic of Korea.

Japanese Journal of Radiology

|February 5, 2026

Summary

This summary is machine-generated.

Large language models (LLMs) show promise in evaluating medical research reporting quality, accurately extracting explicit details. However, they struggle with context-dependent information, indicating areas for future LLM development in scientific analysis.

Keywords:

Artificial intelligence Checklist Computer-assisted Deep learning Image interpretation

More Related Videos

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Published on: May 19, 2020

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

Published on: July 23, 2010

Related Experiment Videos

Last Updated: Feb 7, 2026

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

A Postoperative Evaluation Guideline for Computer-Assisted Reconstruction of the Mandible

Published on: January 28, 2020

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Opsono-Adherence Assay to Evaluate Functional Antibodies in Vaccine Development Against Bacillus anthracis and Other Encapsulated Pathogens

Published on: May 19, 2020

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

In situ Subcellular Fractionation of Adherent and Non-adherent Mammalian Cells

Published on: July 23, 2010

Area of Science:

Medical research reporting standards
Artificial intelligence in healthcare
Natural Language Processing (NLP) applications

Background:

The MI-CLEAR-LLM checklist aims to standardize reporting quality for medical research involving large language models (LLMs).
Evaluating LLM adherence to reporting guidelines is crucial for transparency and reproducibility in medical studies.
Previous methods for checklist assessment were manual and time-consuming.

Purpose of the Study:

To assess the capability of advanced LLMs, specifically GPT-4o and o1, in automatically evaluating adherence to the MI-CLEAR-LLM checklist.
To compare the performance of text-based versus image-based LLM modalities in this assessment task.
To determine the consistency and accuracy of LLM-driven checklist evaluations.

Main Methods:

Analysis of 159 medical research articles focusing on LLM applications.
Testing GPT-4o and o1 models in both text and image modalities using structured prompts with reasoning strategies.
Utilizing human evaluations as a reference standard and conducting three independent trials per model for consistency assessment.

Main Results:

Both GPT-4o and o1 achieved high accuracy (85.9-100%) for explicit LLM specifications and good accuracy (63.6-95%) for stochasticity parameters.
Performance decreased for context-dependent items like prompt session handling (51.5-70.7%) and test data independence (59.6-76.8%).
Text-based models demonstrated superior inter-trial consistency (GPT-4o-text: κ=0.926), while image-based models showed greater variability (κ=0.402-0.772).

Conclusions:

LLMs possess significant potential for automating the assessment of reporting quality in medical research, especially for structured information.
Challenges remain in LLM performance for extracting context-dependent or inferential reporting details.
Further refinement of LLMs is needed to improve their ability to critically evaluate complex research reporting elements.