相关视频 - | JoVE Visualize

Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

Search research articles

相关实验视频

Edoardo Leo¹, Francesco Baglivo¹, Federico Starace¹

¹Dipartimento di Ricerca Traslazionale e delle Nuove Tecnologie in Medicina e Chirurgia, Università di Pisa.

Recenti progressi in medicina

|October 2, 2025

概括

此摘要是机器生成的。

检索增强生成 (RAG) 提高了睡眠医学认证问题的大语言模型 (LLM) 准确性. 这项研究证明了RAG.

相关实验视频

相关概念视频

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Cytology and <i>KRAS/GNAS</i> Molecular Testing of Pancreatic Cyst Fluid for Risk Stratification of Intraductal Papillary Mucinous Neoplasms: A Single-Center Study with Histological Correlation.

Journal of clinical medicine·2026

Same author

Reliable ECG monitoring during rest and exercise: a pilot comparative validation of a wearable single-lead band.

European heart journal. Imaging methods and practice·2026

Same author

Mapping risk communication practices in public health emergencies: a scoping review and comparison with Italian regional pandemic plans.

BMC public health·2026

Same author

Comparative efficacy of agomelatine and escitalopram in people with epilepsy and comorbid major depressive disorder: A double-blind randomized controlled trial.

Epilepsy & behavior reports·2026

Same author

Impact of Respiratory Viral Codetections on RSV Disease Burden in Young Children in Primary Care.

Influenza and other respiratory viruses·2026

Same author

Clinical outcomes of rehabilitation with a robotic anthropomorphic exoskeleton in patients with motor-incomplete spinal cord injury: a multicenter randomized controlled trial.

European journal of physical and rehabilitation medicine·2026

Same journal

Recenti progressi in medicina·2026

Same journal

Recenti progressi in medicina·2026

Same journal

Recenti progressi in medicina·2026

Same journal

Recenti progressi in medicina·2026

Same journal

Recenti progressi in medicina·2026

Same journal

Recenti progressi in medicina·2026

查看所有相关文章

科学领域:

医疗信息学医疗信息学
人工智能在医学中的应用
睡眠医学睡眠医学

背景情况:

大型语言模型 (LLM) 在医学教育中表现有前途.
评估LLM在睡眠医学等专业领域的表现至关重要.
目前的LLM准确性可能不足以获得高风险的医疗认证.

研究的目的:

评估四个LLM的睡眠医学认证问题的表现.
为了比较基线LLM性能与检索增强生成 (RAG) 增强性能.
评估RAG对LLM可靠性在专业医疗环境中的影响.

主要方法:

使用睡眠医学指南和教科书作为知识库.
评估了四个LLM:拉玛3.2 3B,拉玛3.3 70B,GPT 4o mini,以及双子座2.0闪光.
在AIMS认证问题上,将基线性能与RAG增强性能进行比较.

主要成果:

在所有测试的LLMs中,RAG显著提高了准确性.
拉玛3.2显示RAG的准确性增加了+9.6个点.
双子座2.0显示RAG的精度增加了4.0个点.

结论:

在提高专业医学知识的LLM准确性方面,RAG是有效的.
在睡眠医学认证的LLM表现可以使用RAG改进.
RAG集成是提高医学领域LLM可靠性的关键.