Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Improving Translational Accuracy

Improving Translational Accuracy

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Using Large Language Models to Understand Suicidality in a Social Media-Based Taxonomy of Mental Health Disorders: Linguistic Analysis of Reddit Posts.

JMIR mental health·2024

Same author

Development of a Quantitative Digital Urinalysis Tool for Detection of Nitrite, Protein, Creatinine, and pH.

Biosensors·2024

Same author

Pan-Canadian Electronic Medical Record Diagnostic and Unstructured Text Data for Capturing PTSD: Retrospective Observational Study.

JMIR medical informatics·2022

Same author

Characterizing primary care patients with posttraumatic stress disorder using electronic medical records: a retrospective cross-sectional study.

Family practice·2022

Same author

Natural Language Processing of Computed Tomography Reports to Label Metastatic Phenotypes With Prognostic Significance in Patients With Colorectal Cancer.

JCO clinical cancer informatics·2022

Same author

Diagnosing post-traumatic stress disorder using electronic medical record data.

Health informatics journal·2021

Same journal

Supporting Radiology Resident Education and Clinical Decision-Making With Large Language Models: Comparative Study of Reasoning Models DeepSeek-R1 and ChatGPT-o1.

JMIR AI·2026

Same journal

Patient Perceptions on the Use of Artificial Intelligence in Creating Clinical Research Documents: Survey Study.

JMIR AI·2026

Same journal

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review.

JMIR AI·2026

Same journal

Correction: Deep Learning for Age Estimation and Sex Prediction Using Mandibular-Cropped Cephalometric Images: Comparative Model Development and Validation Study.

JMIR AI·2026

Same journal

AI-Assisted Systematic Literature Review of the Economic Burden of Pneumococcal Disease: Development and Validation Study.

JMIR AI·2026

Same journal

Knowledge-Augmented Large Language Model for Multimodal Electronic Health Record-Based Risk Prediction: Development and Validation Study.

JMIR AI·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

医学文本的多代理总结和自我评估框架:开发和评估研究

Yuhao Chen¹, Bo Wen², Farhana Zulkernine¹

¹School of Computing, Queen's University, 557 Goodwin Hall, Kingston, ON, K7L 2N8, Canada, 1 6138930999.

|December 16, 2025

概括

此摘要是机器生成的。

大型语言模型 (LLM) 可以可靠地总结和评估医学文本,减少对人类专家的依赖. 这种人工智能系统证明了临床使用的可扩展性,解决了幻觉和偏见等挑战.

关键词:

在法学士 (LLM) 课程中.法学士作为一个法官.大型语言模型评估评估.多个代理网络的多个代理网络.总结总结评价评价非结构化的医疗数据总结.

更多相关视频

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

相关实验视频

Last Updated: Jan 8, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

科学领域:

人工智能的人工智能
医疗信息学医疗信息学
自然语言处理自然语言处理.

背景情况:

大型语言模型 (LLM) 在处理医疗文本方面表现有前途,但容易出现不准确 (幻觉).
人类专家对LLM输出的审查是耗时和昂贵的,阻碍了临床部署.
确保准确性和可靠性对于医疗保健中的LLM至关重要.

研究的目的:

开发一种人工智能系统,从非结构化医疗数据中提取结构化信息.
整合自我验证机制来评估LLM输出准确性和可靠性.
增强人工智能驱动的医学总结和评估的稳定性和可靠性.

主要方法:

一个两层的框架:总结 (Llama2-70B,Mistral-7B) 和评估 (GPT-4-turbo作为法官).
双对比和即时策略评估了总结的连贯性,一致性,流性和相关性.
对LLM的判断与医学专家的评估进行了比较,并分析了专家间的分歧.

主要成果:

GPT-4表现出与专家判断的强烈一致 (83.06%的人同意至少一名专家).
与基线提示相比,快速增强的指导改善了GPT-4的调整.
观察到专家共识的变化 (总体上为19.2%,在3位专家中为54%).

结论:

在医学数据总结和评估方面,LLM可以作为可靠的工具,减少对人类的依赖.
拟议的多种药物总结和自我评估框架是可扩展和适应临床应用的.
该框架解决了LLM输出中的幻觉和位置偏差等关键挑战.