Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Advances in Wearable Biosensors for Non-Invasive Biofluid Monitoring.

Biosensors·2026
Same author

Design and validation of a technology for 3D printing training phantoms for ultrasound imaging.

Physical and engineering sciences in medicine·2025
Same author

Large Language Models in Healthcare and Medical Applications: A Review.

Bioengineering (Basel, Switzerland)·2025
Same author

IMPACT: an interactive multi-disease prevention and counterfactual treatment system using explainable AI and a multimodal LLM.

PeerJ. Computer science·2025
Same author

Automated Detection of Aberrant Episodes in Epileptic Conditions: Leveraging EEG and Machine Learning Algorithms.

Bioengineering (Basel, Switzerland)·2025
Same author

Large scale summarization using ensemble prompts and in context learning approaches.

Scientific reports·2025
Same journal

Correction: Komatsu et al. Three-Dimensional Visualization and Detection of the Pulmonary Venous-Left Atrium Connection Using Artificial Intelligence in Fetal Cardiac Ultrasound Screening. <i>Bioengineering</i> 2026, <i>13</i>, 100.

Bioengineering (Basel, Switzerland)·2026
Same journal

Comparison of CO<sub>2</sub> Laser and Microdebrider in the Surgical Treatment of Pediatric Recurrent Respiratory Papillomatosis: A Retrospective Analysis.

Bioengineering (Basel, Switzerland)·2026
Same journal

Toward More Translational Tumor Models: Breast dECM-Based 3D Systems Capture Native Microenvironmental Cues.

Bioengineering (Basel, Switzerland)·2026
Same journal

Postural Stability Changes During the 4 Phases of the Half Squat: Kinematics Profile of the Center of Pressure and Center of Mass in High-Performance Weightlifters-A Pilot Study.

Bioengineering (Basel, Switzerland)·2026
Same journal

Definite Implant Position as Novel Readout for Effectiveness of Ridge Preservation Indicates to Beneficial Effect of Combined Treatment with Platelet-Rich Fibrin (PRF) and Xenogenic Biomaterial in Bone Regeneration.

Bioengineering (Basel, Switzerland)·2026
Same journal

Trueness and Precision of Intraoral Scanners for 3D-Printed Orthodontic Models with Attachments: An In Vitro Comparative Study.

Bioengineering (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684

Medical LLMs: Fine-Tuning vs. Retrieval-Augmented Generation.

Bhagyajit Pingua1,2, Adyakanta Sahoo1,3, Meenakshi Kandpal2

  • 1Biomedical Sensors & Systems Lab, University of Memphis, Memphis, TN 38152, USA.

Bioengineering (Basel, Switzerland)
|July 29, 2025
PubMed
Summary
This summary is machine-generated.

Retrieval-augmented generation (RAG) and combined fine-tuning (FT+RAG) improve large language model (LLM) performance in healthcare more than fine-tuning alone. LLAMA and PHI models showed the best results with these specialized training methods.

Keywords:
fine-tuninghealthcarelarge language modelsmedicalretrieval-augmented generation

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

584
Industrialized, Artificial Intelligence-guided Laser Microdissection for Microscaled Proteomic Analysis of the Tumor Microenvironment
13:01

Industrialized, Artificial Intelligence-guided Laser Microdissection for Microscaled Proteomic Analysis of the Tumor Microenvironment

Published on: June 3, 2022

3.9K

Related Experiment Videos

Last Updated: Sep 13, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

684
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

584
Industrialized, Artificial Intelligence-guided Laser Microdissection for Microscaled Proteomic Analysis of the Tumor Microenvironment
13:01

Industrialized, Artificial Intelligence-guided Laser Microdissection for Microscaled Proteomic Analysis of the Tumor Microenvironment

Published on: June 3, 2022

3.9K

Area of Science:

  • Artificial Intelligence
  • Medical Informatics
  • Natural Language Processing

Background:

  • Large language models (LLMs) possess broad knowledge but lack specialized expertise.
  • Domain-specific LLM performance can be enhanced through targeted training methodologies.
  • Healthcare applications require specialized LLMs capable of accurate and reliable information retrieval and generation.

Purpose of the Study:

  • To evaluate the effectiveness of retrieval-augmented generation (RAG) and fine-tuning (FT) on healthcare data for five distinct large language models.
  • To compare the performance of RAG alone, FT alone, and a combined FT+RAG approach.
  • To identify which models and training strategies yield superior results for specialized healthcare applications.

Main Methods:

  • Five large language models (LLama-3.1-8B, Gemma-2-9B, Mistral-7B-Instruct, Qwen2.5-7B, Phi-3.5-Mini-Instruct) were fine-tuned on the MedQuAD dataset.
  • Models were trained using three approaches: retrieval-augmented generation (RAG) only, fine-tuning (FT) only, and a combination of both (FT+RAG).
  • Performance was assessed across multiple metrics to evaluate domain-specific accuracy and capabilities.

Main Results:

  • Retrieval-augmented generation (RAG) and combined fine-tuning (FT+RAG) consistently outperformed fine-tuning (FT) alone across most evaluated models.
  • LLAMA and PHI models demonstrated superior performance, with LLAMA showing overall excellence and PHI excelling in RAG/FT+RAG capabilities.
  • QWEN models generally lagged in performance, while GEMMA and MISTRAL exhibited varied results depending on the training approach.

Conclusions:

  • Specialized training methods, particularly RAG and FT+RAG, are crucial for enhancing LLM performance in niche domains like healthcare.
  • The choice of LLM architecture significantly impacts the effectiveness of different training strategies.
  • Further research into optimizing these training methods can unlock greater potential for AI in medical information systems.