Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Health Literacy01:21

Health Literacy

Health literacy is an individual's or a community's capacity to comprehend, receive, read, and use relevant healthcare information and services. The World Health Organization (WHO, 2018) defines health literacy as the cognitive and social skills that determine the ability of individuals to gain access to, understand, and use information in ways that promote and maintain good health. As a result, the WHO helps individuals manage long-term health concerns, participate in preventative programs,...
Levels of Health Promotion and Illness Prevention01:26

Levels of Health Promotion and Illness Prevention

Health promotion allows a person to control the determinants of health, resulting in an improved health status. It enhances the quality of life and reduces premature deaths. Health promotion and illness prevention programs help people make beneficial choices to reduce the risk of disease and disabilities. There are three health promotion and illness prevention levels: primary, secondary, and tertiary prevention.
In primary prevention, actions taken before disease onset prevent the disease from...
Health Information Technology and Healthcare Information System01:30

Health Information Technology and Healthcare Information System

Health Information Technology (HIT)
Health Information Technology, commonly called HIT, integrates advanced information systems and technology in healthcare settings. Its primary functions include:
The Availability Heuristic01:08

The Availability Heuristic

A heuristic is a general problem-solving framework (Tversky & Kahneman, 1974). You can think of these as mental shortcuts that are used to solve problems. Different types of heuristics are used in different types of situations, and the impulse to use a heuristic occurs when one of five conditions is met (Pratkanis, 1989):
Healthcare Agencies II01:17

Healthcare Agencies II

There are various healthcare agencies in the United States—some of which are managed by religious institutions and others by different government branches.
Parish nursing is a growing specialty nursing profession that focuses on holistic healthcare, health promotion, and illness prevention. It blends professional nursing practice with a health ministry, focusing on health and healing within the context of a Christian community. Parish nurses serve as health educators, referral sources, and lay...
Models of Health Promotion and Illness Prevention II01:18

Models of Health Promotion and Illness Prevention II

The person's health status fluctuates continually, varying from being in good health to becoming ill and returning to being healthy. To understand the concept of illness prevention, there are two models. First, the health-illness continuum model is a graphic representation of an individual's wellness. It states that a person is considered healthy in the absence of physical disease and the presence of good emotional health.
The agent-host-environment model states that disease results from...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Epidemiology and risk factors of stroke in the Americas: a comprehensive narrative literature review.

Lancet regional health. Americas·2026
Same author

Implementation of stroke prevention: a review of challenges and opportunities in the Americas.

Lancet regional health. Americas·2026
Same author

Impact of Imaging Protocols on Thermal Detection of Pressure Injuries: Threshold versus Deep Learning Across Skin Tones.

medRxiv : the preprint server for health sciences·2026
Same author

Correction: Impact of skin tone, environmental, and technical factors on thermal imaging.

PloS one·2026
Same author

Association of life-course stressful life events with later-life intrinsic capacity: A multi-cohort study.

The journal of nutrition, health & aging·2026
Same author

GDA-AM: ON THE EFFECTIVENESS OF SOLVING MIN-IMAX OPTIMIZATION VIA ANDERSON MIXING.

... International Conference on Learning Representations·2026
Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026
Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026
Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026
Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026
Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026
Same journal

Fast Calculation of Feature Contributions in Boosting Trees.

Proceedings of machine learning research·2026
See all related articles

Related Experiment Video

Updated: Jun 13, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Chase M Fensore1, Rodrigo M Carrillo-Larco2, Megha K Shah3

  • 1Department of Computer Science, Emory University.

Proceedings of Machine Learning Research
|June 12, 2026
PubMed
Summary
This summary is machine-generated.

Vanilla large language models (LLMs) surprisingly outperformed retrieval-augmented generation (RAG) for answering consumer health questions. Current RAG methods face challenges in medical question-answering, requiring more advanced approaches for effective implementation.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Jun 13, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

  • Artificial Intelligence
  • Medical Informatics
  • Natural Language Processing

Background:

  • Large language models (LLMs) demonstrate strong performance on medical tasks.
  • The efficacy of retrieval-augmented generation (RAG) for consumer health questions remains unclear.
  • Evaluating RAG's impact on LLM-based medical question-answering is crucial.

Purpose of the Study:

  • To systematically compare vanilla LLMs with RAG-enhanced LLMs for consumer health question-answering.
  • To assess performance using automated metrics, LLM-based evaluation, and clinical validation.
  • To identify challenges and requirements for effective RAG implementation in medical AI.

Main Methods:

  • Four open-source LLMs were evaluated in both vanilla and RAG configurations.
  • The NIDDK portion of the MedQuAD dataset was utilized for systematic evaluation.
  • Performance was measured using quantitative metrics (BLEU, ROUGE, BERTScore), LLM-based assessment, and clinical validation.

Main Results:

  • Vanilla LLM approaches consistently outperformed RAG variants across all evaluation metrics.
  • Low retrieval performance (Precision@5 = 0.15) indicates significant challenges in current RAG systems for medical queries.
  • RAG showed competitive results only in specific domains like scientific consensus and harm reduction.

Conclusions:

  • Simple retrieval and prompt engineering are insufficient for effective RAG in consumer health question-answering.
  • Developing medical-specific RAG infrastructure is necessary to improve medical AI systems.
  • Further research into sophisticated RAG approaches is required for reliable medical question-answering.