Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

2.7K
2.7K
Higher Mental Functions of the Brain: Language01:10

Higher Mental Functions of the Brain: Language

1.0K
Language is a system of communication that allows the expression of thoughts, ideas, and feelings. The brain processes language in both hemispheres.
Language formation and comprehension take place in the dominant hemisphere. The dominant hemisphere is responsible for understanding the meaning of spoken, written, or sign language, as well as the ability to communicate. For most people, the left hemisphere is the dominant one. The right hemisphere, then, gives tone and emotional context to the...
1.0K
ER Retrieval Pathway01:45

ER Retrieval Pathway

3.9K
In the secretory pathway, vesicles transport proteins from one cellular compartment to another in forward transport to deliver the protein to its correct location. Occasionally, misfolded proteins and incorrect proteins escape their original compartments, and a retrieval pathway is used to return the escaped proteins to their original compartment.
The ER uses many checkpoints to prevent the entry of incorrectly folded or a resident protein as cargo onto a transport vesicle. These mechanisms...
3.9K
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

126
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
126
Super-resolution Fluorescence Microscopy01:37

Super-resolution Fluorescence Microscopy

7.7K
Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...
7.7K
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

149
Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
149

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Desiderata for a biomedical knowledge network: opportunities, challenges and future directions.

Bioinformatics advances·2026
Same author

Federated knowledge retrieval elevates large language model performance on biomedical benchmarks.

GigaScience·2026
Same author

A case-based explainable graph neural network framework for mechanistic drug repositioning.

Bioinformatics (Oxford, England)·2026
Same author

The NIAID Discovery Portal: a unified search engine for infectious and immune-mediated disease datasets.

mSystems·2025
Same author

Announcing the Biomedical Data Translator: Initial Public Release.

Clinical and translational science·2025
Same author

Drug Repurposing using consilience of Knowledge Graph Completion methods.

bioRxiv : the preprint server for biology·2024
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Sep 12, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

579

Federated Knowledge Retrieval Elevates Large Language Model Performance on Biomedical Benchmarks.

Janet Joy1, Andrew I Su1

  • 1Department of Integrative Structural and Computational Biology, Scripps Research, La Jolla, CA, USA.

Biorxiv : the Preprint Server for Biology
|August 6, 2025
PubMed
Summary
This summary is machine-generated.

Retrieval-augmented generation using BioThings Explorer (BTE-RAG) significantly improves large language model accuracy in biomedical research by integrating explicit evidence, enhancing mechanistic exploration and translational applications.

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

682
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.7K

Related Experiment Videos

Last Updated: Sep 12, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

579
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

682
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.7K

Area of Science:

  • Biomedical Informatics
  • Artificial Intelligence in Medicine
  • Computational Biology

Background:

  • Large language models (LLMs) show promise in biomedical natural language processing.
  • LLMs often produce factual inaccuracies (hallucinations) due to reliance on statistical patterns.
  • These inaccuracies pose risks in critical biomedical applications.

Purpose of the Study:

  • To develop and evaluate a novel retrieval-augmented generation framework, BTE-RAG.
  • To enhance the accuracy and reliability of LLMs in biomedical research.
  • To leverage explicit mechanistic evidence for improved LLM performance.

Main Methods:

  • Developed BTE-RAG, integrating LLMs with BioThings Explorer's API federation.
  • Created three benchmark datasets from DrugMechDB for gene-centric mechanisms, metabolite effects, and drug-biological process relationships.
  • Systematically evaluated BTE-RAG against LLM-only approaches using GPT-4o and GPT-4o mini.

Main Results:

  • BTE-RAG improved gene-centric accuracy from 51% to 75.8% (GPT-4o mini) and 69.8% to 78.6% (GPT-4o).
  • Metabolite effect question similarity scores (>=0.90) increased by 82% (GPT-4o mini) and 77% (GPT-4o).
  • Drug-biological process concordance improved, with a >10% increase in high-agreement answers for GPT-4o.

Conclusions:

  • Federated knowledge retrieval enhances LLM accuracy in biomedical contexts.
  • BTE-RAG offers transparent improvements for mechanistic exploration.
  • BTE-RAG is a practical tool for translational biomedical research.