Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

9.4K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
9.4K
Genetic Lingo01:11

Genetic Lingo

102.0K
Overview
102.0K
Pharmacokinetic Models: Overview01:20

Pharmacokinetic Models: Overview

609
Pharmacokinetic models utilize mathematical analysis to achieve a detailed quantitative understanding of a drug's life cycle within the body. They are instrumental in simulating a drug's pharmacokinetic parameters, predicting drug concentrations over time, optimizing dosage regimens, linking concentrations with pharmacologic activity, and estimating potential toxicity.
There are three primary types of models: empirical, compartment, and physiological. Empirical models, with minimal...
609
Synthetic Biology02:55

Synthetic Biology

4.7K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
4.7K
Genomics02:02

Genomics

36.2K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
36.2K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.0K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.0K
  1. Home
  2. Research Domains
  3. Information And Computing Sciences
  4. Machine Learning
  5. Context Learning
  6. Biomedical Knowledge Graph-optimized Prompt Generation For Large Language Models

Biomedical knowledge graph-optimized prompt generation for large language models

Karthik Soman1, Peter W Rose2, John H Morris3

  • 1Department of Neurology, Weill Institute for Neurosciences, University of California, San Francisco, San Francisco, CA 94158, United States.

Bioinformatics (Oxford, England)
|September 17, 2024

Related Experiment Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

516
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

655

View abstract on PubMed

Summary
This summary is machine-generated.

This study introduces a token-optimized Knowledge Graph-based Retrieval Augmented Generation (KG-RAG) framework. KG-RAG significantly improves large language model performance on biomedical tasks with reduced token consumption and enhanced accuracy.

Area of Science:

  • Biomedical Informatics
  • Artificial Intelligence

Background:

  • Large language models (LLMs) face challenges in knowledge-intensive domains like biomedicine.
  • Current solutions like pretraining and fine-tuning incur high computational costs and require domain expertise.

Purpose of the Study:

  • To introduce a token-optimized Knowledge Graph-based Retrieval Augmented Generation (KG-RAG) framework.
  • To leverage the SPOKE biomedical knowledge graph (KG) with LLMs for generating accurate biomedical text.

Main Methods:

  • Developed a KG-RAG framework utilizing the SPOKE KG and LLMs (Llama-2, GPT-3.5, GPT-4).
  • Implemented context extraction with minimal graph schema and context pruning via embedding methods.
  • Optimized token consumption for cost-effective RAG on proprietary LLMs.

Related Experiment Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

516
A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

1.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

655

Main Results:

  • Achieved over 50% reduction in token consumption without compromising accuracy.
  • Enhanced LLM performance across diverse biomedical prompts, providing provenance and statistical evidence.
  • Boosted Llama-2 model performance by 71% on a challenging multiple-choice question dataset.

Conclusions:

  • The KG-RAG framework effectively combines explicit KG knowledge and implicit LLM knowledge in a token-optimized manner.
  • This approach enhances the adaptability of general-purpose LLMs for domain-specific biomedical questions cost-effectively.
  • The framework empowers open-source models for specialized tasks and improves proprietary LLM performance.