Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Background and Environment Affect Phenotype02:27

Background and Environment Affect Phenotype

6.4K
Although the genetic makeup of an organism plays a major role in determining the phenotype, there are also several environmental factors, such as temperature, oxygen availability, presence of mutagens, that can alter an organism’s phenotype.
An example of how genetic background affects phenotype can be seen in horses. The Extension gene in horses is responsible for their coat color. A wild-type gene (EE) produces black pigment in the coat, while a mutant gene (ee) produces red pigment. A...
6.4K
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.5K
2.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Restoration of Limb Alignment in Post-Traumatic Genu Varum Case Using Medial Open Wedge Proximal Tibial Osteotomy with Posterior Tibial Slope Correction: A Case Report.

Journal of orthopaedic case reports·2026
Same author

Please Tell Me It Is Only a Nightmare-The Proposed Dismantling of the United States Federal Research Infrastructure.

Diabetes care·2026
Same author

Building an Interoperable Rare Disease Multi-omic Resource: The GREGoR Data Model and Dataset.

bioRxiv : the preprint server for biology·2026
Same author

Author Correction: Biallelic variants in the noncoding RNA gene RNU4-2 cause a recessive neurodevelopmental syndrome with distinct white matter changes.

Nature genetics·2026
Same author

An integrated cardiometabolic genetic testing program in a predominantly Hispanic population within a community setting.

Genetics in medicine : official journal of the American College of Medical Genetics·2026
Same author

Rhizosphere microbial shifts drive amygdalin detoxification and jasmonate-mediated alleviation of peach autotoxicity.

The ISME journal·2026
Same journal

Predicting Chemotherapy Response from Staging Laparoscopy Images.

medRxiv : the preprint server for health sciences·2026
Same journal

Development and External Validation of a Machine Learning Model for 10-Year Ischemic Stroke Risk Prediction in Diverse Populations.

medRxiv : the preprint server for health sciences·2026
Same journal

MCH-Guard: Multimodal Machine Learning Framework for Risk Stratification of Cerebral Microhemorrhage Risk in the Alzheimer's Disease Neuroimaging Initiative.

medRxiv : the preprint server for health sciences·2026
Same journal

Genetic and maternal environmental contributions to estimated fetal weight at 20 weeks gestation compared with birthweight.

medRxiv : the preprint server for health sciences·2026
Same journal

Better immediate declarative memory is associated with forgetting during locomotor adaptation in chronic stroke and in older adults.

medRxiv : the preprint server for health sciences·2026
Same journal

An empirical Bayes framework for burden and dispersion association tests helps prioritize rare variants associated with Alzheimer's disease.

medRxiv : the preprint server for health sciences·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

500

Improving Automated Deep Phenotyping Through Large Language Models Using Retrieval Augmented Generation.

Brandon T Garcia1,2,3, Lauren Westerfield1,4, Priya Yelemali1

  • 1Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.

Medrxiv : the Preprint Server for Health Sciences
|December 16, 2024
PubMed
Summary
This summary is machine-generated.

RAG-HPO improves rare genetic disorder diagnosis by accurately assigning Human Phenotype Ontology (HPO) terms using retrieval-augmented generation, surpassing existing tools.

Keywords:
Clinical GenomicsGenerative AIGenerative Pre-trained Transformer (GPT)Human Phenotype Ontology (HPO)Large language models (LLMs)Llama-3Natural Language Processing (NLP)PhenotypingRetrieval augmented generation (RAG)

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

637
Construction of an Improved Multi-Tetrode Hyperdrive for Large-Scale Neural Recording in Behaving Rats
10:04

Construction of an Improved Multi-Tetrode Hyperdrive for Large-Scale Neural Recording in Behaving Rats

Published on: May 9, 2018

11.3K

Related Experiment Videos

Last Updated: Jun 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

500
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

637
Construction of an Improved Multi-Tetrode Hyperdrive for Large-Scale Neural Recording in Behaving Rats
10:04

Construction of an Improved Multi-Tetrode Hyperdrive for Large-Scale Neural Recording in Behaving Rats

Published on: May 9, 2018

11.3K

Area of Science:

  • Computational biology and bioinformatics
  • Genomics and genetic disorder research
  • Natural Language Processing (NLP) in clinical settings

Background:

  • Accurate diagnosis of rare genetic disorders requires precise phenotypic and genotypic analysis.
  • Human Phenotype Ontology (HPO) provides a standardized language for clinical phenotypes.
  • Existing HPO tools (Doc2HPO, ClinPhen) struggle with incomplete assignments and require manual review; LLMs are prone to hallucinations.

Purpose of the Study:

  • To present RAG-HPO, a novel Python-based tool leveraging Retrieval-Augmented Generation (RAG) for accurate HPO term assignment.
  • To enhance LLM accuracy in HPO term extraction without fine-tuning, addressing limitations of current methods.

Main Methods:

  • RAG-HPO utilizes a dynamic vector database containing over 54,000 phenotypic phrases mapped to HPO IDs.
  • The workflow involves LLM extraction of phenotypic phrases, semantic similarity matching against the vector database, and LLM-based HPO term assignment.
  • Performance was benchmarked against Doc2HPO, ClinPhen, and FastHPOCR using 120 case reports with 1,792 manually assigned HPO terms.

Main Results:

  • RAG-HPO, powered by Llama-3 70B, achieved a mean precision of 0.84, recall of 0.78, and F1 score of 0.80 on 120 case reports.
  • These results significantly surpassed conventional tools (p<0.00001).
  • False positive HPO term identification was low (15.8%), with minimal hallucinations (2.7%).

Conclusions:

  • RAG-HPO is a user-friendly, adaptable tool that significantly outperforms standard HPO-matching tools.
  • Its enhanced precision and recall accelerate the identification of genetic mechanisms underlying rare diseases.
  • RAG-HPO represents a substantial advancement in phenotypic analysis for genetic research and clinical genomics.