Search research articles

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Implementing trust in non-small cell lung cancer diagnosis with a conformalized uncertainty-aware AI framework.

Nature biomedical engineering·2026

Same author

Generating synthetic multi-national longitudinal cohorts for clinically grounded HIV research.

Nature communications·2026

Same author

A real-world feasibility evaluation of LLM-based clinical prediction: emergency department return visit admission across two academic medical centers.

Research square·2026

Same author

Evaluating Large Language Models for Translating Multimodal Phenotype Documentations into Executable EHR Phenotyping Algorithms.

Research square·2026

Same author

Integrating genetically predicted transcriptomic signatures with longitudinal real-world data enables scalable drug repurposing for Alzheimer's disease.

Research square·2026

Same author

Making LLM Predictions Interpretable: Fine-Tuning GPT-4o for Early Discontinuation of Cancer Medication.

Studies in health technology and informatics·2026

Same journal

Comparative Evaluation of Pretrained Large Language Models for Suicide Risk Prediction from Clinical Notes in U.S. Veterans.

medRxiv : the preprint server for health sciences·2026

Same journal

Nocturnal Respiratory Rate and Variability Predict Long-term Mortality in Stable Outpatients with Cardiovascular Disease.

medRxiv : the preprint server for health sciences·2026

Same journal

MOSAIC: Methylation-Oriented Site Analysis and Information Classifier for Robust Epigenomic Classification of Acute Leukemia in Clinical Cohorts with Variable Tumor Purity.

medRxiv : the preprint server for health sciences·2026

Same journal

Risk beliefs, intensive digital information and demand for a new preventative health product in public clinics: Evidence from an experiment in Zimbabwe.

medRxiv : the preprint server for health sciences·2026

Same journal

Development of an automated, imaging-based preoperative screening model for early identification of malnutrition in an abdominal surgery cohort.

medRxiv : the preprint server for health sciences·2026

Same journal

A Pilot Project Leveraging Large Language Models for Automated Screening and Variable Extraction in Observational Studies.

medRxiv : the preprint server for health sciences·2026

See all related articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Video

Updated: Jun 5, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evaluating Large Language Models for Translating Multimodal Phenotype Documentations into Executable EHR Phenotyping

Chao Yan¹, Yi Xin², Wu-Chen Su¹

¹Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.

Medrxiv : the Preprint Server for Health Sciences

|June 4, 2026

Summary

This summary is machine-generated.

Large language models show promise for translating clinical definitions into electronic health record (EHR) database queries. However, documentation quality, not model performance, remains a key challenge for EHR research.

More Related Videos

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Jun 5, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Health Informatics
Artificial Intelligence in Medicine
Clinical Data Management

Background:

Translating clinical definitions into executable electronic health record (EHR) database queries is crucial for research but is a labor-intensive process.
The increasing availability of EHR data necessitates efficient methods for phenotype extraction and query generation.

Purpose of the Study:

To evaluate the performance of two advanced large language models (LLMs) in generating EHR database queries from clinical definitions.
To assess the impact of different documentation modalities (e.g., structured text, diagrams) on LLM performance for EHR phenotype translation.

Main Methods:

Two state-of-the-art large language models were tested on five distinct clinical phenotypes.
The models processed three types of documentation: structured text, semi-structured text, and diagrams.
Performance was evaluated based on the accuracy and completeness of the generated EHR database queries.

Main Results:

Both evaluated LLMs demonstrated proficiency in capturing high-level logic from structured and semi-structured clinical documentation.
Model performance significantly degraded when presented with diagram-only input, indicating limitations in interpreting visual data.
Error analysis identified seven distinct categories of failures, with documentation quality emerging as the primary bottleneck.

Conclusions:

While LLMs show potential for automating EHR phenotype query generation, current capabilities are constrained by input documentation quality.
Standardization of clinical documentation and continued expert oversight are essential to overcome current limitations and improve LLM utility in EHR research.
Future research should focus on improving LLM's ability to interpret diverse documentation formats and addressing the impact of data quality.