Related Experiment Video

Updated: Jun 20, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Logit Fingerprinting: A Novel, Accuracy-Independent Method for Validating Large Language Model Stability in

W Vaiden Logan¹, V K Cody Bumgardner¹

¹Center For Applied Artificial Intelligence, University of Kentucky, Lexington, KY.

AMIA Joint Summits on Translational Science Proceedings. AMIA Joint Summits on Translational Science

June 19, 2026

Summary

This summary is machine-generated.

Clinical Large Language Models (LLMs) need new quality assurance methods. A "behavioral fingerprint" built with a Single-Token Forced-Choice Logit Probe detects instability introduced by model compression, which must be caught before a model can be deployed safely.

Area of Science:

Artificial Intelligence
Clinical Informatics
Medical AI

Background:

Integrating Large Language Models (LLMs) into clinical settings necessitates robust quality assurance.
Standard accuracy metrics are insufficient for detecting behavioral volatility caused by model compression (quantization, distillation) and sparse architectures.

Purpose of the Study:

To introduce a novel method, the "Single-Token Forced-Choice Logit Probe," for assessing LLM behavioral stability.
To generate a "behavioral fingerprint" to detect hidden effects of model compression and architectural instability.

Main Methods:

Developed and validated the "Single-Token Forced-Choice Logit Probe" on 11 local model families using a domain-specific (MedQA) benchmark.
Conducted a longitudinal audit of commercial APIs to assess decision-making stability.
Utilized forensic classification to identify compression techniques (e.g., Q8 vs. FP8) and analyze non-determinism sources (e.g., Sparse Mixture-of-Experts routing).
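
This summary does not describe how the probe is implemented. One plausible reading of "Single-Token Forced-Choice Logit Probe" is sketched below: for each multiple-choice question, only the next-token logits of the answer letters are kept and renormalised, and the sequence of chosen letters serves as the fingerprint. The option letters, the dictionary input format, and the function names `forced_choice_probe` and `fingerprint` are all assumptions made for illustration, not the authors' code.

```python
import math

# Assumed answer-option tokens for a MedQA-style multiple-choice item.
CHOICES = ("A", "B", "C", "D")


def forced_choice_probe(next_token_logits, choices=CHOICES):
    """Keep only the logits of the answer options and renormalise them.

    `next_token_logits` is a hypothetical mapping from token text to the
    raw logit the model assigns to that token as the next token.
    """
    picked = {c: next_token_logits[c] for c in choices}
    peak = max(picked.values())  # subtract the max for numerical stability
    weights = {c: math.exp(v - peak) for c, v in picked.items()}
    total = sum(weights.values())
    probs = {c: w / total for c, w in weights.items()}
    answer = max(choices, key=lambda c: probs[c])
    return answer, probs


def fingerprint(per_question_logits, choices=CHOICES):
    """Chosen option per question, as a tuple (an assumed fingerprint form)."""
    return tuple(forced_choice_probe(l, choices)[0] for l in per_question_logits)


if __name__ == "__main__":
    q1 = {"A": 1.0, "B": 3.0, "C": 0.5, "D": -1.0, "the": 9.0}
    q2 = {"A": 2.0, "B": 0.0, "C": 0.0, "D": 0.0}
    print(fingerprint([q1, q2]))
```

Because tokens outside the option set are ignored, a high logit on an unrelated token (such as `"the"` in `q1`) does not affect the forced choice.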

Main Results:

The proposed method achieved 100% accuracy in distinguishing full-precision models from quantized variants.
A significant "Stability Gap" was observed in commercial APIs, with distilled "Nano" models showing nearly double the decision instability (2.82% Flip Rate) compared to standard models (1.58% Flip Rate).
Identified Sparse Mixture-of-Experts routing as a likely source of non-determinism in distilled models.
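
The summary reports Flip Rates without defining the metric. A minimal sketch, assuming a "flip" means that repeated runs of the same question do not all yield the same answer, could look like the following; the function name `flip_rate` and the input layout are hypothetical.

```python
def flip_rate(runs):
    """Fraction of questions whose answer differs across repeated runs.

    `runs` is a list of equal-length answer sequences, one per run
    (an assumed layout; the source does not specify one).
    """
    if not runs or not runs[0]:
        raise ValueError("need at least one run with at least one answer")
    if any(len(r) != len(runs[0]) for r in runs):
        raise ValueError("all runs must cover the same questions")
    # zip(*runs) groups the answers given to each question across runs.
    flipped = sum(1 for answers in zip(*runs) if len(set(answers)) > 1)
    return flipped / len(runs[0])


if __name__ == "__main__":
    run_1 = ("A", "B", "C", "D")
    run_2 = ("A", "B", "D", "D")
    print(flip_rate([run_1, run_2]))  # one of four questions flipped
    # Ratio of the two Flip Rates reported in the results above.
    print(round(2.82 / 1.58, 2))
```

On the reported figures, 2.82% divided by 1.58% is roughly 1.78, which is what "nearly double" refers to.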

Conclusions:

The "Flip Rate" is identified as a critical safety metric for evaluating LLMs in clinical contexts.
Distilled and quantized LLMs require rigorous stability auditing before clinical deployment to ensure patient safety and reliable performance.