Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
Improving Translational Accuracy02:07

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Vision Foundry: A System for Training Foundational Vision AI Models.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

A Secure Sandbox Environment for Orchestrating Medical AI Agents Using Model Context Protocols and Role-Based Access Control.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

A Framework for Cross-Domain Generalization in Coronary Artery Calcium Scoring Across Gated and Non-Gated Computed Tomography.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

Implementation and Assessment of Machine Learning Models for Forecasting Suspected Opioid Overdoses in Emergency Medical Services Data.

AMIA ... Annual Symposium proceedings. AMIA Symposium·2026
Same author

A histopathology aware DINO model with attention based representation enhancement.

Scientific reports·2025
Same author

Toward Automated Clinical Transcriptions.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2025
Same journal

LabSage: Structural-Semantic Decoupling for Enhanced Retrieval-Augmented Generation in Clinical Laboratories.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same journal

Evaluating Representation Embeddings from LLMs and Time-Series Foundation Models for Wearable Accelerometer-Based Health Prediction.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same journal

ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same journal

Mapping the Storm: Linking Tornado Paths to Emergency Room Surges Through Geocoded Patient Data.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same journal

Multi-Modal Deep Learning-Based Model to Predict Burkitt Lymphoma Recurrence.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same journal

A Multi-Model LLM Consensus Framework to Identify EHR-Predictable Eligibility Criteria in NSCLC Immunotherapy Trials.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
See all related articles

Related Experiment Video

Updated: Jun 20, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Logit Fingerprinting: A Novel, Accuracy-Independent Method for Validating Large Language Model Stability in

W Vaiden Logan1, V K Cody Bumgardner1

  • 1Center For Applied Artificial Intelligence, University of Kentucky, Lexington, KY.

AMIA Joint Summits on Translational Science Proceedings. AMIA Joint Summits on Translational Science
|June 19, 2026
PubMed
Summary
This summary is machine-generated.

New quality assurance methods are needed for clinical Large Language Models (LLMs). A "behavioral fingerprint" using a Single-Token Forced-Choice Logit Probe detects instability from model compression, crucial for safe deployment.

Related Experiment Videos

Last Updated: Jun 20, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Area of Science:

  • Artificial Intelligence
  • Clinical Informatics
  • Medical AI

Background:

  • Large Language Models (LLMs) integration into clinical settings necessitates robust quality assurance.
  • Standard accuracy metrics are insufficient for detecting behavioral volatility caused by model compression (quantization, distillation) and sparse architectures.

Purpose of the Study:

  • To introduce a novel method, the "Single-Token Forced-Choice Logit Probe," for assessing LLM behavioral stability.
  • To generate a "behavioral fingerprint" to detect hidden effects of model compression and architectural instability.

Main Methods:

  • Developed and validated the "Single-Token Forced-Choice Logit Probe" on 11 local model families using a domain-specific (MedQA) benchmark.
  • Conducted a longitudinal audit of commercial APIs to assess decision-making stability.
  • Utilized forensic classification to identify compression techniques (e.g., Q8 vs. FP8) and analyze non-determinism sources (e.g., Sparse Mixture-of-Experts routing).

Main Results:

  • The proposed method achieved 100% accuracy in distinguishing full-precision models from quantized variants.
  • A significant "Stability Gap" was observed in commercial APIs, with distilled "Nano" models showing nearly double the decision instability (2.82% Flip Rate) compared to standard models (1.58% Flip Rate).
  • Identified Sparse Mixture-of-Experts routing as a likely source of non-determinism in distilled models.

Conclusions:

  • The "Flip Rate" is identified as a critical safety metric for evaluating LLMs in clinical contexts.
  • Distilled and quantized LLMs require rigorous stability auditing before clinical deployment to ensure patient safety and reliable performance.