Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Variability: Analysis

Variability: Analysis

Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Differences in Safety Risks Across Languages in Health-Relevant Queries: Vulnerability Analysis of Large Language Model Responses.

JMIR formative research·2026

Same author

Designing Psychologically Grounded Artificial Intelligence for Supporting Bystander-Based Cyberaggression Intervention: Mixed Methods Exploratory Study.

JMIR formative research·2026

Same author

Fairness aware subset selection for advancing equity in skin cancer detection.

Journal of the American Medical Informatics Association : JAMIA·2026

Same author

Automated detection of stigmatizing language in Electronic Health Records (EHRs) using a multi-stage transfer learning approach.

Journal of the American Medical Informatics Association : JAMIA·2025

Same author

Current Landscape and Future Directions for Mental Health Conversational Agents for Youth: Scoping Review.

JMIR medical informatics·2025

Same author

Language disparities in pandemic information: Autocomplete analysis of COVID-19 searches in New York.

Health informatics journal·2024

Same journal

Ambient AI Scribes and Emergency Department Documentation Burden: Retrospective Cohort Study.

JMIR AI·2026

Same journal

Supporting Radiology Resident Education and Clinical Decision-Making With Large Language Models: Comparative Study of Reasoning Models DeepSeek-R1 and ChatGPT-o1.

JMIR AI·2026

Same journal

Patient Perceptions on the Use of Artificial Intelligence in Creating Clinical Research Documents: Survey Study.

JMIR AI·2026

Same journal

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review.

JMIR AI·2026

Same journal

Correction: Deep Learning for Age Estimation and Sex Prediction Using Mandibular-Cropped Cephalometric Images: Comparative Model Development and Validation Study.

JMIR AI·2026

Same journal

AI-Assisted Systematic Literature Review of the Economic Burden of Pneumococcal Disease: Development and Validation Study.

JMIR AI·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 22, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Performance of Large Language Models Under Input Variability in Health Care Applications: Dataset Development and

Saubhagya Joshi¹, Monjil Mehta², Sarjak Maniar²

¹Library and Information Sciences, School of Communication & Information, Rutgers University, 4 Huntington St, New Brunswick, NJ, 08901, United States, +1 (848) 932-7500.

|February 20, 2026

Summary

This summary is machine-generated.

Large language models (LLMs) in healthcare show surprising robustness to common input errors like typos and homophones. However, redactions significantly degrade LLM performance, highlighting a need for careful design in clinical applications.

Keywords:

dataset error analysis health informatics large language models robustness

More Related Videos

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Feb 22, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Artificial Intelligence in Healthcare
Natural Language Processing
Clinical Informatics

Background:

Large language models (LLMs) are increasingly used in healthcare for patient care and decision-making.
The reliability of LLMs with imperfect clinical data is not well understood.
Data imperfections are common in clinical documentation and patient-generated information.

Purpose of the Study:

Investigate the impact of input perturbations on LLM performance in health applications.
Compare the effects of different perturbation types and levels.
Analyze differential impacts on health-related versus non-health-related terms.

Main Methods:

Systematic evaluation of 3 LLMs across 3 health-related tasks.
Utilized a novel dataset with human-like variations: redactions, homophones, and typographical errors.
Assessed performance at various perturbation levels.

Main Results:

LLMs demonstrated notable robustness to common input variations; performance was stable or improved in over 55% of cases.
Lower perturbation levels sometimes led to increased performance (14.07%).
Redactions proved more detrimental to LLM performance than other variations.

Conclusions:

Healthcare applications using LLMs must account for input variability and data quality.
Robustness to imperfect inputs is crucial for LLM reliability in clinical settings.
Findings offer insights for developing resilient AI tools and improving LLM performance in healthcare.