Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

15.2K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
15.2K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.7K
3.7K
Variability: Analysis01:11

Variability: Analysis

547
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
547
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

277
Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
277

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Differences in Safety Risks Across Languages in Health-Relevant Queries: Vulnerability Analysis of Large Language Model Responses.

JMIR formative research·2026
Same author

Designing Psychologically Grounded Artificial Intelligence for Supporting Bystander-Based Cyberaggression Intervention: Mixed Methods Exploratory Study.

JMIR formative research·2026
Same author

Fairness aware subset selection for advancing equity in skin cancer detection.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

Automated detection of stigmatizing language in Electronic Health Records (EHRs) using a multi-stage transfer learning approach.

Journal of the American Medical Informatics Association : JAMIA·2025
Same author

Current Landscape and Future Directions for Mental Health Conversational Agents for Youth: Scoping Review.

JMIR medical informatics·2025
Same author

Language disparities in pandemic information: Autocomplete analysis of COVID-19 searches in New York.

Health informatics journal·2024
Same journal

Ambient AI Scribes and Emergency Department Documentation Burden: Retrospective Cohort Study.

JMIR AI·2026
Same journal

Supporting Radiology Resident Education and Clinical Decision-Making With Large Language Models: Comparative Study of Reasoning Models DeepSeek-R1 and ChatGPT-o1.

JMIR AI·2026
Same journal

Patient Perceptions on the Use of Artificial Intelligence in Creating Clinical Research Documents: Survey Study.

JMIR AI·2026
Same journal

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review.

JMIR AI·2026
Same journal

Correction: Deep Learning for Age Estimation and Sex Prediction Using Mandibular-Cropped Cephalometric Images: Comparative Model Development and Validation Study.

JMIR AI·2026
Same journal

AI-Assisted Systematic Literature Review of the Economic Burden of Pneumococcal Disease: Development and Validation Study.

JMIR AI·2026
See all related articles

Related Experiment Video

Updated: Feb 22, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.2K

Performance of Large Language Models Under Input Variability in Health Care Applications: Dataset Development and

Saubhagya Joshi1, Monjil Mehta2, Sarjak Maniar2

  • 1Library and Information Sciences, School of Communication & Information, Rutgers University, 4 Huntington St, New Brunswick, NJ, 08901, United States, +1 (848) 932-7500.

JMIR AI
|February 20, 2026
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) in healthcare show surprising robustness to common input errors like typos and homophones. However, redactions significantly degrade LLM performance, highlighting a need for careful design in clinical applications.

Keywords:
dataseterror analysishealth informaticslarge language modelsrobustness

More Related Videos

Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

1.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.7K

Related Experiment Videos

Last Updated: Feb 22, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.2K
Asthma Detection Research Based on Voice Signal Processing and Machine Learning
04:04

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

1.1K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.7K

Area of Science:

  • Artificial Intelligence in Healthcare
  • Natural Language Processing
  • Clinical Informatics

Background:

  • Large language models (LLMs) are increasingly used in healthcare for patient care and decision-making.
  • The reliability of LLMs with imperfect clinical data is not well understood.
  • Data imperfections are common in clinical documentation and patient-generated information.

Purpose of the Study:

  • Investigate the impact of input perturbations on LLM performance in health applications.
  • Compare the effects of different perturbation types and levels.
  • Analyze differential impacts on health-related versus non-health-related terms.

Main Methods:

  • Systematic evaluation of 3 LLMs across 3 health-related tasks.
  • Utilized a novel dataset with human-like variations: redactions, homophones, and typographical errors.
  • Assessed performance at various perturbation levels.

Main Results:

  • LLMs demonstrated notable robustness to common input variations; performance was stable or improved in over 55% of cases.
  • Lower perturbation levels sometimes led to increased performance (14.07%).
  • Redactions proved more detrimental to LLM performance than other variations.

Conclusions:

  • Healthcare applications using LLMs must account for input variability and data quality.
  • Robustness to imperfect inputs is crucial for LLM reliability in clinical settings.
  • Findings offer insights for developing resilient AI tools and improving LLM performance in healthcare.