Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Language Development01:22

Language Development

852
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
852
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.5K
3.5K
Improving Translational Accuracy02:07

Improving Translational Accuracy

14.1K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.1K
Language and Cognition01:27

Language and Cognition

715
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
715
Components of Language01:24

Components of Language

752
Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.
752
Higher Mental Functions of the Brain: Language01:10

Higher Mental Functions of the Brain: Language

3.5K
Language is a system of communication that allows the expression of thoughts, ideas, and feelings. The brain processes language in both hemispheres.
Language formation and comprehension take place in the dominant hemisphere. The dominant hemisphere is responsible for understanding the meaning of spoken, written, or sign language, as well as the ability to communicate. For most people, the left hemisphere is the dominant one. The right hemisphere, then, gives tone and emotional context to the...
3.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exploring the relationship between everyday functioning and cognitive performance in individuals with late onset unexplained epilepsy.

Epilepsia·2026
Same author

Apathy and resting state functional connectivity across the Alzheimer's disease continuum.

Alzheimer's & dementia (Amsterdam, Netherlands)·2026
Same author

Plasma phosphorylated tau 217 and longitudinal trajectories of Aβ, tau, and cognition in cognitively unimpaired older adults.

Nature communications·2026
Same author

Baseline cortical amyloid-β levels are associated with subsequent study-partner-rated apathy in community-dwelling older adults.

Journal of Alzheimer's disease : JAD·2026
Same author

Combining p-tau217 and digital cognitive testing to predict cognitive decline.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026
Same author

Clinical and Pathological Progression of Awareness Trajectories in Preclinical Alzheimer's Disease.

medRxiv : the preprint server for health sciences·2026
Same journal

BlockFedMed: A blockchain-federated learning framework for privacy-preserving mortality prediction across heterogeneous intensive care units.

International journal of medical informatics·2026
Same journal

Integrating clinical decision support systems in pediatric oncology: A scoping review of applications, implementation gaps, and management Implications.

International journal of medical informatics·2026
Same journal

Understanding digital health capability of allied health professionals - a mixed-methods study with content validity analysis.

International journal of medical informatics·2026
Same journal

On-premises open-source large language models for privacy-preserving multimodal depression screening.

International journal of medical informatics·2026
Same journal

Data mining methods, tasks, and algorithms for adverse drug reaction analysis in pharmacovigilance: A scoping review.

International journal of medical informatics·2026
Same journal

Development and validation of an interpretable machine learning model for predicting systemic inflammatory response syndrome after percutaneous nephrolithotomy: A multicenter study.

International journal of medical informatics·2026
See all related articles

Related Experiment Video

Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K

Extracting language information from clinical notes using large language models.

Lingfei Qian1, Na Hong1, Yujia Zhou1

  • 1Department of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA.

International Journal of Medical Informatics
|September 24, 2025
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) can accurately extract patient language proficiency from clinical notes, improving healthcare equity. This named entity recognition (NER) pipeline enhances communication and resource allocation.

Keywords:
Cross-sites validationLarge language modelsNamed entity recognitionPatient language information

More Related Videos

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.4K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.3K

Related Experiment Videos

Last Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

1.0K
A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts
07:50

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

16.4K
Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
05:47

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

1.3K

Area of Science:

  • Natural Language Processing
  • Clinical Informatics
  • Health Services Research

Background:

  • Patient language proficiency is crucial for equitable care and research.
  • Electronic health record (EHR) language data is often incomplete or inaccurate.
  • Heterogeneous documentation practices hinder multi-institutional data use.

Purpose of the Study:

  • Develop and evaluate a named entity recognition (NER) pipeline using large language models (LLMs).
  • Accurately extract detailed patient language status from unstructured clinical notes.
  • Enable scalable and generalizable language information extraction for research and practice.

Main Methods:

  • Defined four language status categories: fluent use, partial ability, lack of understanding, and unrelated mentions.
  • Annotated datasets from Yale New Haven Hospital (YNHH) and MIMIC-III.
  • Evaluated proprietary (GPT-4o) and open-source (LLaMA3, BERT) LLMs in zero-shot and fine-tuning settings.
  • Assessed cross-site generalizability.

Main Results:

  • GPT-4o achieved high F1 scores (87% YNHH, 82% MIMIC) without fine-tuning.
  • Fine-tuned open-source models (BERT, LLaMA3) showed comparable or superior performance.
  • LLMs, especially LLaMA3, demonstrated stronger cross-institutional generalizability than traditional models.
  • Unrelated language mentions were the most challenging category.

Conclusions:

  • The NER framework accurately extracts nuanced language information from clinical narratives.
  • High accuracy and generalizability support large-scale, language-focused research.
  • Implications include improved patient-provider communication, interpreter services, and equitable healthcare.