Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Language Development

Language Development

Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...

Improving Translational Accuracy

Improving Translational Accuracy

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Language and Cognition

Language and Cognition

Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.

Components of Language

Components of Language

Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.

Higher Mental Functions of the Brain: Language

Higher Mental Functions of the Brain: Language

Language is a system of communication that allows the expression of thoughts, ideas, and feelings. The brain processes language in both hemispheres.
Language formation and comprehension take place in the dominant hemisphere. The dominant hemisphere is responsible for understanding the meaning of spoken, written, or sign language, as well as the ability to communicate. For most people, the left hemisphere is the dominant one. The right hemisphere, then, gives tone and emotional context to the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Exploring the relationship between everyday functioning and cognitive performance in individuals with late onset unexplained epilepsy.

Epilepsia·2026

Same author

Apathy and resting state functional connectivity across the Alzheimer's disease continuum.

Alzheimer's & dementia (Amsterdam, Netherlands)·2026

Same author

Plasma phosphorylated tau 217 and longitudinal trajectories of Aβ, tau, and cognition in cognitively unimpaired older adults.

Nature communications·2026

Same author

Baseline cortical amyloid-β levels are associated with subsequent study-partner-rated apathy in community-dwelling older adults.

Journal of Alzheimer's disease : JAD·2026

Same author

Combining p-tau217 and digital cognitive testing to predict cognitive decline.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026

Same author

Clinical and Pathological Progression of Awareness Trajectories in Preclinical Alzheimer's Disease.

medRxiv : the preprint server for health sciences·2026

Same journal

BlockFedMed: A blockchain-federated learning framework for privacy-preserving mortality prediction across heterogeneous intensive care units.

International journal of medical informatics·2026

Same journal

Integrating clinical decision support systems in pediatric oncology: A scoping review of applications, implementation gaps, and management Implications.

International journal of medical informatics·2026

Same journal

Understanding digital health capability of allied health professionals - a mixed-methods study with content validity analysis.

International journal of medical informatics·2026

Same journal

On-premises open-source large language models for privacy-preserving multimodal depression screening.

International journal of medical informatics·2026

Same journal

Data mining methods, tasks, and algorithms for adverse drug reaction analysis in pharmacovigilance: A scoping review.

International journal of medical informatics·2026

Same journal

Development and validation of an interpretable machine learning model for predicting systemic inflammatory response syndrome after percutaneous nephrolithotomy: A multicenter study.

International journal of medical informatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Extracting language information from clinical notes using large language models.

Lingfei Qian¹, Na Hong¹, Yujia Zhou¹

¹Department of Biomedical Informatics and Data Science, Yale School of Medicine, Yale University, New Haven, CT, USA.

International Journal of Medical Informatics

|September 24, 2025

Summary

This summary is machine-generated.

Large language models (LLMs) can accurately extract patient language proficiency from clinical notes, improving healthcare equity. This named entity recognition (NER) pipeline enhances communication and resource allocation.

Keywords:

Cross-sites validation Large language models Named entity recognition Patient language information

More Related Videos

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Jan 17, 2026

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

A Metadata Extraction Approach for Clinical Case Reports to Enable Advanced Understanding of Biomedical Concepts

Published on: September 20, 2018

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Natural Language Processing
Clinical Informatics
Health Services Research

Background:

Patient language proficiency is crucial for equitable care and research.
Electronic health record (EHR) language data is often incomplete or inaccurate.
Heterogeneous documentation practices hinder multi-institutional data use.

Purpose of the Study:

Develop and evaluate a named entity recognition (NER) pipeline using large language models (LLMs).
Accurately extract detailed patient language status from unstructured clinical notes.
Enable scalable and generalizable language information extraction for research and practice.

Main Methods:

Defined four language status categories: fluent use, partial ability, lack of understanding, and unrelated mentions.
Annotated datasets from Yale New Haven Hospital (YNHH) and MIMIC-III.
Evaluated proprietary (GPT-4o) and open-source (LLaMA3, BERT) LLMs in zero-shot and fine-tuning settings.
Assessed cross-site generalizability.

Main Results:

GPT-4o achieved high F1 scores (87% YNHH, 82% MIMIC) without fine-tuning.
Fine-tuned open-source models (BERT, LLaMA3) showed comparable or superior performance.
LLMs, especially LLaMA3, demonstrated stronger cross-institutional generalizability than traditional models.
Unrelated language mentions were the most challenging category.

Conclusions:

The NER framework accurately extracts nuanced language information from clinical narratives.
High accuracy and generalizability support large-scale, language-focused research.
Implications include improved patient-provider communication, interpreter services, and equitable healthcare.