Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Language Development01:22

Language Development

318
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
318
Language and Cognition01:27

Language and Cognition

325
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
325
Higher Mental Functions of the Brain: Language01:10

Higher Mental Functions of the Brain: Language

744
Language is a system of communication that allows the expression of thoughts, ideas, and feelings. The brain processes language in both hemispheres.
Language formation and comprehension take place in the dominant hemisphere. The dominant hemisphere is responsible for understanding the meaning of spoken, written, or sign language, as well as the ability to communicate. For most people, the left hemisphere is the dominant one. The right hemisphere, then, gives tone and emotional context to the...
744
Typical Model Studies01:30

Typical Model Studies

340
Fluid mechanics model studies often utilize scaled-down systems to predict fluid behavior in full-scale environments, such as river flows, dam spillways, and structures interacting with open surfaces. Maintaining Froude number similarity in river models is crucial, as it replicates surface flow features like wave patterns and velocities.
340
Modeling and Similitude01:12

Modeling and Similitude

245
Scaled modeling is a fundamental technique in engineering, enabling the study of large and complex systems by creating smaller, manageable replicas that recreate critical characteristics of the original. In hydrology and civil infrastructure, for example, scaled models of dams help analyze water flow, turbulence, and pressure. This method allows for accurate predictions of real-world behavior within a controlled environment, significantly reducing the cost and time involved in full-scale...
245
Stereotype Content Model02:16

Stereotype Content Model

14.0K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
14.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Top-down effect of dialogue coherence on perceived speaker identity.

Scientific reportsĀ·2023
Same author

Predictive coding across the left fronto-temporal hierarchy during language comprehension.

Cerebral cortex (New York, N.Y. : 1991)Ā·2022
Same author

Cognitive cascades: How to model (and potentially counter) the spread of fake news.

PloS oneĀ·2022
Same author

Evaluating Models of Gesture and Speech Production for People With Aphasia.

Cognitive scienceĀ·2020
Same author

Going the Extra Mile: Effects of Discourse Context on Two Late Positivities During Language Comprehension.

Neurobiology of language (Cambridge, Mass.)Ā·2020
Same author

Neural Evidence for the Prediction of Animacy Features during Language Comprehension: Evidence from MEG and EEG Representational Similarity Analysis.

The Journal of neuroscience : the official journal of the Society for NeuroscienceĀ·2020
Same journal

Pronoun Resolution in Turkish: The Interplay of Referential Form, Word Order, and Implicit Causality.

Cognitive scienceĀ·2026
Same journal

What's in a Color?: Language, Synesthesia, and Categorical Perception.

Cognitive scienceĀ·2026
Same journal

Reasoning Beyond Explicit Rules: Adults' and Children's Use of Closure Principles in Novel Cases.

Cognitive scienceĀ·2026
Same journal

Intermediary Object States Are Activated by Sentences Describing Completed Events.

Cognitive scienceĀ·2026
Same journal

Large Language Models Estimate Fine-Grained Human Color-Concept Associations.

Cognitive scienceĀ·2026
Same journal

Computational Models of Causal Reasoning: Bayesian Accounts of Normative Violations.

Cognitive scienceĀ·2026
See all related articles

Related Experiment Video

Updated: Jun 6, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K

Can Language Models Trained on Written Monologue Learn to Predict Spoken Dialogue?

Muhammad Umair1, Julia B Mertens2, Lena Warnke2

  • 1Department of Computer Science, Tufts University.

Cognitive Science
|November 26, 2024
PubMed
Summary
This summary is machine-generated.

Large Language Models (LLMs) can generate human-like text but struggle to predict spoken language influenced by speaker identity. These models capture linguistic patterns but not the nuances of natural human conversation.

Keywords:
Generative pretrained transformersLanguage in interactionNatural language processing

More Related Videos

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

403
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

9.9K

Related Experiment Videos

Last Updated: Jun 6, 2025

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

403
Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language
09:27

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

9.9K

Area of Science:

  • Computational Linguistics
  • Cognitive Science
  • Artificial Intelligence

Background:

  • Transformer-based Large Language Models (LLMs) demonstrate significant capabilities in text generation.
  • The ability of LLMs to model spoken language, particularly in interactive contexts, remains underexplored.
  • Spoken and written language exhibit distinct characteristics in syntax, pragmatics, and conversational norms.

Purpose of the Study:

  • To evaluate Large Language Models (LLMs) as predictive models of spoken dialogue.
  • To investigate whether LLMs can learn that speaker identity influences utterance predictability in conversation.
  • To compare LLM predictions with human behavioral data in natural spoken interactions.

Main Methods:

  • Fine-tuning two variants of GPT-2 on English spoken dialogue transcripts.
  • Calculating word surprisal values for two-turn conversational sequences.
  • Comparing model-generated surprisal data with human behavioral responses regarding speaker identity.

Main Results:

  • Fine-tuned LLMs showed that word predictability is influenced by speaker identity.
  • However, the models did not replicate the way humans utilize speaker identity information in predicting spoken language.
  • LLM performance indicated a divergence from human conversational behavior.

Conclusions:

  • LLMs can learn normative linguistic structures from spoken dialogue data.
  • Current LLMs do not fully capture the pragmatic and social factors, such as speaker identity, that shape natural human conversation.
  • Further research is needed to enhance LLMs' ability to model the complexities of interactive spoken language.