Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Non-Verbal Cues

Non-Verbal Cues

Non-verbal communication extends beyond gestures and facial expressions to include vocal elements known as paralanguage. Paralanguage consists of non-verbal vocal cues such as pitch, loudness, speech rate, pauses, and non-verbal vocalizations like laughter, sighs, and moans. These elements not only accompany speech but also provide critical emotional and contextual information.The Role of Paralanguage in CommunicationParalanguage adds depth to spoken language by conveying emotions and...

Difference from Background: Limit of Detection

Difference from Background: Limit of Detection

The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...

Prosopagnosia

Prosopagnosia

Prosopagnosia, also known as face blindness, is the inability to recognize faces. In severe cases, individuals with prosopagnosia may not recognize close family members, including parents and spouses, by their faces. For instance, someone with prosopagnosia might walk past their child in a crowd, only realizing their mistake upon noticing their child's distinctive backpack or favorite jacket. Prosopagnosia specifically impairs facial recognition, while the recognition of other objects or...

Classification of Signals

Classification of Signals

In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The Noor Project: fair transformer transfer learning for autism spectrum disorder recognition from speech.

Frontiers in digital health·2025

Same author

Self-Supervised Video-Centralised Transformer for Video Face Clustering.

IEEE transactions on pattern analysis and machine intelligence·2023

Same author

End-to-End Video-to-Speech Synthesis Using Generative Adversarial Networks.

IEEE transactions on cybernetics·2022

Same author

FP-Age: Leveraging Face Parsing Attention for Facial Age Estimation in the Wild.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2022

Same author

Speech-Driven Facial Animations Improve Speech-in-Noise Comprehension of Humans.

Frontiers in neuroscience·2022

Same author

Fast Algorithms for Fitting Active Appearance Models to Unconstrained Images.

International journal of computer vision·2020

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 31, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Discrimination Between Native and Non-Native Speech Using Visual Features Only.

Christos Georgakis, Stavros Petridis, Maja Pantic

IEEE Transactions on Cybernetics

|October 30, 2015

Summary

This summary is machine-generated.

This study shows that visual cues from speech can identify non-native English speakers, even without audio. Appearance features, using hidden Markov models, achieved 76.5% accuracy in distinguishing accents.

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

Related Experiment Videos

Last Updated: Mar 31, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Using Eye Movements Recorded in the Visual World Paradigm to Explore the Online Processing of Spoken Language

Published on: October 13, 2018

Area of Science:

Biometrics
Speech Processing
Computer Vision

Background:

Accent classification traditionally relies on audio features.
Visual speech analysis offers a complementary or alternative approach.
Temporal visual speech dynamics can reveal accent characteristics.

Purpose of the Study:

To investigate the effectiveness of visual speech dynamics for accent identification without audio.
To develop an automated system for discriminating native from non-native English speech using only visual cues.
To evaluate the performance of different visual features for this task.

Main Methods:

Developed a fully automated approach using exclusively visual speech information.
Systematically evaluated appearance and shape features.
Employed fusion of five hidden Markov models trained on appearance features.
Conducted subject-independent cross-validation on the MOBIO database.

Main Results:

Appearance features consistently outperformed shape features.
Achieved a high performance of 76.5% accuracy on a text-dependent protocol.
The framework demonstrated efficiency on unseen speech examples, though with reduced accuracy.

Conclusions:

Temporal visual speech dynamics are valuable for accent classification, especially in audio-degraded conditions.
Appearance-based features combined with hidden Markov models provide a robust method for accent identification.
Visual-only accent recognition shows promise for applications where audio is unavailable or unreliable.