A model for multitalker speech perception.

Soundararajan Srinivasan¹, DeLiang Wang

¹Biomedical Engineering Department, The Ohio State University, Columbus, Ohio 43210, USA. srinivasan.36@osu.edu

The Journal of the Acoustical Society of America

December 3, 2008

Summary

This summary is machine-generated.

This study presents a computational model of speech perception in multitalker environments. It accounts for both energetic masking (parts of the target speech made inaudible by overlapping speech) and informational masking (failure to segregate competing speech streams even when they are audible). The model's predictions broadly agree with human speech perception performance in multitalker situations.

Area of Science:

Auditory neuroscience
Computational linguistics
Speech processing

Background:

Speech perception is challenging in multitalker environments due to energetic and informational masking.
Energetic masking occurs when speech signals overlap in time and frequency, rendering parts of the target inaudible.
Informational masking arises from the inability to segregate competing speech streams, even when audible.

Purpose of the Study:

To present a computational model of multitalker speech perception.
To account for both energetic and informational masking effects.
To evaluate the model's agreement with human perceptual data.

Main Methods:

Modeled energetic masking using a speech recognizer treating masked time-frequency units as missing data.
Modeled informational masking via speech separation errors in target segregation.
Systematically evaluated model performance against a recent perceptual study.
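
The missing-data idea in the methods above can be sketched in a few lines. This is an illustration only, not the authors' implementation: it assumes a binary mask derived from the local target-to-masker energy ratio and a diagonal-Gaussian acoustic model, neither of which is specified in this summary. The function names, the `threshold_db` parameter, and the toy energies are hypothetical.

```python
import math

def binary_mask(target_energy, masker_energy, threshold_db=0.0):
    """Label each time-frequency unit 1 (target dominant) or 0 (masked).

    Assumption: a unit counts as reliable when the local target-to-masker
    ratio exceeds threshold_db. Energies must be strictly positive.
    """
    mask = []
    for t_row, m_row in zip(target_energy, masker_energy):
        row = []
        for t, m in zip(t_row, m_row):
            ratio_db = 10.0 * math.log10(t / m)
            row.append(1 if ratio_db > threshold_db else 0)
        mask.append(row)
    return mask

def missing_data_log_likelihood(frame, mask_row, means, variances):
    """Diagonal-Gaussian log-likelihood of one frame using reliable units only.

    Masked units are treated as missing and marginalized out, so they add
    nothing to the sum.
    """
    total = 0.0
    for x, reliable, mu, var in zip(frame, mask_row, means, variances):
        if reliable:
            total += -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
    return total

# Toy example: 2 frames x 2 frequency channels (hypothetical energies).
target = [[4.0, 1.0], [0.5, 8.0]]
masker = [[1.0, 2.0], [2.0, 1.0]]
mask = binary_mask(target, masker)

# Score the first frame against a hypothetical model state.
score = missing_data_log_likelihood([1.0, 5.0], mask[0], [1.0, 0.0], [1.0, 1.0])
```

In this sketch, energetic masking appears as the zeros in the mask. Informational masking, modeled in the study through speech separation errors, could be mimicked here by deliberately mislabeling some units before scoring, though how the authors generate such errors is not described in this summary.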

Main Results:

The computational model demonstrated broad agreement with human perceptual study results.
The model successfully simulated the effects of both energetic and informational masking.
Performance evaluation confirmed the model's predictive capabilities.

Conclusions:

The proposed computational model effectively captures key aspects of multitalker speech perception.
The model provides a framework for understanding and predicting listener performance in complex auditory scenes.
This work contributes to the fields of speech processing and auditory modeling.