Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Perceiving Loudness, Pitch, and Location

Perceiving Loudness, Pitch, and Location

The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

BioNet-A: Ultrasonic echo representation network for target discrimination using active SONAR.

The Journal of the Acoustical Society of America·2026

Same author

Dissociation in cross-feature integration between behavioral and pupil dilation responses in auditory deviant detection.

iScience·2026

Same author

Object and setting identification in natural auditory scenesa).

The Journal of the Acoustical Society of America·2026

Same author

The shape of attention reflects flexible filtering of natural speech modulations.

Communications biology·2026

Same author

Auditory deviance detection across time scales: Effects of local and global context.

JASA express letters·2026

Same author

Perception of dynamic multi-speaker auditory scenes under different modes of attention.

NeuroImage·2026

Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024

Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 26, 2026

A Method to Study Adaptation to Left-Right Reversed Audition

A Method to Study Adaptation to Left-Right Reversed Audition

Published on: October 29, 2018

Feedback-Driven Sensory Mapping Adaptation for Robust Speech Activity Detection.

Ashwin Bellur¹, Mounya Elhilali¹

¹Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA.

IEEE/ACM Transactions on Audio, Speech, and Language Processing

|July 25, 2017

Summary

This summary is machine-generated.

This study introduces a novel computational framework for speech activity detection inspired by the human auditory system. By adapting neural representations, the model achieves robust performance in noisy acoustic environments, reducing errors in challenging conditions.

Keywords:

Adaptation gabor filters genetic algorithm spectrotemporal filters speech activity detection

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Published on: October 24, 2012

Related Experiment Videos

Last Updated: Feb 26, 2026

A Method to Study Adaptation to Left-Right Reversed Audition

A Method to Study Adaptation to Left-Right Reversed Audition

Published on: October 29, 2018

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Published on: October 24, 2012

Area of Science:

Computational Auditory Scene Analysis
Bio-inspired Signal Processing
Machine Learning for Audio

Background:

Parsing complex acoustic scenes is challenging for computational systems due to data mismatch.
The human auditory system excels at segmenting soundscapes by adapting neural representations.
Existing data-driven audio processing systems struggle with real-world noisy conditions.

Purpose of the Study:

To develop a robust speech activity detection system inspired by biological auditory principles.
To mimic the brain's adaptation of neural representations for improved sound parsing.
To address the data mismatch problem in computational audio processing.

Main Methods:

Proposed a framework mimicking the auditory system's adaptation of neural input in high-dimensional space.
Employed a 2-D Gabor filter bank with parameters retuned offline.
Used feedback from statistical models to minimize misclassification risk for speech and nonspeech sounds.

Main Results:

The adapted system demonstrated robustness to novel acoustic conditions.
Achieved a marked reduction in equal error rates across various noisy databases.
Showcased enhanced separability between speech and nonspeech features in a high-dimensional space.

Conclusions:

Biological auditory system principles offer effective strategies for robust audio processing.
Adapting neural representations is key to overcoming data mismatch in challenging acoustic environments.
The developed framework advances the creation of intelligent audio processing systems.