Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

1.2K
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
1.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

BioNet-A: Ultrasonic echo representation network for target discrimination using active SONAR.

The Journal of the Acoustical Society of America·2026
Same author

Dissociation in cross-feature integration between behavioral and pupil dilation responses in auditory deviant detection.

iScience·2026
Same author

Object and setting identification in natural auditory scenesa).

The Journal of the Acoustical Society of America·2026
Same author

The shape of attention reflects flexible filtering of natural speech modulations.

Communications biology·2026
Same author

Auditory deviance detection across time scales: Effects of local and global context.

JASA express letters·2026
Same author

Perception of dynamic multi-speaker auditory scenes under different modes of attention.

NeuroImage·2026
Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025
Same journal

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM transactions on audio, speech, and language processing·2025
Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024
Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023
Same journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023
Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023
See all related articles

Related Experiment Video

Updated: Feb 26, 2026

A Method to Study Adaptation to Left-Right Reversed Audition
07:14

A Method to Study Adaptation to Left-Right Reversed Audition

Published on: October 29, 2018

6.9K

Feedback-Driven Sensory Mapping Adaptation for Robust Speech Activity Detection.

Ashwin Bellur1, Mounya Elhilali1

  • 1Department of Electrical and Computer Engineering, Johns Hopkins University, Baltimore, MD 21218 USA.

IEEE/ACM Transactions on Audio, Speech, and Language Processing
|July 25, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel computational framework for speech activity detection inspired by the human auditory system. By adapting neural representations, the model achieves robust performance in noisy acoustic environments, reducing errors in challenging conditions.

Keywords:
Adaptationgabor filtersgenetic algorithmspectrotemporal filtersspeech activity detection

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example
08:45

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Published on: October 24, 2012

15.3K

Related Experiment Videos

Last Updated: Feb 26, 2026

A Method to Study Adaptation to Left-Right Reversed Audition
07:14

A Method to Study Adaptation to Left-Right Reversed Audition

Published on: October 29, 2018

6.9K
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

2.0K
Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example
08:45

Mapping Cortical Dynamics Using Simultaneous MEG/EEG and Anatomically-constrained Minimum-norm Estimates: an Auditory Attention Example

Published on: October 24, 2012

15.3K

Area of Science:

  • Computational Auditory Scene Analysis
  • Bio-inspired Signal Processing
  • Machine Learning for Audio

Background:

  • Parsing complex acoustic scenes is challenging for computational systems due to data mismatch.
  • The human auditory system excels at segmenting soundscapes by adapting neural representations.
  • Existing data-driven audio processing systems struggle with real-world noisy conditions.

Purpose of the Study:

  • To develop a robust speech activity detection system inspired by biological auditory principles.
  • To mimic the brain's adaptation of neural representations for improved sound parsing.
  • To address the data mismatch problem in computational audio processing.

Main Methods:

  • Proposed a framework mimicking the auditory system's adaptation of neural input in high-dimensional space.
  • Employed a 2-D Gabor filter bank with parameters retuned offline.
  • Used feedback from statistical models to minimize misclassification risk for speech and nonspeech sounds.

Main Results:

  • The adapted system demonstrated robustness to novel acoustic conditions.
  • Achieved a marked reduction in equal error rates across various noisy databases.
  • Showcased enhanced separability between speech and nonspeech features in a high-dimensional space.

Conclusions:

  • Biological auditory system principles offer effective strategies for robust audio processing.
  • Adapting neural representations is key to overcoming data mismatch in challenging acoustic environments.
  • The developed framework advances the creation of intelligent audio processing systems.