Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The effect of empagliflozin on inflammation in patients with overweight or obesity and risk of heart failure: a substudy from the Empire Prevent Metabolic trial.

Cardiovascular diabetology·2026

Same author

A Randomized Clinical Implementation Trial Testing a Digital Strategy to Increase SGLT2 Inhibitor Initiation in Heart Failure.

JACC. Heart failure·2026

Same author

Exploring Dysphonic Artificial Intelligence Voice Cloning for Speech Intelligibility in Noise.

Journal of voice : official journal of the Voice Foundation·2026

Same author

Care and Outcomes of New-Onset Heart Failure Across Vulnerable Populations: Temporal Trends in Mortality and Treatment in a Danish Nationwide Cohort From 2003 to 2022.

Journal of the American Heart Association·2026

Same author

Design and rationale of the EMPagliflozin after Aortic Valve Replacement (EMPAVR) study: A randomized clinical trial.

American heart journal·2026

Same author

The Effect of Empagliflozin on Left Ventricular Mass and Volumes in Older Adult Individuals With Overweight and High Risk of Heart Failure: The Empire Prevent Cardiac Trial.

Journal of cardiac failure·2026

Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024

Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 12, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Speech Intelligibility Prediction using Spectro-Temporal Modulation Analysis.

Amin Edraki¹, Wai-Yip Chan¹, Jesper Jensen²

¹Department of Electrical and Computer Engineering, Queen's University, Kingston, ON K7L 3N6, Canada.

IEEE/ACM Transactions on Audio, Speech, and Language Processing

|March 22, 2021

Summary

This summary is machine-generated.

This study introduces wSTMI, a novel speech intelligibility prediction algorithm for normal-hearing listeners. It effectively predicts speech understanding in noisy conditions by analyzing spectro-temporal modulations.

Keywords:

spectro-temporal modulation speech intelligibility speech quality model

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Related Experiment Videos

Last Updated: Nov 12, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Area of Science:

Auditory Neuroscience
Signal Processing
Speech Perception

Background:

Spectro-temporal modulations are crucial for speech sound analysis in the auditory cortex.
Human speech comprehension remains robust in challenging acoustic environments.

Purpose of the Study:

To propose an intrusive speech intelligibility prediction (SIP) algorithm, wSTMI, for normal-hearing listeners.
To leverage spectro-temporal modulation analysis (STMA) for predicting speech intelligibility in degraded conditions.

Main Methods:

Utilized spectro-temporal modulation analysis (STMA) on clean and degraded speech signals.
Employed a sparse linear model with Lasso regression to combine modulation frequency channel measures.
Optimized parameters by selecting the 8 most salient modulation frequency channels.

Main Results:

The wSTMI algorithm demonstrated consistent performance across 13 diverse datasets.
Evaluated conditions included modulated noise, noise reduction, reverberation, and speech interruption.
Compared wSTMI against 10 other existing SIP algorithms, showing superior or comparable results.

Conclusions:

The optimized parameters of wSTMI align with human auditory system's modulation transfer functions.
The proposed algorithm provides evidence supporting perceptual characteristics of speech intelligibility.
wSTMI offers a robust method for speech intelligibility prediction in various acoustic challenges.