Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear.

Frequency-Domain Interpretation of PD Control

Frequency-Domain Interpretation of PD Control

Proportional-Derivative (PD) controllers are widely used in fan control systems to improve stability and performance. A fan control system can be effectively represented using a Bode plot to illustrate the impact of a PD controller through its transfer function. The Bode plot visually conveys how PD control modifies the fan's response across various frequencies, providing a frequency domain interpretation of the controller's behavior.
The proportional control gain, combined with the system's...

Determination of Expected Frequency

Determination of Expected Frequency

Suppose one wants to test independence between the two variables of a contingency table. The values in the table constitute the observed frequencies of the dataset. But how does one determine the expected frequency of the dataset? One of the important assumptions is that the two variables are independent, which means the variables do not influence each other. For independent variables, the statistical probability of any event involving both variables is calculated by multiplying the individual...

Perceiving Loudness, Pitch, and Location

Perceiving Loudness, Pitch, and Location

The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by identifying...

IR Frequency Region: Fingerprint Region

IR Frequency Region: Fingerprint Region

IR spectra are divided into two main regions: the diagnostic region and the fingerprint region. The diagnostic region of the spectrum lies above 1500 cm−1. The absorptions resulting from single-bond vibrations of the N–H, C–H, and O–H stretch at higher wavenumbers and appear on the left side of the spectrum. The stretching absorptions of the C≡C and C≡N occur between 2100–2300 cm−1. In contrast, those arising from stretching absorptions of the C=O, C=N, and C=C occur between 1600–1850 cm−1.
The...

Time and frequency -Domain Interpretation of Phase-lag Control

Time and frequency -Domain Interpretation of Phase-lag Control

Phase-lag controllers are widely used in control systems to improve stability and reduce steady-state errors. A dimmer switch controlling the brightness of a light bulb serves as a practical example of phase-lag control, gradually adjusting the bulb's brightness. Mathematically, phase-lag control or low-pass filtering is represented when the factor 'a' is less than 1.
Phase-lag controllers do not place a pole at zero, but instead influence the steady-state error by amplifying any finite,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Cyclooxygenase-dependent contribution to increased flow mediated dilatation following a week of repeated ischemic preconditioning.

European journal of applied physiology·2026

Same author

Hypertension Exacerbates Endothelial Dysfunction in Patients With Atrial Fibrillation.

Journal of clinical hypertension (Greenwich, Conn.)·2025

Same author

Parallel hierarchical encoding of linguistic representations in the human auditory cortex and recurrent automatic speech recognition systems.

bioRxiv : the preprint server for biology·2025

Same author

Iterative alignment discovery of speech-associated neural activity.

Journal of neural engineering·2024

Same author

Transfer functions for<i>Q<sub>A</sub></i>/<i>Q<sub>B</sub></i>international regulatory limits for the safe transport of radioactive materials.

Journal of radiological protection : official journal of the Society for Radiological Protection·2024

Same author

Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.

Scientific reports·2024

Same journal

High-resolution depth estimation for multiple wideband sources in deep sea via sparse Bayesian learninga).

The Journal of the Acoustical Society of America·2026

Same journal

Depression markers in speech: An approach based on tract variables dynamics.

The Journal of the Acoustical Society of America·2026

Same journal

The oyster toadfish (Opsanus tau) alters active and diurnal calling amid vessel noise in New York City.

The Journal of the Acoustical Society of America·2026

Same journal

Experimental noise characterisation of phase-locked tandem-rotor in edgewise flight.

The Journal of the Acoustical Society of America·2026

Same journal

The tune-text-temporal synergy: Prosodic effects of final segmental weakening in Neapolitan.

The Journal of the Acoustical Society of America·2026

Same journal

Monitoring vessel movement above critical offshore infrastructure using distributed acoustic sensing.

The Journal of the Acoustical Society of America·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 26, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Modulation frequency features for phoneme recognition in noisy speech.

Sriram Ganapathy¹, Samuel Thomas, Hynek Hermansky

¹Idiap Research Institute, Martigny, Switzerland. ganapathy@idiap.ch

The Journal of the Acoustical Society of America

|January 29, 2009

Summary

This summary is machine-generated.

A novel speech analysis method using modulation spectrum features improves phoneme recognition in telephone speech. This technique enhances performance without compromising accuracy in clean environments.

More Related Videos

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Related Experiment Videos

Last Updated: Jun 26, 2026

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Area of Science:

Speech processing
Machine learning
Signal analysis

Background:

Traditional speech recognition methods face challenges with telephone speech quality.
Feature extraction is crucial for accurate phoneme identification.

Purpose of the Study:

To introduce a new feature extraction technique for improved phoneme recognition in telephone speech.
To evaluate the proposed method against existing state-of-the-art techniques.

Main Methods:

Feature extraction based on modulation spectrum of subband temporal envelopes.
Autoregressive modeling of Hilbert envelopes in critical bands.
Application of static and dynamic compression to subband envelopes.
Machine recognition of phonemes using the extracted features.

Main Results:

Significant improvements in phoneme recognition rates for telephone speech.
Comparable performance to existing methods in clean speech conditions.
Detailed performance analysis across broad phonetic classes.

Conclusions:

The proposed modulation spectrum-based features offer a robust solution for telephone speech recognition.
This technique advances the field of speech analysis and machine recognition.
The method demonstrates superior performance, particularly in noisy or degraded speech conditions.