Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Aliasing01:18

Aliasing

Accurate signal sampling and reconstruction are crucial in various signal-processing applications. A time-domain signal's spectrum can be revealed using its Fourier transform. When this signal is sampled at a specific frequency, it results in multiple scaled replicas of the original spectrum in the frequency domain. The spacing of these replicas is determined by the sampling frequency.
If the sampling frequency is below the Nyquist rate, these replicas overlap, preventing the original signal...
Frequency-Domain Interpretation of PD Control01:24

Frequency-Domain Interpretation of PD Control

Proportional-Derivative (PD) controllers are widely used in fan control systems to improve stability and performance. A fan control system can be effectively represented using a Bode plot to illustrate the impact of a PD controller through its transfer function. The Bode plot visually conveys how PD control modifies the fan's response across various frequencies, providing a frequency domain interpretation of the controller's behavior.
The proportional control gain, combined with the system's...
IR Spectrum Peak Splitting: Symmetric vs Asymmetric Vibrations01:08

IR Spectrum Peak Splitting: Symmetric vs Asymmetric Vibrations

Identical bonds within a polyatomic group can stretch symmetrically (in-phase) or asymmetrically (out-of-phase). Similar to hydrogen bonding, these vibrations also influence the shape of the IR peak. Generally, asymmetric stretching frequencies are higher than symmetric stretching frequencies. For example, primary amines exhibit two distinct IR peaks between 3300–3500 cm−1 corresponding to the symmetric and asymmetric N-H stretching, while secondary amines exhibit a single stretching vibration...
Bandpass Sampling01:17

Bandpass Sampling

In signal processing, bandpass sampling is an effective technique for sampling signals that have most of their energy concentrated within a narrow frequency band. This type of signal is known as a bandpass signal. The key principle of bandpass sampling involves sampling the signal at a rate that is greater than twice the signal's bandwidth to prevent aliasing.
A bandpass signal has a spectrum with a lower frequency limit, denoted as ω1, and an upper frequency limit, denoted as ω2. The spectrum...
Effective Value of a Periodic Waveform01:07

Effective Value of a Periodic Waveform

The concept of effective value, the root mean square (RMS) value, is crucial in understanding electrical circuits and power delivery. This idea emerges from the necessity to measure the effectiveness of a voltage or current source in supplying power to a resistive load.
The effective value of a periodic current represents the direct current (DC) that conveys the same average power to a resistor as the periodic current itself. This concept is crucial when assessing AC circuits. To determine the...
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cyclooxygenase-dependent contribution to increased flow mediated dilatation following a week of repeated ischemic preconditioning.

European journal of applied physiology·2026
Same author

Hypertension Exacerbates Endothelial Dysfunction in Patients With Atrial Fibrillation.

Journal of clinical hypertension (Greenwich, Conn.)·2025
Same author

Parallel hierarchical encoding of linguistic representations in the human auditory cortex and recurrent automatic speech recognition systems.

bioRxiv : the preprint server for biology·2025
Same author

Iterative alignment discovery of speech-associated neural activity.

Journal of neural engineering·2024
Same author

Transfer functions for<i>Q<sub>A</sub></i>/<i>Q<sub>B</sub></i>international regulatory limits for the safe transport of radioactive materials.

Journal of radiological protection : official journal of the Society for Radiological Protection·2024
Same author

Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.

Scientific reports·2024
Same journal

Sibilant differentiation before and after tongue cancer surgery: Acoustics, kinematics and the role of sensorimotor controla).

The Journal of the Acoustical Society of America·2026
Same journal

BioNet-A: Ultrasonic echo representation network for target discrimination using active SONAR.

The Journal of the Acoustical Society of America·2026
Same journal

Empty soft-drink cans and mass-loaded rods: Analogous homework problems from acoustic and mechanical domains.

The Journal of the Acoustical Society of America·2026
Same journal

Erratum: Statistical wave field theory: Anisotropic wave fields under Neumann's boundary condition [J. Acoust. Soc. Am. 159(3), 2265-2280 (2026)].

The Journal of the Acoustical Society of America·2026
Same journal

On the modification of tip leakage noise sources by porous treatment.

The Journal of the Acoustical Society of America·2026
Same journal

An educational opportunity: Acoustics in an empty room.

The Journal of the Acoustical Society of America·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2026

Memorization-Based Training and Testing Paradigm for Robust Vocal Identity Recognition in Expressive Speech Using Event-Related Potentials Analysis
05:48

Memorization-Based Training and Testing Paradigm for Robust Vocal Identity Recognition in Expressive Speech Using Event-Related Potentials Analysis

Published on: August 9, 2024

Temporal envelope compensation for robust phoneme recognition using modulation spectrum.

Sriram Ganapathy1, Samuel Thomas, Hynek Hermansky

  • 1Department of Electrical and Computer Engineering, Human Language Technology Center of Excellence, Johns Hopkins University, Baltimore, Maryland 21218, USA. ganapathy@jhu.edu

The Journal of the Acoustical Society of America
|January 12, 2011
PubMed
Summary
This summary is machine-generated.

This study introduces a new method for phoneme recognition using modulation frequencies derived from speech signals. The technique significantly improves accuracy in noisy environments through noise compensation and gain normalization.

More Related Videos

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology
05:38

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Related Experiment Videos

Last Updated: Jun 5, 2026

Memorization-Based Training and Testing Paradigm for Robust Vocal Identity Recognition in Expressive Speech Using Event-Related Potentials Analysis
05:48

Memorization-Based Training and Testing Paradigm for Robust Vocal Identity Recognition in Expressive Speech Using Event-Related Potentials Analysis

Published on: August 9, 2024

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology
05:38

Interaction between Phonological and Semantic Processes in Visual Word Recognition using Electrophysiology

Published on: June 29, 2021

Area of Science:

  • Speech processing
  • Acoustic phonetics
  • Machine learning for speech recognition

Background:

  • Traditional phoneme recognition methods struggle with performance degradation in noisy conditions.
  • Feature extraction is crucial for robust speech signal representation.
  • Existing techniques often fail to adequately compensate for environmental distortions.

Purpose of the Study:

  • To propose a robust feature extraction technique for phoneme recognition.
  • To enhance feature robustness against various noise conditions.
  • To analyze the contribution of different processing stages for improved speech signal representation.

Main Methods:

  • Deriving modulation frequency components from speech signals using syllable-length segments and frequency domain linear prediction.
  • Implementing a noise compensation technique by subtracting noise envelope estimates from speech envelopes.
  • Applying gain normalization and static/dynamic compression to temporal envelopes.
  • Evaluating features in automatic phoneme recognition tasks under mismatched conditions.

Main Results:

  • The proposed features demonstrate substantial improvements in phoneme recognition rates compared to other techniques.
  • The noise compensation and gain normalization significantly enhance robustness in noisy environments.
  • Effective performance was observed even with mismatched training and testing data, including telephone channel noise, additive noise, and reverberation.

Conclusions:

  • The developed modulation frequency features offer a robust solution for phoneme recognition, particularly in adverse acoustic conditions.
  • The proposed noise compensation and gain normalization techniques are effective in suppressing additive noise and improving overall system performance.
  • This approach provides a significant advancement for automatic speech recognition systems operating in real-world scenarios.