Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

The Cochlea

The Cochlea

The cochlea is a coiled structure in the inner ear that contains hair cells—the sensory receptors of the auditory system. Sound waves are transmitted to the cochlea by small bones attached to the eardrum called the ossicles, which vibrate the oval window that leads to the inner ear. This causes fluid in the chambers of the cochlea to move, vibrating the basilar membrane.

Reconstruction of Signal using Interpolation

Reconstruction of Signal using Interpolation

Signal processing techniques are essential for accurately converting continuous signals to digital formats and vice versa. When a continuous signal is sampled with a period T, the resulting sampled signal exhibits replicas of the original spectrum in the frequency domain, spaced at intervals equal to the sampling frequency. To handle this sampled signal, a zero-order hold method can be applied, which creates a piecewise constant signal by retaining each sample's value until the next...

Types Of Transformers

Types Of Transformers

Transformers can provide desired voltages to a circuit by modifying the number of turns in the secondary windings.
If the ratio of the number of turns in the secondary winding to that of the primary winding is greater than one, then the transformer is said to be a step-up transformer. In a step-up transformer, the voltage at the secondary winding is greater than the voltage applied at the primary winding.
However, if this ratio is less than one, the transformer is said to be a step-down...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Convolution Properties I

Convolution Properties I

Convolution computations can be simplified by utilizing their inherent properties.
The commutative property reveals that the input and the impulse response of an LTI (Linear Time-Invariant) system can be interchanged without affecting the output:

Convolution: Math, Graphics, and Discrete Signals

Convolution: Math, Graphics, and Discrete Signals

In any LTI (Linear Time-Invariant) system, the convolution of two signals is denoted using a convolution operator, assuming all initial conditions are zero. The convolution integral can be divided into two parts: the zero-input or natural response and the zero-state or forced response, with t0 indicating the initial time.
To simplify the convolution integral, it is assumed that both the input signal and impulse response are zero for negative time values. The graphical convolution process...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Deep learning-based environmental source separation and sound enhancement: Advancements for cochlear implant and normal hearing listeners.

The Journal of the Acoustical Society of America·2026

Same author

Capabilities of the CCi-MOBILE cochlear implant research platform for real-time sound coding.

The Journal of the Acoustical Society of America·2025

Same author

Multi-objective non-intrusive hearing-aid speech assessment model.

The Journal of the Acoustical Society of America·2024

Same author

Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition.

The Journal of the Acoustical Society of America·2024

Same author

Child-adult speech diarization in naturalistic conditions of preschool classrooms using room-independent ResNet model and automatic speech recognition-based re-segmentation.

The Journal of the Acoustical Society of America·2024

Same authorSame journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024

Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Attentive Training: A New Training Framework for Speech Enhancement.

IEEE/ACM transactions on audio, speech, and language processing·2023

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 1, 2025

Author Spotlight: Optimizing EAS with Long Electrodes for Enhanced Cochlear Coverage and Hearing Preservation

Author Spotlight: Optimizing EAS with Long Electrodes for Enhanced Cochlear Coverage and Hearing Preservation

Published on: October 11, 2024

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency

Nursadul Mamun¹, John H L Hansen¹

¹CRSS: Center for Robust Speech Systems; Cochlear Implant Processing Laboratory (CILab), Department of Electrical and Computer Engineering, University of Texas at Dallas, USA.

IEEE/ACM Transactions on Audio, Speech, and Language Processing

|January 20, 2025

Summary

This summary is machine-generated.

This study introduces a Deep Complex Convolution Transformer Network (DCCTN) to improve speech understanding for cochlear implant (CI) users by enhancing both speech magnitude and phase. The new method significantly boosts intelligibility in noisy environments.

Keywords:

Complex-valued Network Deep Neural Network Frequency Transformation Block Speech Enhancement Transformer U-Net

More Related Videos

Enhancing Electrode Location Assessment in Cochlear Implantation via Computed Tomography Image Fusion

Enhancing Electrode Location Assessment in Cochlear Implantation via Computed Tomography Image Fusion

Published on: January 17, 2025

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Published on: March 24, 2023

Related Experiment Videos

Last Updated: Jun 1, 2025

Author Spotlight: Optimizing EAS with Long Electrodes for Enhanced Cochlear Coverage and Hearing Preservation

Author Spotlight: Optimizing EAS with Long Electrodes for Enhanced Cochlear Coverage and Hearing Preservation

Published on: October 11, 2024

Enhancing Electrode Location Assessment in Cochlear Implantation via Computed Tomography Image Fusion

Enhancing Electrode Location Assessment in Cochlear Implantation via Computed Tomography Image Fusion

Published on: January 17, 2025

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Systematic Hearing Performance Evaluation Process for Adolescents with Cochlear Implantation at Early Ages

Published on: March 24, 2023

Area of Science:

Signal Processing
Machine Learning
Auditory Neuroscience

Background:

Cochlear implant (CI) users face communication challenges in noisy environments due to distorted speech signals.
Existing speech enhancement (SE) methods often neglect the crucial role of phase information in speech perception.

Purpose of the Study:

To develop a novel deep learning model for simultaneous enhancement of speech magnitude and phase spectra.
To improve speech intelligibility and quality for CI users in complex acoustic environments.

Main Methods:

A Deep Complex Convolution Transformer Network (DCCTN) was proposed, utilizing a complex-valued U-Net with a transformer in the bottleneck.
The network incorporates a frequency transformation block to capture speech harmonic correlations.
DCCTN learns a complex transformation matrix for time-frequency domain speech recovery.

Main Results:

DCCTN outperformed existing SE models (CRN, DCCRN, GCRN) in objective speech intelligibility and quality metrics.
Formal listener evaluations with CI recipients confirmed significant improvements in noisy conditions.
The model effectively suppressed non-stationary noise without introducing musical artifacts.

Conclusions:

The proposed DCCTN offers a superior approach to speech enhancement for CI users by addressing both magnitude and phase distortions.
This method holds promise for enhancing real-world communication for individuals with hearing impairments.