Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Super-resolution Fluorescence Microscopy

Super-resolution Fluorescence Microscopy

Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...

Upsampling

Upsampling

Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...

Difference from Background: Limit of Detection

Difference from Background: Limit of Detection

The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...

Deconvolution

Deconvolution

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

Downsampling

Downsampling

When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...

Confocal Fluorescence Microscopy

Confocal Fluorescence Microscopy

Confocal microscopy is an advanced microscopic technique. The prime advantage of the confocal microscope over other microscopy techniques is its ability to block the out-of-focus light from the illuminated samples using pinholes. It is widely used with fluorescence optics to obtain high-resolution, sharp contrast images. Unlike optical microscopes, confocal microscopes use a focused beam of light laser to scan the entire sample surface at different z-planes. These microscopes are, therefore,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Co-removal of norfloxacin and Cr(VI) by Co/N-doped carbon activating peroxymonosulfate: <sup>1</sup>O<sub>2</sub> oxidation coupled with interfacial electron transfer.

Environmental research·2026

Same author

A speech prediction model based on codec modeling and transformer decoding.

Computer speech & language·2026

Same author

A Molecular Trimming Strategy for Hypoxia-Tolerant Photosensitizers With Enhanced cGAS-STING Activation.

Angewandte Chemie (International ed. in English)·2026

Same author

Towards decoupling frontend enhancement and backend recognition in monaural robust ASR.

Computer speech & language·2026

Same author

Colocalization of eQTLs With Type 2 Diabetes and Glycemic Traits Using Whole-Genome Sequences in Diverse Populations From the NHLBI Trans-Omics in Precision Medicine (TOPMed) Program.

Diabetes·2026

Same author

Embodied Transboundary Greenhouse Gas Emissions and Mitigation Opportunities of over 700 Belt and Road Initiative Projects.

Environmental science & technology·2026

Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM transactions on audio, speech, and language processing·2025

Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024

Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023

Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 22, 2025

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Published on: February 10, 2023

Towards Robust Speech Super-resolution.

Heming Wang¹, DeLiang Wang²

¹Department of Computer Science and Engineering, The Ohio State University, OH 43210 USA.

IEEE/ACM Transactions on Audio, Speech, and Language Processing

|August 30, 2021

Summary

This summary is machine-generated.

This study introduces a convolutional neural network (CNN) for speech super-resolution (SR), enhancing low-resolution audio by generating high-frequency components. The novel CNN model outperforms existing deep neural network (DNN) methods and improves robustness to various microphone and downsampling conditions.

Keywords:

Speech super-resolution bandwidth extension convolutional neural network robust speech super-resolution

More Related Videos

Test Samples for Optimizing STORM Super-Resolution Microscopy

Test Samples for Optimizing STORM Super-Resolution Microscopy

Published on: September 6, 2013

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Published on: February 12, 2014

Related Experiment Videos

Last Updated: Oct 22, 2025

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Published on: February 10, 2023

Test Samples for Optimizing STORM Super-Resolution Microscopy

Test Samples for Optimizing STORM Super-Resolution Microscopy

Published on: September 6, 2013

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Published on: February 12, 2014

Area of Science:

Signal Processing
Machine Learning
Audio Engineering

Background:

Speech super-resolution (SR) aims to reconstruct high-frequency components of degraded speech signals.
Existing deep neural network (DNN) models for SR face challenges with robustness to variations in microphone channels and downsampling methods.

Purpose of the Study:

To propose a novel convolutional neural network (CNN) based speech super-resolution model.
To leverage both time and frequency domain information for improved SR performance.
To investigate and enhance the robustness of SR models against diverse real-world conditions.

Main Methods:

A time-domain CNN is developed, accepting raw low-resolution speech waveforms as input.
A cross-domain loss function is employed during network training for optimization.
The proposed CNN model is compared against several established DNN-based SR approaches.

Main Results:

The proposed CNN-based SR model demonstrates superior performance compared to existing DNN models.
The study investigates the robustness of DNN-based SR models concerning microphone channels and downsampling schemes.
Improved generalization capabilities are achieved for untrained microphone channels and unknown downsampling schemes through proper training and preprocessing.

Conclusions:

The developed CNN model offers an effective solution for speech super-resolution, outperforming current DNN methods.
Addressing robustness issues is crucial for practical deployment of SR systems.
Strategic training and preprocessing enhance the adaptability of SR models to varied audio acquisition conditions.