Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Super-resolution Fluorescence Microscopy01:37

Super-resolution Fluorescence Microscopy

10.6K
Super-resolution fluorescence microscopy (SRFM) provides a better resolution than conventional fluorescence microscopy by reducing the point spread function (PSF). PSF is the light intensity distribution from a point that causes it to appear blurred. Due to PSF, each fluorescing point appears bigger than its actual size, and it is the PSF interference of nearby fluorophores that causes the blurred image. Various approaches to achieving higher resolution through SRFM have recently been...
10.6K
Upsampling01:22

Upsampling

364
Managing signal sampling rates is essential in digital signal processing to maintain signal integrity. A decimated signal, characterized by a reduced frequency range due to its lower sampling rate, can be upsampled by inserting zeros between each sample. This upsampling process expands the original spectrum and introduces repeated spectral replicas at intervals dictated by the new Nyquist frequency. To refine this zero-inserted sequence, it is passed through a lowpass filter with a cutoff...
364
Difference from Background: Limit of Detection01:05

Difference from Background: Limit of Detection

7.4K
The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...
7.4K
Deconvolution01:20

Deconvolution

326
Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...
326
Downsampling01:20

Downsampling

324
When considering a sampled sequence with zero values between sampling instants, one can replace it by taking every N-th value of the sequence. At these integer multiples of N, the original and sampled sequences coincide. This process, known as decimation, involves extracting every N-th sample from a sequence, thereby creating a more efficient sequence.
The Fourier transform of the decimated sequence reveals a combination of scaled and shifted versions of the original spectrum. This...
324
Confocal Fluorescence Microscopy01:16

Confocal Fluorescence Microscopy

17.9K
Confocal microscopy is an advanced microscopic technique. The prime advantage of the confocal microscope over other microscopy techniques is its ability to block the out-of-focus light from the illuminated samples using pinholes. It is widely used with fluorescence optics to obtain high-resolution, sharp contrast images. Unlike optical microscopes, confocal microscopes use a focused beam of light laser to scan the entire sample surface at different z-planes. These microscopes are, therefore,...
17.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Co-removal of norfloxacin and Cr(VI) by Co/N-doped carbon activating peroxymonosulfate: <sup>1</sup>O<sub>2</sub> oxidation coupled with interfacial electron transfer.

Environmental research·2026
Same author

A speech prediction model based on codec modeling and transformer decoding.

Computer speech & language·2026
Same author

A Molecular Trimming Strategy for Hypoxia-Tolerant Photosensitizers With Enhanced cGAS-STING Activation.

Angewandte Chemie (International ed. in English)·2026
Same author

Towards decoupling frontend enhancement and backend recognition in monaural robust ASR.

Computer speech & language·2026
Same author

Colocalization of eQTLs With Type 2 Diabetes and Glycemic Traits Using Whole-Genome Sequences in Diverse Populations From the NHLBI Trans-Omics in Precision Medicine (TOPMed) Program.

Diabetes·2026
Same author

Embodied Transboundary Greenhouse Gas Emissions and Mitigation Opportunities of over 700 Belt and Road Initiative Projects.

Environmental science & technology·2026
Same journal

<math></math> Estimation and Voicing Detection With Cascade Architecture in Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2025
Same journal

Speech Enhancement for Cochlear Implant Recipients using Deep Complex Convolution Transformer with Frequency Transformation.

IEEE/ACM transactions on audio, speech, and language processing·2025
Same journal

Selective Acoustic Feature Enhancement for Speech Emotion Recognition With Noisy Speech.

IEEE/ACM transactions on audio, speech, and language processing·2024
Same journal

Glottal Airflow Estimation using Neck Surface Acceleration and Low-Order Kalman Smoothing.

IEEE/ACM transactions on audio, speech, and language processing·2023
Same journal

Bilateral Cochlear Implant Processing of Coding Strategies With CCi-MOBILE, an Open-Source Research Platform.

IEEE/ACM transactions on audio, speech, and language processing·2023
Same journal

Robust Vocal Quality Feature Embeddings for Dysphonic Voice Detection.

IEEE/ACM transactions on audio, speech, and language processing·2023
See all related articles

Related Experiment Video

Updated: Oct 22, 2025

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion
13:11

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Published on: February 10, 2023

1.7K

Towards Robust Speech Super-resolution.

Heming Wang1, DeLiang Wang2

  • 1Department of Computer Science and Engineering, The Ohio State University, OH 43210 USA.

IEEE/ACM Transactions on Audio, Speech, and Language Processing
|August 30, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a convolutional neural network (CNN) for speech super-resolution (SR), enhancing low-resolution audio by generating high-frequency components. The novel CNN model outperforms existing deep neural network (DNN) methods and improves robustness to various microphone and downsampling conditions.

Keywords:
Speech super-resolutionbandwidth extensionconvolutional neural networkrobust speech super-resolution

More Related Videos

Test Samples for Optimizing STORM Super-Resolution Microscopy
16:52

Test Samples for Optimizing STORM Super-Resolution Microscopy

Published on: September 6, 2013

31.3K
Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform
06:25

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Published on: February 12, 2014

8.6K

Related Experiment Videos

Last Updated: Oct 22, 2025

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion
13:11

Super-Resolution Imaging of Bacterial Secreted Proteins Using Genetic Code Expansion

Published on: February 10, 2023

1.7K
Test Samples for Optimizing STORM Super-Resolution Microscopy
16:52

Test Samples for Optimizing STORM Super-Resolution Microscopy

Published on: September 6, 2013

31.3K
Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform
06:25

Time Multiplexing Super Resolving Technique for Imaging from a Moving Platform

Published on: February 12, 2014

8.6K

Area of Science:

  • Signal Processing
  • Machine Learning
  • Audio Engineering

Background:

  • Speech super-resolution (SR) aims to reconstruct high-frequency components of degraded speech signals.
  • Existing deep neural network (DNN) models for SR face challenges with robustness to variations in microphone channels and downsampling methods.

Purpose of the Study:

  • To propose a novel convolutional neural network (CNN) based speech super-resolution model.
  • To leverage both time and frequency domain information for improved SR performance.
  • To investigate and enhance the robustness of SR models against diverse real-world conditions.

Main Methods:

  • A time-domain CNN is developed, accepting raw low-resolution speech waveforms as input.
  • A cross-domain loss function is employed during network training for optimization.
  • The proposed CNN model is compared against several established DNN-based SR approaches.

Main Results:

  • The proposed CNN-based SR model demonstrates superior performance compared to existing DNN models.
  • The study investigates the robustness of DNN-based SR models concerning microphone channels and downsampling schemes.
  • Improved generalization capabilities are achieved for untrained microphone channels and unknown downsampling schemes through proper training and preprocessing.

Conclusions:

  • The developed CNN model offers an effective solution for speech super-resolution, outperforming current DNN methods.
  • Addressing robustness issues is crucial for practical deployment of SR systems.
  • Strategic training and preprocessing enhance the adaptability of SR models to varied audio acquisition conditions.