Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Larynx

Larynx

The human larynx, often referred to as the voice box, is an intricate organ located in the neck. It serves as a pathway for air to enter the lungs during respiration and is an essential component of voice production.
Anatomy of the Larynx
The larynx consists of various components, including cartilage, muscles, and vocal cords. Its structure includes three large unpaired cartilages—the thyroid, cricoid, and epiglottis—and three smaller paired cartilages—the arytenoids,...

Suctioning the Oropharyngeal Airway

Suctioning the Oropharyngeal Airway

In preparing for oropharyngeal airway suctioning, a nurse must gather all necessary equipment, including a suction unit with tubing, a prepackaged suction kit, sterile gloves, water or saline for irrigation, a water-soluble lubricant, and additional personal protective equipment (such as a gown, mask, and goggles) to control infections.
After assembling the equipment, the nurse should practice hand hygiene and don appropriate PPE according to infection control guidelines to avoid the...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Sleep Apnea

Sleep Apnea

Sleep apnea is a condition where breathing stops intermittently during sleep, often leading to significant health issues. Each episode can last from 10 to 20 seconds or more and is frequently accompanied by a brief arousal from sleep. This disturbance, largely unnoticed by the individual, can lead to severe daytime fatigue. Commonly, individuals seek help after being informed by their partners about loud snoring and noticeable breathing pauses during sleep.
The condition is more prevalent among...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Generalisable artificial intelligence ECG trained on public data for outcome prediction after transcatheter aortic valve replacement.

Heart (British Cardiac Society)·2026

Same author

EffortNet: A Deep Learning Framework for Objective Assessment of Speech Enhancement Technologies Using EEG-Based Alpha Oscillations.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2026

Same author

Speaker-dependent laser Doppler vibrometer-based voice conversion for dysarthric speech under noisy conditions.

JASA express letters·2026

Same author

Postoperative Pain Following a Retroauricular Approach Versus a Transcanal Approach in Tympanoplasty Type 1: A 14-Day Retrospective Study.

Diagnostics (Basel, Switzerland)·2026

Same author

Temporal variance mapping with machine learning for label-free 3D chromatin imaging using optical interferometric microscopy.

Biomedical optics express·2026

Same author

Multistream Deep Learning Models Using Multimodal Optical Coherence Tomography for Predicting Visual Impairment in Epiretinal Membrane.

American journal of ophthalmology·2026

Same journal

Highly Accelerated 1-mm Isotropic 3D Chemical Exchange Saturation Transfer MRI Using Wave-Co-CAIPI at 5 Tesla.

IEEE transactions on bio-medical engineering·2026

Same journal

Systematic Evaluation of Hip Exoskeleton Assistance Parameters for Enhancing Gait Stability During Ground Slip Perturbations.

IEEE transactions on bio-medical engineering·2026

Same journal

SleepConFormer: A Single-Channel EEG Framework for Sleep Staging and Consciousness Assessment in Patients with Disorders of Consciousness.

IEEE transactions on bio-medical engineering·2026

Same journal

Modeling Partial and Total Support of Left Ventricular Assist Device for Discrete Hemodynamic Control Framework.

IEEE transactions on bio-medical engineering·2026

Same journal

A Low-Cost Wearable TI-TACS Stimulator With Bipolar Quadratic-Boost Converter for Current Stimulation Validation in the Rat Brain.

IEEE transactions on bio-medical engineering·2026

Same journal

EMG-Based Gait Estimation Using Koopman-Inspired Method.

IEEE transactions on bio-medical engineering·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 9, 2026

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Joint Dictionary Learning-Based Non-Negative Matrix Factorization for Voice Conversion to Improve Speech

Szu-Wei Fu¹, Pei-Chun Li², Ying-Hui Lai³

¹Department of Computer Science and Information EngineeringNational Taiwan University.

IEEE Transactions on Bio-Medical Engineering

|December 28, 2016

Summary

This summary is machine-generated.

This study introduces a new machine learning method, joint dictionary learning based non-negative matrix factorization (JD-NMF), to improve speech clarity for surgical patients. The JD-NMF technique enhances intelligibility even with limited training data, offering a significant advancement for communication.

Keywords:

Data models Dictionaries Spectrogram Speech Surgery Training Training data

More Related Videos

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Related Experiment Videos

Last Updated: Mar 9, 2026

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Asthma Detection Research Based on Voice Signal Processing and Machine Learning

Published on: July 22, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Area of Science:

Machine Learning
Speech Processing
Bioacoustics

Background:

Surgical removal of articulators can lead to distorted speech, impacting patient communication.
Existing voice conversion (VC) methods face challenges with limited training data and require rapid conversion for practical use.

Purpose of the Study:

To develop an effective machine learning-based voice conversion (VC) technique for improving speech intelligibility in surgical patients.
To address the limitations of small training datasets and the need for efficient conversion in post-operative communication.

Main Methods:

A novel joint dictionary learning based non-negative matrix factorization (JD-NMF) algorithm is proposed.
The JD-NMF method is designed for efficient and effective VC with limited training data.

Main Results:

The JD-NMF method significantly improves short-time objective intelligibility (STOI) scores compared to original speech.
Experimental results show JD-NMF is more efficient and effective than conventional exemplar-based NMF VC methods.

Conclusions:

The proposed JD-NMF method demonstrates superior performance in enhancing speech intelligibility for oral surgery patients.
The joint training criterion for NMF-based VC is validated, confirming the effectiveness of JD-NMF.