Machine Learning-Based Estimation of Hoarseness Severity Using Acoustic Signals Recorded During High-Speed

Tobias Schraut¹, Michael Döllinger¹, Melda Kunduk²

¹Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany.

Journal of Voice : Official Journal of the Voice Foundation

January 4, 2025

Summary

This summary is machine-generated.

Machine learning models trained on acoustic signals recorded during high-speed videoendoscopy (HSV) show potential for assessing hoarseness severity. However, recordings from voice therapy sessions yield more reliable results, because practical constraints of the oral laryngeal examination limit the quality of the HSV recordings.

Keywords:

Machine learning, High-speed videoendoscopy, Voice disorders, Acoustic analysis, Voice quality, Sustained vowel

Area of Science:

Laryngology
Speech Pathology
Biomedical Engineering
Machine Learning

Background:

Assessing hoarseness severity is crucial for diagnosing voice disorders.
Machine learning offers a promising avenue for objective voice analysis.
High-speed videoendoscopy (HSV) provides detailed laryngeal visualization, but whether the acoustic recordings made during HSV are useful for hoarseness assessment is still under investigation.

Purpose of the Study:

To investigate the efficacy of sustained phonations recorded during HSV for machine learning-based hoarseness severity assessment.
To compare the performance of HSV-derived acoustic recordings with conventional recordings from voice therapy sessions.
To identify key differences and limitations of HSV-derived acoustic data for voice analysis.

Main Methods:

A database of 617 voice recordings (250 ms) from HSV examinations was created.
Comparison databases included 809 vowels from voice therapy sessions (1-second and 250 ms durations).
From each recording, 490 acoustic features were extracted; machine learning models were then developed to classify hoarseness severity, using expert auditory-perceptual ratings as the reference.
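
The classification step can be illustrated with a minimal sketch. It is not the authors' pipeline: it assumes a toy data set with two made-up features per recording (the study used 490), binary labels instead of the expert severity ratings, and plain gradient descent as the training procedure. All function and variable names are hypothetical.

```python
import math


def sigmoid(z):
    """Logistic function mapping a real-valued score to (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic_regression(features, labels, learning_rate=0.5, epochs=2000):
    """Fit weights and bias by batch gradient descent on the logistic loss."""
    n_samples = len(features)
    n_features = len(features[0])
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        grad_w = [0.0] * n_features
        grad_b = 0.0
        for x, y in zip(features, labels):
            score = sum(w * v for w, v in zip(weights, x)) + bias
            error = sigmoid(score) - y  # derivative of the loss w.r.t. the score
            for j in range(n_features):
                grad_w[j] += error * x[j]
            grad_b += error
        weights = [w - learning_rate * g / n_samples
                   for w, g in zip(weights, grad_w)]
        bias -= learning_rate * grad_b / n_samples
    return weights, bias


def predict(weights, bias, x):
    """Return class 1 if the predicted probability is at least 0.5, else 0."""
    score = sum(w * v for w, v in zip(weights, x)) + bias
    return 1 if sigmoid(score) >= 0.5 else 0


# Hypothetical toy data: two invented acoustic features per recording.
# Label 0 = lower rated hoarseness, label 1 = higher rated hoarseness.
toy_features = [[0.1, 0.2], [0.2, 0.1], [0.9, 0.8], [0.8, 0.9]]
toy_labels = [0, 0, 1, 1]

weights, bias = train_logistic_regression(toy_features, toy_labels)
predictions = [predict(weights, bias, x) for x in toy_features]
```

In practice a library implementation with feature scaling and cross-validation would be the more likely choice; the hand-written loop is used here only to keep the example self-contained.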

Main Results:

Logistic regression models achieved classification accuracies of 0.863 (VT-1), 0.847 (VT-2), and 0.742 (HS), where VT-1 and VT-2 denote the voice therapy databases and HS the HSV database.
Correlations between predicted and subjective hoarseness scores were 0.797 (VT-1), 0.763 (VT-2), and 0.637 (HS).
The correlation between changes in quantitative and subjective ratings was significantly lower for HSV recordings (0.088) than for voice therapy recordings.
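
The three kinds of figures reported above can be computed as in the following sketch. It assumes that the correlation is a Pearson correlation and that a "change" is the difference between two sessions of the same speaker; the source states neither, and all function names and numbers are invented for illustration.

```python
from math import sqrt


def accuracy(predicted, reference):
    """Fraction of recordings whose predicted class equals the rated class."""
    if len(predicted) != len(reference):
        raise ValueError("inputs must have the same length")
    return sum(p == r for p, r in zip(predicted, reference)) / len(reference)


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    if len(x) != len(y):
        raise ValueError("inputs must have the same length")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


def changes(first_session, second_session):
    """Per-speaker difference between the second and the first session."""
    return [b - a for a, b in zip(first_session, second_session)]


# Hypothetical values, not data from the study.
predicted_class = [0, 1, 2, 3, 1]
rated_class = [0, 1, 2, 2, 1]
class_accuracy = accuracy(predicted_class, rated_class)

# Hypothetical continuous scores for four speakers in two sessions.
model_first, model_second = [2.1, 1.4, 0.6, 2.8], [1.5, 1.6, 0.4, 1.9]
rater_first, rater_second = [2.0, 1.0, 1.0, 3.0], [1.0, 2.0, 0.0, 2.0]

score_correlation = pearson(model_first, rater_first)
change_correlation = pearson(changes(model_first, model_second),
                             changes(rater_first, rater_second))
```

A low value of the last quantity, such as the 0.088 reported for HSV recordings, would mean that the model's score changes barely track the changes in the raters' scores, even when the scores themselves correlate reasonably well.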

Conclusions:

Acoustic signals from HSV show potential for quantitative hoarseness assessment but are less reliable than voice therapy recordings.
Practical challenges during oral laryngeal examination limit the quality of HSV-derived acoustic recordings.
Future improvements, potentially using flexible nasal endoscopy, could enhance the utility of HSV for voice assessment.