Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reducing Fiber-Induced Honeycomb Artifacts and Low-Light Noise in Nasal High-Speed Video Laryngoscopy: A Fast, Deterministic, Open-Source Approach.

Journal of voice : official journal of the Voice Foundation·2026
Same author

Investigation on vocal damping and epithelial oscillation regarding laryngeal mechanisms M2d and M3.

JASA express letters·2026
Same author

Dynamic 3D MRI of vocal fold oscillations: In vivo assessment of vocal fold thickness, contact area, and glottal area waveform across phonation types in comparison with high-speed imaging.

The Journal of the Acoustical Society of America·2026
Same author

Influence of Execution Speed of an Ascending Glissando on Vocal Stability of Vocally Untrained and Professionally Trained Subjects.

Journal of voice : official journal of the Voice Foundation·2026
Same author

Glottal Area Waveform Measurements for Healthy Female and Male Speakers in Typical, High-Frequency, and Soft Phonation.

Journal of speech, language, and hearing research : JSLHR·2026
Same author

Physics-informed neural network for predicting in vacuo vocal fold eigenmodes: A proof of concept study.

JASA express letters·2026
Same journal

Reliability of Auditory-Perceptual Voice Evaluation: An Investigation of GRBAS Scale Reproducibility Among Expert Raters.

Journal of voice : official journal of the Voice Foundation·2026
Same journal

Vocal Health of Pastors in the Protestant Church in Germany.

Journal of voice : official journal of the Voice Foundation·2026
Same journal

Assessment of the Motor Speech Profile Tremor Analysis in Vocal Tremor Diagnosis.

Journal of voice : official journal of the Voice Foundation·2026
Same journal

Psychometric and Acoustic Investigation of Identity and Phonotraumatic Vocal Hyperfunction.

Journal of voice : official journal of the Voice Foundation·2026
Same journal

"I Go but I Don't Participate": A Scoping Review With Thematic Synthesis of the Experiences of Voice Disorders in Adulthood.

Journal of voice : official journal of the Voice Foundation·2026
Same journal

Voice Changes in Laryngeal Intraepithelial Neoplasia and Nonneoplastic Lesions.

Journal of voice : official journal of the Voice Foundation·2026
See all related articles

Related Experiment Video

Updated: May 7, 2025

Minimally Invasive Murine Laryngoscopy for Close-Up Imaging of Laryngeal Motion During Breathing and Swallowing
01:00

Minimally Invasive Murine Laryngoscopy for Close-Up Imaging of Laryngeal Motion During Breathing and Swallowing

Published on: December 1, 2023

424

Machine Learning-Based Estimation of Hoarseness Severity Using Acoustic Signals Recorded During High-Speed

Tobias Schraut1, Michael Döllinger1, Melda Kunduk2

  • 1Division of Phoniatrics and Pediatric Audiology at the Department of Otorhinolaryngology, Head and Neck Surgery, University Hospital Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg, 91054 Erlangen, Germany.

Journal of Voice : Official Journal of the Voice Foundation
|January 4, 2025
PubMed
Summary
This summary is machine-generated.

Machine learning models using acoustic signals from high-speed videoendoscopy (HSV) show potential for assessing hoarseness severity. However, recordings from voice therapy sessions are more reliable due to practical limitations during oral laryngeal examinations.

Keywords:
Machine learning—High-speed videoendoscopy—Voice disorders—Acoustic analysis—Voice quality—Sustained vowel.

More Related Videos

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

34.2K
Hemi-laryngeal Setup for Studying Vocal Fold Vibration in Three Dimensions
10:13

Hemi-laryngeal Setup for Studying Vocal Fold Vibration in Three Dimensions

Published on: November 25, 2017

10.9K

Related Experiment Videos

Last Updated: May 7, 2025

Minimally Invasive Murine Laryngoscopy for Close-Up Imaging of Laryngeal Motion During Breathing and Swallowing
01:00

Minimally Invasive Murine Laryngoscopy for Close-Up Imaging of Laryngeal Motion During Breathing and Swallowing

Published on: December 1, 2023

424
A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

34.2K
Hemi-laryngeal Setup for Studying Vocal Fold Vibration in Three Dimensions
10:13

Hemi-laryngeal Setup for Studying Vocal Fold Vibration in Three Dimensions

Published on: November 25, 2017

10.9K

Area of Science:

  • Laryngology
  • Speech Pathology
  • Biomedical Engineering
  • Machine Learning

Background:

  • Assessing hoarseness severity is crucial for diagnosing voice disorders.
  • Machine learning offers a promising avenue for objective voice analysis.
  • High-speed videoendoscopy (HSV) provides detailed laryngeal visualization but its acoustic recordings' utility for hoarseness assessment is under investigation.

Purpose of the Study:

  • To investigate the efficacy of sustained phonations recorded during HSV for machine learning-based hoarseness severity assessment.
  • To compare the performance of HSV-derived acoustic recordings with conventional recordings from voice therapy sessions.
  • To identify key differences and limitations of HSV-derived acoustic data for voice analysis.

Main Methods:

  • A database of 617 voice recordings (250 ms) from HSV examinations was created.
  • Comparison databases included 809 vowels from voice therapy sessions (1-second and 250 ms durations).
  • Extracted 490 acoustic features, developed machine learning models, and classified hoarseness severity based on expert auditory-perceptual ratings.

Main Results:

  • Logistic regression models achieved classification accuracies of 0.863 (VT-1), 0.847 (VT-2), and 0.742 (HS).
  • Correlation between predicted and subjective hoarseness scores was 0.797 (VT-1), 0.763 (VT-2), and 0.637 (HS).
  • Correlation between changes in quantitative and subjective ratings was significantly lower for HSV recordings (0.088) compared to voice therapy recordings.

Conclusions:

  • Acoustic signals from HSV show potential for quantitative hoarseness assessment but are less reliable than voice therapy recordings.
  • Practical challenges during oral laryngeal examination limit the quality of HSV-derived acoustic recordings.
  • Future improvements, potentially using flexible nasal endoscopy, could enhance the utility of HSV for voice assessment.