Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A hybrid optimized framework with energy shape prior segmentation for brain tumor detection in MRI images.

Digital health·2026
Same author

Digital twin-assisted blockchain IoT security model using contrastive and causal learning techniques.

Scientific reports·2026
Same author

Involvement of the PD-1 pathway in the modulation of immune responses during allergic diseases.

Inflammation research : official journal of the European Histamine Research Society ... [et al.]·2026
Same author

Diabetic retinopathy severity detection using an improved Whale optimization algorithm and convolutional Kolmogorov-Arnold network.

Frontiers in medicine·2026
Same author

A Novel Hybrid CNN-ViT-Based Bi-Directional Cross-Guidance Fusion-Driven Breast Cancer Detection Model.

Life (Basel, Switzerland)·2026
Same author

Hybrid GAN-LSTM framework for diabetic foot ulcer image synthesis and automated diagnosis.

Frontiers in medicine·2026
Same journal

Revolutionizing Transcriptomics: From Single-Cell Insights to RNA-based Interventions.

SLAS technology·2026
Same journal

Smartphone-based colorimetric glucose biosensor using peroxidase-like activity of bimetallic catalyst supported onto graphitic carbon nitride nanosheets.

SLAS technology·2026
Same journal

XVCF: Exquisite visualization of VCF data from genomic experiments.

SLAS technology·2026
Same journal

EasyPip: An equipment-agnostic software application to transform automated liquid handlers into efficient walk-up tools for routine plate-based pipetting.

SLAS technology·2026
Same journal

Identification of ubiquitination-related biomarkers in osteoarthritis: Combining transcriptome and Mendelian randomization analysis.

SLAS technology·2026
Same journal

Miniaturization of a Lumit p-ERK immunoassay for cell-based high-throughput screening of a chemogenetic small-molecule library.

SLAS technology·2026
See all related articles

Related Experiment Video

Updated: May 23, 2025

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.3K

Image classification-driven speech disorder detection using deep learning technique.

Nasser Ali Aljarallah1, Ashit Kumar Dutta1, Abdul Rahaman Wahab Sait2

  • 1Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh, 13713, Saudi Arabia.

SLAS Technology
|March 8, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces an automated speech disorder detection (SDD) model using Mel-spectrogram classification. The novel approach achieves 99.1% accuracy, offering efficient and accessible diagnostic tools for speech impairments.

Keywords:
Assistive technologiesDeep learningFeature extractionImage classificationSpeech disordersVison transformer

More Related Videos

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

935
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K

Related Experiment Videos

Last Updated: May 23, 2025

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application
05:56

Objectification of Tongue Diagnosis in Traditional Medicine, Data Analysis, and Study Application

Published on: April 14, 2023

2.3K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

935
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.4K

Area of Science:

  • Medical Imaging
  • Artificial Intelligence
  • Speech Pathology

Background:

  • Speech disorders significantly impact communication, social interaction, education, and quality of life.
  • Early and precise diagnosis is crucial for successful intervention, but current clinical examinations are time-consuming and subjective.
  • Automated Speech Disorder Detection (SDD) models are needed to overcome the limitations of manual clinical assessments.

Purpose of the Study:

  • To propose an automated speech disorder detection (SDD) model based on Mel-spectrogram image classification.
  • To identify multiple speech disorders accurately and efficiently using advanced machine learning techniques.
  • To enhance the accessibility and efficiency of diagnostic tools for speech impairments.

Main Methods:

  • Mel-spectrograms were generated from voice samples using a Wavelet Transform (WT) hybridization technique.
  • A LEVIT transformer was employed for enhanced feature extraction from the Mel-spectrograms.
  • An ensemble learning (EL) approach, combining CatBoost, XGBoost, and Extremely Randomized Tree, was used for classification. Quantization-aware training (QAT) was utilized to reduce computational resources.

Main Results:

  • The proposed model achieved an exceptional accuracy of 99.1% on the VOICED and LANNA datasets.
  • The model demonstrated efficiency with a limited number of parameters (8.2 million).
  • Shapley Additive Explanations (SHAP) values were used to ensure model interpretability.

Conclusions:

  • The developed automated SDD model significantly enhances speech disorder classification accuracy and efficiency.
  • The approach offers promising prospects for developing accessible and reliable diagnostic tools.
  • Future research can integrate multimodal data for broader application across languages and dialects, enabling real-time clinical and telehealth deployment.