Image classification-driven speech disorder detection using deep learning technique.

Nasser Ali Aljarallah¹, Ashit Kumar Dutta¹, Abdul Rahaman Wahab Sait²

¹Department of Computer Science and Information Systems, College of Applied Sciences, AlMaarefa University, Ad Diriyah, Riyadh, 13713, Saudi Arabia.

SLAS Technology

March 8, 2025

Summary

This summary is machine-generated.

This study introduces an automated speech disorder detection (SDD) model that classifies Mel-spectrogram images of voice samples. The approach reports 99.1% accuracy on the VOICED and LANNA datasets with 8.2 million parameters, pointing toward efficient and accessible diagnostic tools for speech impairments.

Keywords:

Assistive technologies; Deep learning; Feature extraction; Image classification; Speech disorders; Vision transformer

Area of Science:

Medical Imaging
Artificial Intelligence
Speech Pathology

Background:

Speech disorders significantly impact communication, social interaction, education, and quality of life.
Early and precise diagnosis is crucial for successful intervention, but current clinical examinations are time-consuming and subjective.
Automated Speech Disorder Detection (SDD) models are needed to overcome the limitations of manual clinical assessments.

Purpose of the Study:

To propose an automated speech disorder detection (SDD) model based on Mel-spectrogram image classification.
To identify multiple speech disorders accurately and efficiently using advanced machine learning techniques.
To enhance the accessibility and efficiency of diagnostic tools for speech impairments.

Main Methods:

Mel-spectrograms were generated from voice samples using a Wavelet Transform (WT) hybridization technique.
A LeViT transformer was used to extract features from the Mel-spectrograms.
An ensemble learning (EL) approach combining CatBoost, XGBoost, and Extremely Randomized Trees was used for classification.
Quantization-aware training (QAT) was used to reduce computational resource requirements.
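
As a rough illustration of the first step, the sketch below builds a standard triangular Mel filterbank and applies it to one power-spectrum frame. It is not the authors' Wavelet Transform hybridization, which the summary does not describe in detail; it only shows the conventional Hz-to-Mel mapping (the common 2595 * log10(1 + f / 700) form). The function names, the 16 kHz sample rate, and the filter counts are assumptions chosen for the example.

```python
import math


def hz_to_mel(freq_hz: float) -> float:
    """Map a frequency in Hz to the Mel scale (2595 * log10(1 + f / 700))."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_mels: int, n_fft: int, sample_rate: int) -> list:
    """Return n_mels triangular filters over the n_fft // 2 + 1 FFT bins."""
    n_bins = n_fft // 2 + 1
    top_mel = hz_to_mel(sample_rate / 2.0)
    # n_mels + 2 edge frequencies, evenly spaced on the Mel scale.
    edges_hz = [mel_to_hz(top_mel * i / (n_mels + 1)) for i in range(n_mels + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(1, n_mels + 1):
        lo, mid, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            if lo < f <= mid:
                weight = (f - lo) / (mid - lo)   # rising slope
            elif mid < f < hi:
                weight = (hi - f) / (hi - mid)   # falling slope
            else:
                weight = 0.0
            row.append(weight)
        bank.append(row)
    return bank


def apply_filterbank(power_spectrum: list, bank: list) -> list:
    """Collapse one power-spectrum frame into Mel-band energies."""
    return [sum(w * p for w, p in zip(row, power_spectrum)) for row in bank]


# Hypothetical settings: 8 Mel bands, 512-point FFT, 16 kHz audio.
bank = mel_filterbank(n_mels=8, n_fft=512, sample_rate=16000)
flat_frame = [1.0] * (512 // 2 + 1)
mel_energies = apply_filterbank(flat_frame, bank)
```

Stacking the Mel-band energies of successive frames, usually on a log scale, gives the two-dimensional image that an image classifier such as a vision transformer can consume.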

Main Results:

The proposed model achieved 99.1% accuracy on the VOICED and LANNA datasets.
The model is compact, with 8.2 million parameters.
SHapley Additive exPlanations (SHAP) values were used to support model interpretability.
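
To put the parameter count and the use of QAT in perspective, the sketch below estimates weight storage at two precisions and shows the quantize-then-dequantize ("fake quantization") step that QAT typically applies to weights during the forward pass. The summary does not state the bit width or quantization scheme, so the symmetric per-tensor 8-bit scheme here is an assumption, and all function names are hypothetical.

```python
def quantize_int8(weights: list) -> tuple:
    """Symmetric per-tensor quantization to signed 8-bit integers (assumed scheme)."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def fake_quantize(weights: list) -> list:
    """Quantize then dequantize, so training sees the rounding error."""
    quantized, scale = quantize_int8(weights)
    return [q * scale for q in quantized]


def weight_megabytes(n_params: int, bits_per_weight: int) -> float:
    """Storage for the weights alone, in megabytes (1 MB = 10**6 bytes)."""
    return n_params * bits_per_weight / 8 / 1e6


N_PARAMS = 8_200_000  # parameter count reported in the summary

fp32_mb = weight_megabytes(N_PARAMS, 32)  # 32-bit floating-point weights
int8_mb = weight_megabytes(N_PARAMS, 8)   # 8-bit integer weights (assumption)
```

Under these assumptions the weights take about 32.8 MB in 32-bit floats and about 8.2 MB in 8-bit integers, a fourfold reduction; the rounding error on each weight is at most half the quantization step.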

Conclusions:

The developed automated SDD model significantly enhances speech disorder classification accuracy and efficiency.
The approach offers promising prospects for developing accessible and reliable diagnostic tools.
Future research can integrate multimodal data for broader application across languages and dialects, enabling real-time clinical and telehealth deployment.