Automated Speech Audiometry: Can It Work Using Open-Source Pre-Trained Kaldi-NL Automatic Speech Recognition?

Gloria Araiza-Illan¹,², Luke Meyer¹,², Khiet P Truong³

¹Department of Otorhinolaryngology, Head and Neck Surgery, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands.

Trends in Hearing

March 14, 2024

Summary

This summary is machine-generated.

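This study introduces an automated digits-in-noise (DIN) hearing screening test that uses the Kaldi-NL automatic speech recognition toolkit to score spoken responses. The system transcribed responses with an average word error rate of 5.0%, and the remaining decoding errors had minimal impact on the measured speech reception threshold, which indicates potential for clinical use in hearing evaluations.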

Keywords:

automatic speech recognition; digits-in-noise test; speech audiometry; speech perception; speech-in-noise hearing test

Area of Science:

Audiology
Speech Processing
Computational Linguistics

Background:

The digits-in-noise (DIN) test is a valuable tool for hearing screening across diverse populations.
Current DIN test administration relies on human supervisors or manual response entry.
Automating the DIN test can enhance efficiency and accessibility in hearing assessments.

Purpose of the Study:

To develop and evaluate an automated digits-in-noise (DIN) test system using the Kaldi-NL toolkit for spoken response evaluation.
To assess the performance of the Kaldi-NL system in accurately transcribing spoken digits in noise.
To determine the impact of automated transcription errors on the speech reception threshold (SRT) output.

Main Methods:

An automated DIN test was developed utilizing the open-source Kaldi-NL automatic speech recognition toolkit.
Thirty self-reported normal-hearing Dutch adults participated in the study.
The system scored the spoken responses; its transcription performance was measured by word error rate (WER), and the effect of decoding errors on the SRT was estimated with bootstrapping simulations.
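
The WER mentioned above is conventionally defined as the number of word substitutions, deletions, and insertions needed to turn the recogniser output into the reference transcript, divided by the number of reference words. The sketch below computes it with a word-level edit distance. It illustrates the standard definition only and is not the scoring script used in the study; the function name `word_error_rate` and the example digit triplet are hypothetical.

```python
from typing import Sequence


def word_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """Return (substitutions + deletions + insertions) / number of reference words."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    # prev[j] is the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(reference)


# Hypothetical example: the participant said "twee vijf acht" (2 5 8)
# and the recogniser returned "twee vier acht" (2 4 8).
reference = "twee vijf acht".split()
hypothesis = "twee vier acht".split()
wer = word_error_rate(reference, hypothesis)
print(f"WER = {wer:.3f}")  # one substitution in three words -> WER = 0.333
```

Because a DIN triplet counts as correct only when all of its digits are repeated correctly, a single substituted digit is enough to mark the whole triplet as containing a decoding error, even though the WER for that triplet is only one third.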

Main Results:

The Kaldi-NL system demonstrated an average word error rate (WER) of 5.0% across participants.
An average of three triplets per participant contained decoding errors.
Simulations indicated that up to four triplets with decoding errors had minimal impact on the speech reception threshold (SRT); the resulting deviation remained within the typical variability of the test.
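
To give an intuition for why a handful of mis-scored triplets barely moves the SRT, the following Monte Carlo sketch simulates a one-up, one-down adaptive track and forces a chosen number of triplets to be scored as incorrect. It is not the bootstrapping procedure used in the study. Every parameter here is an assumption made for illustration: 24 triplets, a 2 dB step, a 0 dB starting SNR, an SRT taken as the mean SNR from the fifth triplet onwards, a logistic listener model with a true SRT of -8 dB SNR, and the simplification that a decoding error always turns the score into "incorrect".

```python
import math
import random
from statistics import mean

N_TRIALS = 24  # assumed number of triplets per test


def simulate_track(error_trials, srt_true=-8.0, slope=0.2,
                   start_snr=0.0, step_db=2.0, rng=None):
    """Run one simulated adaptive track and return the estimated SRT in dB SNR."""
    rng = rng or random.Random()
    snr = start_snr
    snrs = []
    for trial in range(N_TRIALS):
        snrs.append(snr)
        # Hypothetical listener: logistic psychometric function whose slope
        # at the 50% point is `slope` (proportion correct per dB).
        p_correct = 1.0 / (1.0 + math.exp(-4.0 * slope * (snr - srt_true)))
        scored_correct = rng.random() < p_correct
        if trial in error_trials:
            scored_correct = False  # assumed effect of a decoding error
        snr += -step_db if scored_correct else step_db
    snrs.append(snr)  # SNR the next triplet would have been presented at
    return mean(snrs[4:])  # discard the first four presentations


def mean_abs_shift(n_errors, n_runs=500, seed=0):
    """Average absolute SRT change caused by n_errors mis-scored triplets."""
    picker = random.Random(seed)
    shifts = []
    for run in range(n_runs):
        error_trials = set(picker.sample(range(N_TRIALS), n_errors))
        # The same seed is used for both tracks so that they differ only
        # through the injected decoding errors.
        clean = simulate_track(set(), rng=random.Random(run))
        noisy = simulate_track(error_trials, rng=random.Random(run))
        shifts.append(abs(noisy - clean))
    return mean(shifts)


for k in range(5):
    print(f"{k} decoding errors: mean |SRT shift| = {mean_abs_shift(k):.2f} dB")
```

The adaptive rule is self-correcting: a triplet wrongly scored as incorrect raises the SNR by one step, the listener then tends to answer correctly, and the track drifts back towards its equilibrium. Averaging over many presentations further dilutes the effect of any single error. The absolute numbers printed by this sketch depend entirely on the assumed parameters and should not be read as a reproduction of the reported results.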

Conclusions:

The proposed automated DIN test setup using Kaldi-NL is feasible for clinical applications.
The system shows promise for unsupervised hearing screening and assessment.
Further validation may confirm its utility in real-world audiological settings.