Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Characteristics of exopolysaccharides from Paecilomyces hepiali and their simulated digestion and fermentation in vitro by human intestinal microbiota.

International journal of biological macromolecules·2024
Same author

21 Gy single fraction prostate HDR brachytherapy: 5-year results of a single institution prospective pilot study.

Brachytherapy·2024
Same author

A comparison of the time course of action and laryngeal mask airway insertion conditions with different doses of mivacurium for day-case urologic surgery in children: a prospective cohort study.

Frontiers in pediatrics·2024
Same author

Quantitative evaluation of urban green exposure and its impact on human health: A case study on the 3-30-300 green space rule.

The Science of the total environment·2024
Same author

Promoting the process of determining brain death through standardized training.

Frontiers in neurology·2024
Same author

PM2.5 exposure-induced senescence-associated secretory phenotype in airway smooth muscle cells contributes to airway remodeling.

Environmental pollution (Barking, Essex : 1987)·2024
Same journal

Analysis of strength degradation of coal and rock masses and stability of mined areas under long term immersion environment.

PloS one·2026
Same journal

Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus.

PloS one·2026
Same journal

Preparation and physicochemical characterization of a biodegradable chitosan/carboxymethyl cellulose hydrogel synthesized in NaOH/urea medium.

PloS one·2026
Same journal

Action-guilt, survivor-guilt, and depression in combat-related PTSD.

PloS one·2026
Same journal

Explainable machine learning for predicting activities of daily living at discharge in stroke patients: A retrospective study using SHAP interpretability.

PloS one·2026
Same journal

Deep learning based two-way feature depiction model for brain tumor detection.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Dec 19, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

730

Filled pause refinement based on the pronunciation probability for lecture speech.

Yan-Hua Long1, Hong Ye1

  • 1Department of Electronical and Information Engineering, Shanghai Normal University, Shanghai, China.

Plos One
|April 11, 2015
PubMed
Summary
This summary is machine-generated.

This study introduces a new method to improve automatic speech recognition by detecting filled pauses (FPs) in lecture transcriptions. The approach refines transcriptions, enhancing accuracy for spontaneous speech.

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.9K
A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

35.7K

Related Experiment Videos

Last Updated: Dec 19, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

730
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.9K
A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS
12:43

A Protocol for Comprehensive Assessment of Bulbar Dysfunction in Amyotrophic Lateral Sclerosis ALS

Published on: February 21, 2011

35.7K

Area of Science:

  • Computational Linguistics
  • Speech Processing
  • Artificial Intelligence

Background:

  • Automatic speech recognition (ASR) struggles with disfluent speech, particularly lecture and conversational content.
  • Filled pauses (FPs) are common disfluencies that degrade ASR accuracy due to poor annotation and acoustic similarity to words.
  • Existing ASR systems lack robust methods for handling FPs in training data.

Purpose of the Study:

  • To propose and evaluate a novel automatic refinement approach for detecting filled pauses (FPs) in British English lecture speech.
  • To enhance the performance of ASR systems by addressing the challenge of disfluencies in spontaneous speech.
  • To improve the accuracy of speech transcription for lecture-based content.

Main Methods:

  • Developed a modified forced-alignment framework integrating pronunciation probabilities and acoustic language model scores.
  • Implemented an automatic refinement approach specifically designed for filled pause detection.
  • Evaluated the method on the Reith Lectures speech transcription task using imperfect training data.

Main Results:

  • Achieved successful results in filled pause detection for both development and evaluation datasets.
  • Demonstrated the effectiveness of the proposed FP refinement approach on ASR performance.
  • Investigated the impact of acoustic models trained on different speech genres on FP refinement.

Conclusions:

  • The proposed automatic refinement approach effectively detects filled pauses in British English lecture speech.
  • Enhancing ASR systems with FP detection improves transcription accuracy for spontaneous speech.
  • The method shows promise for improving ASR performance even with imperfect training transcriptions.