Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Force Classification01:22

Force Classification

1.1K
Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...
1.1K
Perceiving Loudness, Pitch, and Location01:21

Perceiving Loudness, Pitch, and Location

169
The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...
169
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

80
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
80
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

91
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
91
Differential Leveling01:12

Differential Leveling

103
Differential leveling is a precise method in surveying used to determine the elevation difference between two points. Its primary goal is to establish accurate vertical measurements to create level surfaces or grade lines critical for designing and constructing infrastructures such as roads, bridges, and buildings.The procedure for differential leveling begins with setting up and leveling the instrument at a point where the benchmark can be seen. The level rod is held on the benchmark (BM), and...
103
Improving Translational Accuracy02:07

Improving Translational Accuracy

2.5K
2.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Longitudinal speech and gross motor function development in children and adolescents with cerebral palsy.

Developmental medicine and child neurology·2026
Same author

How Do Infants Eat at Home? A Preliminary Study of Complementary Feeding Skills in the Naturalistic Environment.

American journal of speech-language pathology·2026
Same author

Does the Use of Crowdsourced Listeners Yield Different Speech Intelligibility Results Than In-Person Listeners for Typically Developing Children?

Journal of speech, language, and hearing research : JSLHR·2026
Same author

Apple AirPods Pro 2 Live Listen as an Assistive Listening Device.

American journal of audiology·2026
Same author

A Scoping Review of Oral Feeding Skill Development in Typically Developing Children Part II: Exploring Variability in Oral Feeding Skill Descriptions.

American journal of speech-language pathology·2025
Same author

Characterizing the Relationship Between the Intelligibility in Context Scale and Transcription Intelligibility in Typically Developing English-Speaking Children Between Ages 2;6 and 9;11.

American journal of speech-language pathology·2025
Same journal

Age-Related Maturation of Antiphasic Arabic Digits-in-Noise Thresholds in Children.

Journal of speech, language, and hearing research : JSLHR·2026
Same journal

Case Studies of Auditory Processing Assessment and Management for Veterans.

Journal of speech, language, and hearing research : JSLHR·2026
Same journal

Effect of Acupuncture Combined With Computer-Assisted Cognitive Training on Language and Cognitive Functions in Poststroke Aphasia: A Randomized Controlled Trial.

Journal of speech, language, and hearing research : JSLHR·2026
Same journal

Understanding How Older Adults Comprehend Simple Comparative Sentences in a Predicate-Final Language.

Journal of speech, language, and hearing research : JSLHR·2026
Same journal

Perception of Synthesized Mandarin Speech Based on a Large-Scale Language Model Among Deaf Adults With Cochlear Implants.

Journal of speech, language, and hearing research : JSLHR·2026
Same journal

Measurement Variability of Peak Flow: A Laboratory Experiment Comparing Cough Testing Equipment.

Journal of speech, language, and hearing research : JSLHR·2026
See all related articles

Related Experiment Video

Updated: May 16, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

381

A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.

Prad Kadambi1,2, Tristan J Mahr3, Katherine C Hustad4

  • 1School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe.

Journal of Speech, Language, and Hearing Research : JSLHR
|March 31, 2025
PubMed
Summary
This summary is machine-generated.

A new tool, Wav2TextGrid, enables trainable phonetic alignment for children's speech, improving accuracy over baseline methods. This system directly trains on manual alignments, enhancing clinical-grade speech analysis.

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.3K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

2.5K

Related Experiment Videos

Last Updated: May 16, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

381
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.3K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

2.5K

Area of Science:

  • Speech processing
  • Computational linguistics
  • Phonetics

Background:

  • Phonetic forced alignment is crucial for automated speech analysis, especially for nonstandard speech like children's speech.
  • Manual alignment is the gold standard but is time-consuming and current tools do not support direct training on these alignments.

Purpose of the Study:

  • To develop a trainable, speaker-adaptive phonetic forced alignment system for children's speech.
  • To enable direct training on manual alignments for improved accuracy.

Main Methods:

  • Developed Wav2TextGrid, a neural forced aligner using a corpus of 42 neurotypical children (3-6 years old).
  • Evaluated performance on child speech and the TIMIT corpus to assess adaptability across age and dialects.

Main Results:

  • The Wav2TextGrid tool significantly improved alignment accuracy across all phoneme categories compared to baseline.
  • Achieved over 40% accuracy improvement for plosives and affricates in children's speech.
  • Matched existing methods with 13 minutes of labeled data; significant improvements seen with 45-60 minutes.

Conclusions:

  • Wav2TextGrid offers an alternate alignment workflow, tailoring forced alignments to match clinical-grade manual alignments.
  • The tool enhances the analysis of children's speech and other nonstandard speech patterns.