Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Force Classification

Force Classification

Forces play a crucial role in the study of physics and engineering. They are essential in describing the motion, behavior, and equilibrium of objects in the physical world. Forces can be classified based on their origin, type, and direction of action.
Contact and non-contact forces are two of the most widely used categories of forces. As the name suggests, contact forces require physical contact between two objects to act upon each other. Examples of contact forces include frictional,...

Perceiving Loudness, Pitch, and Location

Perceiving Loudness, Pitch, and Location

The human brain perceives pitch through two primary mechanisms reflected in place theory and frequency theory. Each mechanism describes how sound waves are interpreted as specific pitches by the brain, offering insights into the intricate processes of auditory perception.
Place theory, or place coding, suggests that different pitches are heard because various sound waves activate specific locations along the cochlea's basilar membrane. The brain determines the pitch of a sound by...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Differential Leveling

Differential Leveling

Differential leveling is a precise method in surveying used to determine the elevation difference between two points. Its primary goal is to establish accurate vertical measurements to create level surfaces or grade lines critical for designing and constructing infrastructures such as roads, bridges, and buildings.The procedure for differential leveling begins with setting up and leveling the instrument at a point where the benchmark can be seen. The level rod is held on the benchmark (BM), and...

Improving Translational Accuracy

Improving Translational Accuracy

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Longitudinal speech and gross motor function development in children and adolescents with cerebral palsy.

Developmental medicine and child neurology·2026

Same author

How Do Infants Eat at Home? A Preliminary Study of Complementary Feeding Skills in the Naturalistic Environment.

American journal of speech-language pathology·2026

Same author

Does the Use of Crowdsourced Listeners Yield Different Speech Intelligibility Results Than In-Person Listeners for Typically Developing Children?

Journal of speech, language, and hearing research : JSLHR·2026

Same author

Apple AirPods Pro 2 Live Listen as an Assistive Listening Device.

American journal of audiology·2026

Same author

A Scoping Review of Oral Feeding Skill Development in Typically Developing Children Part II: Exploring Variability in Oral Feeding Skill Descriptions.

American journal of speech-language pathology·2025

Same author

Characterizing the Relationship Between the Intelligibility in Context Scale and Transcription Intelligibility in Typically Developing English-Speaking Children Between Ages 2;6 and 9;11.

American journal of speech-language pathology·2025

Same journal

Age-Related Maturation of Antiphasic Arabic Digits-in-Noise Thresholds in Children.

Journal of speech, language, and hearing research : JSLHR·2026

Same journal

Case Studies of Auditory Processing Assessment and Management for Veterans.

Journal of speech, language, and hearing research : JSLHR·2026

Same journal

Effect of Acupuncture Combined With Computer-Assisted Cognitive Training on Language and Cognitive Functions in Poststroke Aphasia: A Randomized Controlled Trial.

Journal of speech, language, and hearing research : JSLHR·2026

Same journal

Understanding How Older Adults Comprehend Simple Comparative Sentences in a Predicate-Final Language.

Journal of speech, language, and hearing research : JSLHR·2026

Same journal

Perception of Synthesized Mandarin Speech Based on a Large-Scale Language Model Among Deaf Adults With Cochlear Implants.

Journal of speech, language, and hearing research : JSLHR·2026

Same journal

Measurement Variability of Peak Flow: A Laboratory Experiment Comparing Cough Testing Equipment.

Journal of speech, language, and hearing research : JSLHR·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 16, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

A Tunable Forced Alignment System Based on Deep Learning: Applications to Child Speech.

Prad Kadambi^1,2, Tristan J Mahr³, Katherine C Hustad⁴

¹School of Electrical, Computer and Energy Engineering, Arizona State University, Tempe.

Journal of Speech, Language, and Hearing Research : JSLHR

|March 31, 2025

Summary

This summary is machine-generated.

A new tool, Wav2TextGrid, enables trainable phonetic alignment for children's speech, improving accuracy over baseline methods. This system directly trains on manual alignments, enhancing clinical-grade speech analysis.

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Related Experiment Videos

Last Updated: May 16, 2025

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

Area of Science:

Speech processing
Computational linguistics
Phonetics

Background:

Phonetic forced alignment is crucial for automated speech analysis, especially for nonstandard speech like children's speech.
Manual alignment is the gold standard but is time-consuming and current tools do not support direct training on these alignments.

Purpose of the Study:

To develop a trainable, speaker-adaptive phonetic forced alignment system for children's speech.
To enable direct training on manual alignments for improved accuracy.

Main Methods:

Developed Wav2TextGrid, a neural forced aligner using a corpus of 42 neurotypical children (3-6 years old).
Evaluated performance on child speech and the TIMIT corpus to assess adaptability across age and dialects.

Main Results:

The Wav2TextGrid tool significantly improved alignment accuracy across all phoneme categories compared to baseline.
Achieved over 40% accuracy improvement for plosives and affricates in children's speech.
Matched existing methods with 13 minutes of labeled data; significant improvements seen with 45-60 minutes.

Conclusions:

Wav2TextGrid offers an alternate alignment workflow, tailoring forced alignments to match clinical-grade manual alignments.
The tool enhances the analysis of children's speech and other nonstandard speech patterns.