Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Acoustic-to-phonetic mapping using recurrent neural networks.

M D Hanes1, S C Ahalt, A K Krishnamurthy

  • 1Dept. of Electr. Eng., Ohio State Univ., Columbus, OH.

IEEE Transactions on Neural Networks
|January 1, 1994
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Optical implementation and applications of closest-vector selection in neural networks.

Applied optics·2010
Same author

Comparison between single-step and balloon dilatational tracheostomy.

British journal of anaesthesia·2010
Same author

Diffusion approximation of frequency sensitive competitive learning.

IEEE transactions on neural networks·1997
Same author

Codeword distribution for frequency sensitive competitive learning with one-dimensional input data.

IEEE transactions on neural networks·1996
Same author

Intensity-weighted average of instantaneous frequency as a model for frequency discrimination.

The Journal of the Acoustical Society of America·1993
Same author

A critical review of electroglottography.

Critical reviews in biomedical engineering·1985
Same journal

Universal perceptron and DNA-like learning algorithm for binary neural networks: LSBF and PBF implementations.

IEEE transactions on neural networks·2013
Same journal

Guest editorial: special section on white box nonlinear prediction models.

IEEE transactions on neural networks·2011
Same journal

Data-based fault-tolerant control of high-speed trains with traction/braking notch nonlinearities and actuator failures.

IEEE transactions on neural networks·2011
Same journal

Guest editorial: special section on data-based control, modeling, and optimization.

IEEE transactions on neural networks·2011
Same journal

Neural network-based multiple robot simultaneous localization and mapping.

IEEE transactions on neural networks·2011
Same journal

Data-driven model-free adaptive control for a class of MIMO nonlinear discrete-time systems.

IEEE transactions on neural networks·2011
See all related articles

Artificial neural networks effectively map acoustic speech signals to phonetic representations. Simple Elman recurrent networks accurately identify consonants and vowels across varying speech tempos.

Area of Science:

  • Speech Recognition
  • Artificial Intelligence
  • Acoustic Phonetics

Background:

  • Speech recognition systems require accurate acoustic-to-phonetic mapping.
  • Temporal dynamics of speech signals are crucial for accurate phonetic interpretation.
  • Previous methods often struggle with variable speech rates.

Purpose of the Study:

  • To apply artificial neural networks for acoustic-to-phonetic mapping.
  • To evaluate the performance of Elman simple recurrent networks on this task.
  • To assess the networks' ability to handle different speech tempos.

Main Methods:

  • Utilized Elman simple recurrent neural networks.
  • Trained networks using standard backpropagation techniques.
  • Experimented with formant contour data from speech spoken at slow and normal tempos.

Related Experiment Videos

Main Results:

  • Elman networks successfully performed acoustic-to-phonetic mapping.
  • Networks accurately mapped formant contours to CVC syllables.
  • Demonstrated capability in consonant discrimination and vowel identification.

Conclusions:

  • Simple recurrent neural networks are effective for acoustic-to-phonetic tasks.
  • The proposed method is robust to variations in speech rate.
  • This approach offers a viable solution for real-time speech recognition.