Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Videos

Input-output HMMs for sequence processing.

Y Bengio¹, P Frasconi

¹Dept. of Comput. Sci. and Oper. Res., Montreal Univ., Que.

IEEE Transactions on Neural Networks

|January 1, 1996

Summary

This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Universal autofocus for quantitative volumetric microscopy of whole mouse brains.

Nature methods·2021

Same author

Adaptive importance sampling to accelerate training of a neural probabilistic language model.

IEEE transactions on neural networks·2008

Same author

A general framework for adaptive processing of data structures.

IEEE transactions on neural networks·2008

Same author

Taking on the curse of dimensionality in joint distributions using neural networks.

IEEE transactions on neural networks·2008

Same author

Cost functions and model combination for VaR-based asset allocation using neural networks.

IEEE transactions on neural networks·2008

Same author

Experiments on the application of IOHMMs to model financial returns series.

IEEE transactions on neural networks·2008

Same journal

Universal perceptron and DNA-like learning algorithm for binary neural networks: LSBF and PBF implementations.

IEEE transactions on neural networks·2013

Same journal

Guest editorial: special section on white box nonlinear prediction models.

IEEE transactions on neural networks·2011

Same journal

Data-based fault-tolerant control of high-speed trains with traction/braking notch nonlinearities and actuator failures.

IEEE transactions on neural networks·2011

Same journal

Guest editorial: special section on data-based control, modeling, and optimization.

IEEE transactions on neural networks·2011

Same journal

Neural network-based multiple robot simultaneous localization and mapping.

IEEE transactions on neural networks·2011

Same journal

Data-driven model-free adaptive control for a class of MIMO nonlinear discrete-time systems.

IEEE transactions on neural networks·2011

See all related articles

We introduce a novel Input-Output Hidden Markov Model (IOHMM) for sequence processing. This recurrent neural network-like model excels at grammatical inference tasks, demonstrating strong generalization capabilities.

Area of Science:

Artificial Intelligence
Machine Learning
Computational Linguistics

Background:

Sequence processing is a fundamental challenge in AI and machine learning.
Traditional models like Hidden Markov Models (HMMs) have limitations in mapping input to output sequences.
Recurrent neural networks offer powerful sequence processing but can be complex to train.

Purpose of the Study:

To propose a novel discrete-state model for sequence processing that represents past context.
To introduce the Input-Output Hidden Markov Model (IOHMM) with a modular, recurrent connectionist architecture.
To demonstrate the effectiveness of IOHMMs in grammatical inference tasks.

Main Methods:

Developed a recurrent connectionist architecture with state-associated subnetworks.

Related Experiment Videos

Interpreted the model statistically as an Input-Output Hidden Markov Model (IOHMM).

Employed Estimation-Maximization (EM) or Generalized EM (GEM) algorithms for training, treating state trajectories as missing data.

Main Results:

IOHMMs enable mapping input sequences to output sequences, similar to recurrent neural networks.
The model utilizes a more discriminant learning paradigm compared to HMMs.
Experimental results on the seven Tomita grammars show excellent generalization capabilities for IOHMMs.

Conclusions:

IOHMMs provide a robust framework for sequence processing and grammatical inference.
The modular architecture and EM-based training facilitate efficient learning and adaptation.
IOHMMs represent a promising advancement for tasks requiring sequence-to-sequence mapping and pattern recognition.