Y Bengio1, P Simard, P Frasconi
1Dept. d'Inf. et de Recherche Oper., Montreal Univ., Que.
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Training recurrent neural networks for long-term dependencies is challenging for gradient descent algorithms. This study explains why and explores alternative learning methods for improved sequence modeling.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: