¹JST, ICORP, Computational Brain Project, 4-1-8 Honcho, Kawaguchi, Saitama, 332-0012 Japan. xmorimo@atr.jp
This study introduces a reinforcement learning approach to estimating hidden states in nonlinear systems, using delayed penalties as the learning signal. Applied to pendulum dynamics and controllers, the method estimates the hidden states successfully and can learn the system dynamics at the same time.