Kazuyuki Samejima1, Kenji Doya, Mitsuo Kawato
1Human information science laboratories, ATR International, 2-2-2 Hikaridai, Seika, Soraku, Kyoto 619-0288, Japan. samejima@atr.co.jp
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
We introduce a novel modular reward method to improve hierarchical reinforcement learning (RL). This approach enhances sub-task independence and overall policy optimality in complex tasks.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: