1Department of Statistics, University of Michigan, Ann Arbor, MI 48109-1107, USA.
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
This study analyzes Q-learning with function approximation for policy learning from single datasets. It establishes an upper bound on generalization error, crucial for effective decision-making in complex systems.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: