Reinforcement Schedules
Sampling Continuous Time Signal
Entropy Change in Reversible Processes
Random Sampling Method
Randomized Experiments
Random Variables
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Jul 11, 2025

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
Published on: June 1, 2015
A new reinforcement learning (RL) method, continuous dynamic policy programming (CDPP), improves learning stability and sample efficiency for continuous actions. It uses relative entropy regularization for better exploration and policy updates in complex tasks.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: