Observational Learning
Regression Toward the Mean
Time-Domain Interpretation of PD Control
PD Controller: Design
Residuals and Least-Squares Property
PI Controller: Design
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
This study introduces Regularized Dual-Averaging policy gradient (RDA-PG), an actor-critic method for reinforcement learning. RDA-PG uses L1-regularization for feature selection, improving learning efficiency and performance on complex tasks.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: