Decision Making: P-value Method
Observational Learning
Reinforcement
Propagation of Uncertainty from Systematic Error
Expected Value
Reinforcement Schedules
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Nov 2, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
Published on: September 10, 2018
This study introduces a distributional soft actor-critic (DSAC) algorithm to reduce Q-value overestimations in reinforcement learning (RL). DSAC improves policy performance in continuous control tasks by learning return distributions.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: