Updated: Jun 15, 2025

Measuring Delay Discounting in Humans Using an Adjusting Amount Task
Published on: January 9, 2016
Offline reinforcement learning (RL) methods can be suboptimal due to pessimism. This study introduces a de-pessimism (DEP) operator for accurate Q-value estimation, improving policy learning in offline RL.