Hindsight Biases
Purposive Learning
Observational Studies
Longitudinal Research
Properties of Laplace Transform-II
Longitudinal Studies
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
1Gatsby Computational Neuroscience Unit, UCL, London, WC1N 3AR, UK. dayan@gatsby.ucl.ac.uk
Monkeys sometimes choose poorly when rewards are delayed. A standard reinforcement learning model, using average reward per step as a baseline, can explain this behavior and improve predictions.
Area of Science:
Background:
Purpose of the Study:
Main Methods:
Main Results:
Conclusions: