Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

222
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
222
Reinforcement01:23

Reinforcement

300
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
300
Observational Learning01:12

Observational Learning

250
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
250
Operant Conditioning01:21

Operant Conditioning

1.7K
Operant conditioning, a key concept in behavioral psychology, involves using reinforcement and punishment to alter the likelihood of a behavior being repeated. B.F. introduced this type of conditioning. Skinner focused on voluntary behaviors and the consequences that follow them, influencing whether these behaviors will be strengthened or diminished.
Reinforcement in operant conditioning can be positive or negative, both of which serve to increase the likelihood of a behavior. Positive...
1.7K
Steps in the Modeling Process01:14

Steps in the Modeling Process

263
Albert Bandura's theory of observational learning identifies four critical processes: attention, retention, motor reproduction, and reinforcement or motivation.
Attention is the first necessary component for observational learning. It involves focusing on what the model is doing and saying. For example, if you decide to take a drawing class to enhance your skills, you need to pay close attention to the instructor's words and hand movements. The characteristics of the model significantly...
263
Law of Effect01:06

Law of Effect

1.5K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Flexible route planning and rapid structure learning by mice in complex environments.

bioRxiv : the preprint server for biology·2026
Same author

Gradually, then suddenly: the precarious position of UK preclinical neuroscience.

Brain : a journal of neurology·2026
Same author

Between planning and map building: Prioritizing replay when future goals are uncertain.

Neuron·2025
Same author

Single neurons and networks in the mouse claustrum integrate input from widespread cortical sources.

eLife·2025
Same author

Do goldfish like to be informed?

Proceedings. Biological sciences·2025
Same author

Under pressure: UK preclinical neuroscience at a crossroads.

Brain and neuroscience advances·2025
Same journal

The default mode network: where spontaneous thought meets memory consolidation.

Current opinion in behavioral sciences·2026
Same journal

Brain activity is not only for thinking.

Current opinion in behavioral sciences·2026
Same journal

Role of striatal dopamine signaling in compulsive eating associated with obesity.

Current opinion in behavioral sciences·2026
Same journal

Timescales of dopamine release in the striatum as a window into hierarchical control.

Current opinion in behavioral sciences·2026
Same journal

Contributions of default mode network to subjective valuation and maladaptive decision making.

Current opinion in behavioral sciences·2026
Same journal

Evolution of neural circuits in the origin of behavioral novelty.

Current opinion in behavioral sciences·2026
See all related articles

Related Experiment Video

Updated: Aug 2, 2025

Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

11.0K

What is dopamine doing in model-based reinforcement learning?

Thomas Akam1, Mark E Walton1

  • 1Department of Experimental Psychology, Oxford University, Oxford, UK.

Current Opinion in Behavioral Sciences
|April 21, 2023
PubMed
Summary
This summary is machine-generated.

Dopamine

More Related Videos

The 6-hydroxydopamine Rat Model of Parkinson's Disease
08:10

The 6-hydroxydopamine Rat Model of Parkinson's Disease

Published on: October 27, 2021

11.0K
Murine Model for Parkinson's Disease: from 6-OH Dopamine Lesion to Behavioral Test
08:06

Murine Model for Parkinson's Disease: from 6-OH Dopamine Lesion to Behavioral Test

Published on: January 15, 2010

31.7K

Related Experiment Videos

Last Updated: Aug 2, 2025

Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

11.0K
The 6-hydroxydopamine Rat Model of Parkinson's Disease
08:10

The 6-hydroxydopamine Rat Model of Parkinson's Disease

Published on: October 27, 2021

11.0K
Murine Model for Parkinson's Disease: from 6-OH Dopamine Lesion to Behavioral Test
08:06

Murine Model for Parkinson's Disease: from 6-OH Dopamine Lesion to Behavioral Test

Published on: January 15, 2010

31.7K

Area of Science:

  • Neuroscience
  • Computational Neuroscience
  • Reinforcement Learning

Background:

  • Dopamine's role in reinforcement learning (RL) is well-established, primarily linked to reward prediction errors (RPEs) in model-free RL.
  • However, recent experiments suggest dopamine's involvement in model-based RL, a finding that challenges existing theories.
  • This discrepancy necessitates a closer examination of dopamine's function in complex learning paradigms.

Purpose of the Study:

  • To investigate the mechanisms underlying dopamine's role in model-based reinforcement learning (RL).
  • To evaluate two competing hypotheses explaining dopamine's involvement in model-based RL.
  • To reconcile the known function of dopamine in model-free RL with its emerging role in model-based RL.

Main Methods:

  • The study theoretically examines two potential explanations for dopamine's function in model-based RL.
  • Hypothesis 1: Dopamine neurons encode a prediction error for updating the successor representation.
  • Hypothesis 2: Combined effects of RPEs and surprise signals explain dopamine's involvement.

Main Results:

  • The study provides a theoretical framework for understanding dopamine's function in model-based RL.
  • It explores how dopamine might contribute to learning predictive state representations (successor representation).
  • It considers the interplay between reward prediction errors and surprise signals in dopaminergic activity.

Conclusions:

  • Dopamine's involvement in model-based RL can be explained through its role in updating predictive representations or through a combination of RPEs and surprise.
  • These findings offer a more comprehensive understanding of dopamine's multifaceted role in reinforcement learning.
  • Further experimental validation is needed to distinguish between the proposed mechanisms.