Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

211
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
211
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

563
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
563
Reinforcement Schedules01:24

Reinforcement Schedules

148
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
148
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

268
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
268
Instinctive Drift01:05

Instinctive Drift

225
Instinctive drift refers to the tendency of animals to revert to their innate behaviors despite repeated reinforcement. Breland and Breland demonstrated this concept in an experiment with a raccoon. The raccoon was trained to pick up two coins and place them in a container in exchange for food. Initially, the raccoon learned to associate the coins with food, making them a conditioned stimulus or a substitute for food. However, over time, the raccoon became less willing to put the coins into the...
225
Timing and Consequences on Behavior01:08

Timing and Consequences on Behavior

95
In operant conditioning, the timing of reinforcement is crucial. For animals like rats and cats, immediate reinforcement (within a few seconds) is much more effective than delayed reinforcement. For example, a food reward for a rat needs to follow within 30 seconds of pressing a bar to be effective. 
Humans, however, can respond to delayed reinforcers. We often make decisions between immediate small rewards and delayed larger rewards. This ability to delay gratification is a significant...
95

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Latent subdimensions of anxiety and depression differentially influence exertion of effort in pursuit of reward versus avoidance of threat.

Translational psychiatry·2026
Same author

Neural signatures of model-based and model-free reinforcement learning across prefrontal cortex and striatum.

eLife·2026
Same author

Children and adults differ in how primary and secondary incentives modulate valuation, effort, and cognitive control.

PloS one·2026
Same author

Effects of overt and covert attention on decision-making dynamics in prefrontal cortex.

bioRxiv : the preprint server for biology·2026
Same author

Functional reorganization of motor cortex connectivity during learning.

bioRxiv : the preprint server for biology·2026
Same author

Rethinking how research is reviewed and published.

eLife·2026
Same journal

Connectomic evidence that ordered activity drives neuromuscular network formation.

Nature neuroscience·2026
Same journal

Noninvasive decoding of typed sentences from human brain activity.

Nature neuroscience·2026
Same journal

Striatal control of amygdalar acetylcholine release during salience-associated processing.

Nature neuroscience·2026
Same journal

Mitochondrial stress response drives microglial senescence.

Nature neuroscience·2026
Same journal

Conditioned accumbal dopamine transients forecast individual preference for drug versus natural rewards and compulsive behavior.

Nature neuroscience·2026
Same journal

The mitochondrial unfolded protein response in human microglia disrupts neuronal-glial communication and promotes senescence.

Nature neuroscience·2026
See all related articles

Related Experiment Video

Updated: Jul 6, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning
09:22

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

14.7K

Distributional reinforcement learning in prefrontal cortex.

Timothy H Muller1,2, James L Butler3,4, Sebastijan Veselic3,4,5

  • 1Department of Experimental Psychology, University of Oxford, Oxford, UK. timothymuller127@gmail.com.

Nature Neuroscience
|January 10, 2024
PubMed
Summary
This summary is machine-generated.

Distributional reinforcement learning (RL) better explains brain activity related to reward-guided learning in the anterior cingulate cortex than classic RL theories. This suggests a common mechanism for how the brain learns from rewards.

More Related Videos

Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

20.9K
An Operant Intra-/Extra-dimensional Set-shift Task for Mice
08:35

An Operant Intra-/Extra-dimensional Set-shift Task for Mice

Published on: January 22, 2016

12.2K

Related Experiment Videos

Last Updated: Jul 6, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning
09:22

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

14.7K
Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

20.9K
An Operant Intra-/Extra-dimensional Set-shift Task for Mice
08:35

An Operant Intra-/Extra-dimensional Set-shift Task for Mice

Published on: January 22, 2016

12.2K

Area of Science:

  • Neuroscience
  • Computational Neuroscience
  • Cognitive Science

Background:

  • The prefrontal cortex is vital for learning and decision-making.
  • Classic reinforcement learning (RL) theories focus on expected rewards and explain prefrontal cortex neural data.
  • Distributional RL accounts for the full distribution of rewards and better explains dopamine responses.

Purpose of the Study:

  • To investigate whether distributional RL provides a better explanation for neuronal responses in the macaque anterior cingulate cortex compared to classic RL.
  • To determine if distributional RL represents a common mechanism for reward-guided learning across different brain regions.

Main Methods:

  • Analysis of neuronal recordings from the anterior cingulate cortex in macaques.
  • Comparison of the explanatory power of classic RL and distributional RL models for neural data.
  • Modeling of reward-based learning processes.

Main Results:

  • Distributional RL models provided a superior explanation for anterior cingulate cortex neuronal responses compared to classic RL models.
  • The findings indicate that distributional RL captures key aspects of reward processing in this brain region.
  • This suggests that learning the full distribution of outcomes is a fundamental aspect of reward-guided behavior.

Conclusions:

  • Distributional reinforcement learning offers a more comprehensive framework for understanding neural mechanisms of reward-guided learning.
  • The anterior cingulate cortex utilizes distributional RL principles, suggesting this is a widespread mechanism in the brain.
  • These findings advance our understanding of decision-making and learning processes in the brain.