Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

740
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
740
Instinctive Drift01:05

Instinctive Drift

1.5K
Instinctive drift refers to the tendency of animals to revert to their innate behaviors despite repeated reinforcement. Breland and Breland demonstrated this concept in an experiment with a raccoon. The raccoon was trained to pick up two coins and place them in a container in exchange for food. Initially, the raccoon learned to associate the coins with food, making them a conditioned stimulus or a substitute for food. However, over time, the raccoon became less willing to put the coins into the...
1.5K
Reinforcement01:23

Reinforcement

1.2K
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
1.2K
Operant Conditioning01:21

Operant Conditioning

2.6K
Operant conditioning, a key concept in behavioral psychology, involves using reinforcement and punishment to alter the likelihood of a behavior being repeated. B.F. introduced this type of conditioning. Skinner focused on voluntary behaviors and the consequences that follow them, influencing whether these behaviors will be strengthened or diminished.
Reinforcement in operant conditioning can be positive or negative, both of which serve to increase the likelihood of a behavior. Positive...
2.6K
Timing and Consequences on Behavior01:08

Timing and Consequences on Behavior

927
In operant conditioning, the timing of reinforcement is crucial. For animals like rats and cats, immediate reinforcement (within a few seconds) is much more effective than delayed reinforcement. For example, a food reward for a rat needs to follow within 30 seconds of pressing a bar to be effective. 
Humans, however, can respond to delayed reinforcers. We often make decisions between immediate small rewards and delayed larger rewards. This ability to delay gratification is a significant...
927
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

2.1K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
2.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Polarity-dependent modulation of sleep oscillations and cortical excitability in aging.

Frontiers in aging neuroscience·2026
Same author

Neurocomputational modeling of rule abstraction and memorization during probabilistic stimulus-reward learning.

iScience·2026
Same author

On the robustness of the emergent spatiotemporal dynamics in biophysically realistic and phenomenological whole-brain models at multiple network resolutions.

Frontiers in network physiology·2025
Same author

Assessing the Long-Term Stability of the Spielberger State-Trait Inventory Trait Scale over 3.5 Years.

Journal of personality assessment·2025
Same author

Insight predicts subsequent memory via cortical representational change and hippocampal activity.

Nature communications·2025
Same author

Neural dynamics underlying the cue validity effect in target conflict resolution.

Cerebral cortex (New York, N.Y. : 1991)·2025
Same journal

A Model-Free Reinforcement Learning Implementation of Decision Making Under Uncertainty by Sequential Sampling.

Neural computation·2026
Same journal

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same journal

Hierarchical Active Inference Using Successor Representations.

Neural computation·2026
Same journal

W-Kernel and Its Principal Space for Frequentist Evaluation of Bayesian Estimators.

Neural computation·2026
Same journal

A Hidden Markov Model-Inspired Sequence Classification Method for Hyperdimensional Computing.

Neural computation·2026
Same journal

Sparse Graphical Modeling for Electrophysiological Phase-Based Connectivity Using Circular Statistics.

Neural computation·2026
See all related articles

Related Experiment Video

Updated: May 1, 2026

A Conflict Model of Reward-seeking Behavior in Male Rats
06:11

A Conflict Model of Reward-seeking Behavior in Male Rats

Published on: February 20, 2019

6.8K

Risk-sensitive reinforcement learning.

Yun Shen1, Michael J Tobia, Tobias Sommer

  • 1Technical University, 10587 Berlin, Germany yun@ni.tu-berlin.de.

Neural Computation
|April 9, 2014
PubMed
Summary
This summary is machine-generated.

This study introduces risk-sensitive reinforcement learning to model human decision-making under uncertainty. The new method accurately captures prospect theory predictions and shows neural correlates in the brain.

More Related Videos

Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

10.7K
Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

20.9K

Related Experiment Videos

Last Updated: May 1, 2026

A Conflict Model of Reward-seeking Behavior in Male Rats
06:11

A Conflict Model of Reward-seeking Behavior in Male Rats

Published on: February 20, 2019

6.8K
Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

10.7K
Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

20.9K

Area of Science:

  • Neuroscience
  • Cognitive Science
  • Machine Learning

Background:

  • Sequential decision-making in uncertain environments is a complex cognitive task.
  • Traditional reinforcement learning models often assume risk neutrality, failing to capture human behavior.
  • Prospect theory describes systematic deviations from rational choice, including risk preferences and probability weighting.

Discussion:

  • Applying utility functions to temporal difference errors in reinforcement learning allows for risk-sensitive agents.
  • This approach models human risk preferences for gains and losses, aligning with prospect theory.
  • A novel risk-sensitive Q-learning algorithm is derived and proven to converge, even with unknown transition probabilities.

Key Insights:

  • The risk-sensitive reinforcement learning framework significantly improves the fit to human behavioral data in sequential investment tasks.
  • It provides an interpretation of subject responses consistent with prospect theory's principles.
  • Risk-sensitive temporal difference errors correlate with brain activity in the ventral striatum, suggesting a neural basis for risk-sensitive learning.

Outlook:

  • This framework offers a powerful tool for quantifying and understanding human decision-making under risk.
  • Future research can explore applications in various domains, including economics, psychology, and artificial intelligence.
  • Investigating the neural underpinnings of risk-sensitive learning can lead to more sophisticated computational models of the brain.