Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

327
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
327
Reinforcement01:23

Reinforcement

603
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
603
Timing and Consequences on Behavior01:08

Timing and Consequences on Behavior

241
In operant conditioning, the timing of reinforcement is crucial. For animals like rats and cats, immediate reinforcement (within a few seconds) is much more effective than delayed reinforcement. For example, a food reward for a rat needs to follow within 30 seconds of pressing a bar to be effective. 
Humans, however, can respond to delayed reinforcers. We often make decisions between immediate small rewards and delayed larger rewards. This ability to delay gratification is a significant...
241
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

589
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
589
Cognitive Learning01:21

Cognitive Learning

835
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
835
Law of Effect01:06

Law of Effect

2.0K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
2.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Human-level learning of complex novel tasks as theory-based modelling, exploration and planning.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Artificial intelligence for science: The easy and hard problems.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Reconciling time and prediction error theories of associative learning.

Nature communications·2025
Same author

Quantifying the Cost of Context Sensitivity in Decision-Making.

Topics in cognitive science·2025
Same author

Bayesian estimation yields anti-Weber variability.

PNAS nexus·2025
Same author

On the Robustness and Provenance of the Gambler's Fallacy.

Psychological science·2025
Same journal

Prevalence and severity of mental health problems in early-career researchers: a systematic review and meta-analysis.

Nature human behaviour·2026
Same journal

Representativeness and response validity across nine opt-in online samples.

Nature human behaviour·2026
Same journal

The growing concentration of national influence in global science.

Nature human behaviour·2026
Same journal

Political polarization in low- and middle-income countries.

Nature human behaviour·2026
Same journal

Political segregation in the US workplace.

Nature human behaviour·2026
Same journal

Potential mechanisms and functional significance of aperiodic neural activity.

Nature human behaviour·2026
See all related articles

Related Experiment Video

Updated: Nov 19, 2025

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.6K

Multi-task reinforcement learning in humans.

Momchil S Tomov1,2, Eric Schulz3,4, Samuel J Gershman5,6,7

  • 1Program in Neuroscience, Harvard Medical School, Boston, MA, USA. mtomov@g.harvard.edu.

Nature Human Behaviour
|January 29, 2021
PubMed
Summary
This summary is machine-generated.

Humans learn complex tasks by applying past knowledge to new situations. This study shows people use policy mapping in multitask reinforcement learning, mirroring advanced algorithms.

More Related Videos

Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

21.3K
Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice
06:04

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

21.7K

Related Experiment Videos

Last Updated: Nov 19, 2025

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.6K
Operant Procedures for Assessing Behavioral Flexibility in Rats
08:30

Operant Procedures for Assessing Behavioral Flexibility in Rats

Published on: February 15, 2015

21.3K
Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice
06:04

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

21.7K

Area of Science:

  • Cognitive Science
  • Neuroscience
  • Artificial Intelligence

Background:

  • Human intelligence excels at transferring knowledge across tasks and generalizing to new ones.
  • Multitask reinforcement learning in humans remains under-investigated.
  • Understanding how humans learn and adapt in dynamic environments is crucial.

Purpose of the Study:

  • To investigate human behavior in multitask reinforcement learning.
  • To compare human strategies with computational models.
  • To elucidate the mechanisms of knowledge transfer in complex decision-making.

Main Methods:

  • Participants engaged in a two-step decision-making task with varying features and reward functions.
  • Human behavior was compared against two multitask reinforcement learning algorithms (policy mapping and value function approximation) and standard algorithms.
  • Data from three exploratory and one large confirmatory experiment were analyzed.

Main Results:

  • Participants capable of learning the task demonstrated a strategy involving mapping previously acquired policies to novel scenarios.
  • Human performance aligned with algorithms employing policy mapping for knowledge transfer.
  • The findings suggest a specific cognitive strategy for navigating complex, changing environments.

Conclusions:

  • Humans utilize policy mapping to generalize learning in multitask reinforcement learning scenarios.
  • This cognitive strategy is comparable to advanced artificial intelligence approaches.
  • The study enhances understanding of human adaptability and learning in dynamic, complex environments.