Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

401
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
401
Fixed Action Patterns01:06

Fixed Action Patterns

17.3K
A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.
17.3K
Observational Learning01:12

Observational Learning

755
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
755
Reinforcement01:23

Reinforcement

751
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
751
Purposive Learning01:22

Purposive Learning

385
E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...
385
Associative Learning01:27

Associative Learning

1.1K
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Agreements and disagreements with resource-rational contractualism.

The Behavioral and brain sciences·2026
Same author

Cucurbituril-based anion-conducting membranes with supramolecular nanopores.

Nature·2026
Same author

Lipid oxidation and metabolism in relation to contaminants in polar bears from the Canadian high arctic and Hudson Bay.

Environmental pollution (Barking, Essex : 1987)·2025
Same author

Sustainability within Aotearoa New Zealand's aerospace sector: current state and implications for the future.

Journal of the Royal Society of New Zealand·2025
Same author

Inverse option generation: Inferences about others' values based on what comes to mind.

Cognition·2025
Same author

Introspective access to value-based multi-attribute choice processes.

Nature communications·2025
Same journal

Effects of integrating a structured design thinking strategy into generative AI-supported design learning on students' design achievement, creative self-efficacy, and problem-solving skills.

Frontiers in psychology·2026
Same journal

Fukushima treated water release and marine sports.

Frontiers in psychology·2026
Same journal

Mindful parenting and preschoolers' screen dependency behavior: the mediating role of parent-child relationship and the moderating role of effortful control.

Frontiers in psychology·2026
Same journal

Dynamic relationships among first-year university students' critical thinking, academic self-concept, and student engagement: a cross-lagged study.

Frontiers in psychology·2026
Same journal

The association between academic major identity and career decision-making difficulty among Chinese college students: a sequential indirect association model of psychological capital and career adaptability.

Frontiers in psychology·2026
Same journal

Job quality and fertility intentions among Chinese migrant workers: the role of traditional fertility beliefs.

Frontiers in psychology·2026
See all related articles

Related Experiment Video

Updated: Dec 31, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K

Model-Free RL or Action Sequences?

Adam Morris1, Fiery Cushman1

  • 1Department of Psychology, Harvard University, Cambridge, MA, United States.

Frontiers in Psychology
|January 11, 2020
PubMed
Summary
This summary is machine-generated.

Model-free reinforcement learning (MF RL) explains decision-making habits, challenging alternative models. This study provides evidence for MF RL and simultaneous model-based action sequencing in human behavior.

Keywords:
action sequencesdecision-makinghabitmodel-free controlreinforcement learning

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K
Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice
06:04

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

22.0K

Related Experiment Videos

Last Updated: Dec 31, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K
Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice
06:04

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

22.0K

Area of Science:

  • Cognitive Neuroscience
  • Computational Psychiatry
  • Behavioral Economics

Background:

  • Model-free reinforcement learning (MF RL) is a dominant computational framework for decision-making and habit formation.
  • Recent challenges propose model-based action sequencing as an alternative explanation for similar behavioral and neural patterns.
  • Dissociating these mechanisms is crucial for understanding habitual control.

Purpose of the Study:

  • To empirically differentiate between model-free reinforcement learning (MF RL) and model-based action sequencing.
  • To provide unconfounded evidence supporting the role of MF RL in human decision-making.
  • To investigate the simultaneous use of both MF RL and model-based strategies.

Main Methods:

  • Two experiments were designed to dissociate MF RL from model-based selection of action sequences.
  • Behavioral and neural data were collected to analyze decision-making patterns.
  • Analysis focused on identifying distinct signatures of MF RL and model-based control.

Main Results:

  • The study presents empirical evidence that dissociates MF RL from model-based action sequencing.
  • Results demonstrate that humans utilize MF RL for habitual control.
  • Evidence also shows simultaneous application of model-based selection of action sequences.

Conclusions:

  • MF RL plays a significant and distinct role in human decision-making and habit formation.
  • Humans employ dual mechanisms for habitual control: MF RL and model-based action sequencing.
  • These findings solidify the central position of MF RL in computational models of behavior.