Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Fixed Action Patterns

Fixed Action Patterns

A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Purposive Learning

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Agreements and disagreements with resource-rational contractualism.

The Behavioral and brain sciences·2026

Same author

Cucurbituril-based anion-conducting membranes with supramolecular nanopores.

Nature·2026

Same author

Lipid oxidation and metabolism in relation to contaminants in polar bears from the Canadian high arctic and Hudson Bay.

Environmental pollution (Barking, Essex : 1987)·2025

Same author

Sustainability within Aotearoa New Zealand's aerospace sector: current state and implications for the future.

Journal of the Royal Society of New Zealand·2025

Same author

Inverse option generation: Inferences about others' values based on what comes to mind.

Cognition·2025

Same author

Introspective access to value-based multi-attribute choice processes.

Nature communications·2025

Same journal

Effects of integrating a structured design thinking strategy into generative AI-supported design learning on students' design achievement, creative self-efficacy, and problem-solving skills.

Frontiers in psychology·2026

Same journal

Fukushima treated water release and marine sports.

Frontiers in psychology·2026

Same journal

Mindful parenting and preschoolers' screen dependency behavior: the mediating role of parent-child relationship and the moderating role of effortful control.

Frontiers in psychology·2026

Same journal

Dynamic relationships among first-year university students' critical thinking, academic self-concept, and student engagement: a cross-lagged study.

Frontiers in psychology·2026

Same journal

The association between academic major identity and career decision-making difficulty among Chinese college students: a sequential indirect association model of psychological capital and career adaptability.

Frontiers in psychology·2026

Same journal

Job quality and fertility intentions among Chinese migrant workers: the role of traditional fertility beliefs.

Frontiers in psychology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 31, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Model-Free RL or Action Sequences?

Adam Morris¹, Fiery Cushman¹

¹Department of Psychology, Harvard University, Cambridge, MA, United States.

Frontiers in Psychology

|January 11, 2020

Summary

This summary is machine-generated.

Model-free reinforcement learning (MF RL) explains decision-making habits, challenging alternative models. This study provides evidence for MF RL and simultaneous model-based action sequencing in human behavior.

Keywords:

action sequences decision-making habit model-free control reinforcement learning

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

Related Experiment Videos

Last Updated: Dec 31, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Study Motor Skill Learning by Single-pellet Reaching Tasks in Mice

Published on: March 4, 2014

Area of Science:

Cognitive Neuroscience
Computational Psychiatry
Behavioral Economics

Background:

Model-free reinforcement learning (MF RL) is a dominant computational framework for decision-making and habit formation.
Recent challenges propose model-based action sequencing as an alternative explanation for similar behavioral and neural patterns.
Dissociating these mechanisms is crucial for understanding habitual control.

Purpose of the Study:

To empirically differentiate between model-free reinforcement learning (MF RL) and model-based action sequencing.
To provide unconfounded evidence supporting the role of MF RL in human decision-making.
To investigate the simultaneous use of both MF RL and model-based strategies.

Main Methods:

Two experiments were designed to dissociate MF RL from model-based selection of action sequences.
Behavioral and neural data were collected to analyze decision-making patterns.
Analysis focused on identifying distinct signatures of MF RL and model-based control.

Main Results:

The study presents empirical evidence that dissociates MF RL from model-based action sequencing.
Results demonstrate that humans utilize MF RL for habitual control.
Evidence also shows simultaneous application of model-based selection of action sequences.

Conclusions:

MF RL plays a significant and distinct role in human decision-making and habit formation.
Humans employ dual mechanisms for habitual control: MF RL and model-based action sequencing.
These findings solidify the central position of MF RL in computational models of behavior.