Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Natural Selection and Adaptation

Natural Selection and Adaptation

Natural selection, a fundamental concept in evolutionary biology, is the mechanism by which evolution is driven, favoring organisms that are best adapted to their environments. This process enhances their chances of survival and reproduction. Adaptation, a key outcome of this process, involves genetic modifications that optimize an organism's functionality under specific environmental challenges, such as extreme cold or thinner air at high altitudes.
Beyond physical adaptations,...

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Behavior Modification

Behavior Modification

Behavioral approaches have often been criticized for ignoring mental processes and focusing solely on observable behavior. However, these approaches provide an optimistic perspective for individuals seeking to change their behaviors. Rather than concentrating on intrinsic personality traits, behavioral approaches suggest that even longstanding habits can be modified by changing the reward contingencies that maintain them.
A real-world application of operant conditioning principles is applied...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

Evolutionary Psychology

Evolutionary Psychology

Evolutionary psychology explores the origins of human behavior and mental processes by framing them within the context of natural selection, a theory famously propounded by Charles Darwin. This field asserts that many behaviors common across human societies — ranging from instinctive fear reactions to complex social interactions — arose as evolutionary adaptations. These adaptations enhanced the survival and reproductive success of our ancestors, thereby becoming embedded in the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Interpretable abstractions of artificial neural networks predict behavior and neural activity during human information gathering.

Nature neuroscience·2026

Same author

No effect of acute pain or self-reported chronic pain on working memory in the Sternberg task.

Scientific reports·2026

Same author

Modeling the journey as well as the destination: a control theory account of rotational navigation.

bioRxiv : the preprint server for biology·2026

Same author

Decomposing trust-related decision making: Dimensionality and predictability of phishing susceptibility in an adult lifespan sample.

The journals of gerontology. Series B, Psychological sciences and social sciences·2026

Same author

Learning to select computations in recurrent neural circuits.

bioRxiv : the preprint server for biology·2026

Same author

Planning in the Brain: It's Not What You Think It Is.

Annual review of neuroscience·2026

Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026

Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026

Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026

Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026

Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026

Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 12, 2025

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Published on: August 23, 2017

Human Strategy Adaptation in Reinforcement Learning Resembles Policy Gradient Ascent.

Hua-Dong Xiong¹, Li Ji-An², Robert C Wilson¹

¹School of Psychology, Georgia Institute of Technology.

Biorxiv : the Preprint Server for Biology

|August 6, 2025

Summary

This summary is machine-generated.

Humans adapt their learning strategies over time, similar to gradient-based optimization. A new framework, DynamicRL, quantifies these learning strategy changes, showing improved reward acquisition and bridging biological and artificial intelligence concepts.

Keywords:

Cognitive Modeling Decision Making Meta-Learning Reinforcement Learning

More Related Videos

New Variations for Strategy Set-shifting in the Rat

New Variations for Strategy Set-shifting in the Rat

Published on: January 23, 2017

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Related Experiment Videos

Last Updated: Sep 12, 2025

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Published on: August 23, 2017

New Variations for Strategy Set-shifting in the Rat

New Variations for Strategy Set-shifting in the Rat

Published on: January 23, 2017

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Area of Science:

Cognitive Science
Artificial Intelligence
Computational Neuroscience

Background:

Adapting learning strategies is crucial for intelligence but lacks quantitative frameworks in biological agents.
Existing computational models often assume fixed strategies or use task-optimized networks, failing to explain strategy refinement through experience.

Purpose of the Study:

To develop a quantitative framework for characterizing how biological agents adapt their learning strategies.
To investigate if human strategy adaptation resembles gradient-based optimization principles.

Main Methods:

Introduced DynamicRL, a neural network framework to track evolving learning parameters (learning rates, decision temperatures) in participants.
Evaluated DynamicRL across four diverse bandit tasks.

Main Results:

DynamicRL outperformed traditional reinforcement learning models with fixed parameters.
Human learning strategy adaptation showed trajectories that systematically increased expected rewards.
Strategy parameter updates aligned with policy gradient ascent directions and operated across multiple timescales.

Conclusions:

Humans dynamically adapt their reinforcement learning strategies, aligning with gradient-based optimization principles.
The DynamicRL framework provides a generalizable method for studying meta-learning trajectories in biological agents.
This research bridges theories of biological and artificial intelligence by quantifying adaptive behavior optimization through experience.