Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Natural Selection and Adaptation01:15

Natural Selection and Adaptation

386
Natural selection, a fundamental concept in evolutionary biology, is the mechanism by which evolution is driven, favoring organisms that are best adapted to their environments. This process enhances their chances of survival and reproduction. Adaptation, a key outcome of this process, involves genetic modifications that optimize an organism's functionality under specific environmental challenges, such as extreme cold or thinner air at high altitudes.
Beyond physical adaptations,...
386
Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

1.9K
Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
1.9K
Behavior Modification01:21

Behavior Modification

236
Behavioral approaches have often been criticized for ignoring mental processes and focusing solely on observable behavior. However, these approaches provide an optimistic perspective for individuals seeking to change their behaviors. Rather than concentrating on intrinsic personality traits, behavioral approaches suggest that even longstanding habits can be modified by changing the reward contingencies that maintain them.
A real-world application of operant conditioning principles is applied...
236
Observational Learning01:12

Observational Learning

314
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
314
Law of Effect01:06

Law of Effect

1.6K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.6K
Evolutionary Psychology01:20

Evolutionary Psychology

424
Evolutionary psychology explores the origins of human behavior and mental processes by framing them within the context of natural selection, a theory famously propounded by Charles Darwin. This field asserts that many behaviors common across human societies — ranging from instinctive fear reactions to complex social interactions — arose as evolutionary adaptations. These adaptations enhanced the survival and reproductive success of our ancestors, thereby becoming embedded in the...
424

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Interpretable abstractions of artificial neural networks predict behavior and neural activity during human information gathering.

Nature neuroscience·2026
Same author

No effect of acute pain or self-reported chronic pain on working memory in the Sternberg task.

Scientific reports·2026
Same author

Modeling the journey as well as the destination: a control theory account of rotational navigation.

bioRxiv : the preprint server for biology·2026
Same author

Decomposing trust-related decision making: Dimensionality and predictability of phishing susceptibility in an adult lifespan sample.

The journals of gerontology. Series B, Psychological sciences and social sciences·2026
Same author

Learning to select computations in recurrent neural circuits.

bioRxiv : the preprint server for biology·2026
Same author

Planning in the Brain: It's Not What You Think It Is.

Annual review of neuroscience·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Sep 12, 2025

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation
08:04

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Published on: August 23, 2017

8.3K

Human Strategy Adaptation in Reinforcement Learning Resembles Policy Gradient Ascent.

Hua-Dong Xiong1, Li Ji-An2, Robert C Wilson1

  • 1School of Psychology, Georgia Institute of Technology.

Biorxiv : the Preprint Server for Biology
|August 6, 2025
PubMed
Summary
This summary is machine-generated.

Humans adapt their learning strategies over time, similar to gradient-based optimization. A new framework, DynamicRL, quantifies these learning strategy changes, showing improved reward acquisition and bridging biological and artificial intelligence concepts.

Keywords:
Cognitive ModelingDecision MakingMeta-LearningReinforcement Learning

More Related Videos

New Variations for Strategy Set-shifting in the Rat
09:45

New Variations for Strategy Set-shifting in the Rat

Published on: January 23, 2017

8.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.8K

Related Experiment Videos

Last Updated: Sep 12, 2025

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation
08:04

Using a Split-belt Treadmill to Evaluate Generalization of Human Locomotor Adaptation

Published on: August 23, 2017

8.3K
New Variations for Strategy Set-shifting in the Rat
09:45

New Variations for Strategy Set-shifting in the Rat

Published on: January 23, 2017

8.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.8K

Area of Science:

  • Cognitive Science
  • Artificial Intelligence
  • Computational Neuroscience

Background:

  • Adapting learning strategies is crucial for intelligence but lacks quantitative frameworks in biological agents.
  • Existing computational models often assume fixed strategies or use task-optimized networks, failing to explain strategy refinement through experience.

Purpose of the Study:

  • To develop a quantitative framework for characterizing how biological agents adapt their learning strategies.
  • To investigate if human strategy adaptation resembles gradient-based optimization principles.

Main Methods:

  • Introduced DynamicRL, a neural network framework to track evolving learning parameters (learning rates, decision temperatures) in participants.
  • Evaluated DynamicRL across four diverse bandit tasks.

Main Results:

  • DynamicRL outperformed traditional reinforcement learning models with fixed parameters.
  • Human learning strategy adaptation showed trajectories that systematically increased expected rewards.
  • Strategy parameter updates aligned with policy gradient ascent directions and operated across multiple timescales.

Conclusions:

  • Humans dynamically adapt their reinforcement learning strategies, aligning with gradient-based optimization principles.
  • The DynamicRL framework provides a generalizable method for studying meta-learning trajectories in biological agents.
  • This research bridges theories of biological and artificial intelligence by quantifying adaptive behavior optimization through experience.