Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Law of Effect01:06

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle boxes...
Purposive Learning01:22

Purposive Learning

E. C. Tolman emphasized the purposiveness of behavior — the idea that much of our behavior is goal-directed. For instance, employees who aim for a promotion work diligently to meet their targets. Tolman argued that when classical conditioning and operant conditioning occur, the organism acquires certain expectations. In classical conditioning, a child might fear a dog because they expect it to bite. In operant conditioning, a person might consistently work overtime because they expect a bonus...
Hierarchy of Motor Control01:18

Hierarchy of Motor Control

The hierarchy of motor control refers to the different levels of organization and processing involved in controlling movement in the body. These levels range from higher cortical areas involved in planning and decision-making to lower spinal cord reflexes that respond automatically to external stimuli.
Observational Learning01:12

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning because...
Cognitive Learning01:21

Cognitive Learning

Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
Fixed Action Patterns01:06

Fixed Action Patterns

A fixed action pattern (FAP) is a specific, hard-wired sequence of behaviors that occurs in response to an external stimulus, called a sign stimulus. The behavior is “fixed” because it is essentially unchangeable—proceeding similarly across individuals of a species every time it occurs.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

AI-Discovered Cognitive Models Reveal Novel Insights into Human and Animal Learning.

bioRxiv : the preprint server for biology·2026
Same author

Developmental Change in Structure Learning Reflects a Shift From Recency-Based to Relational Prediction.

Developmental science·2026
Same author

Fast efficient coding and sensory adaptation in gain-adaptive recurrent networks.

Nature communications·2026
Same author

Human-level learning of complex novel tasks as theory-based modelling, exploration and planning.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules.

Advances in neural information processing systems·2026
Same author

Probabilistic forecasting guides dynamic decisions.

Psychological review·2026
Same journal

A Matter of Parameters: Tailored Transcranial Focused Ultrasound Enhances Cortico-Thalamo-Cortical Circuit Resonance.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same journal

Proactive visual and motor prioritization differentially scale with cue reliability.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same journal

Erratum: Yao et al., "Estrogen Regulates Bcl-w and Bim Expression: Role in Protection against β-Amyloid Peptide-Induced Neuronal Death".

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same journal

Erratum: L'Episcopo et al., "Plasticity of Subventricular Zone Neuroprogenitors in MPTP (1-Methyl-4-Phenyl-1,2,3,6-Tetrahydropyridine) Mouse Model of Parkinson's Disease Involves Cross Talk between Inflammatory and Wnt/β-Catenin Signaling Pathways: Functional Consequences for Neuroprotection and Repair".

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same journal

Representations of subsecond duration-based timing by complex spike synchrony in cerebellar Purkinje neurons.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same journal

The extended language network: Language-responsive brain areas whose contributions to language remain to be discovered.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
See all related articles

Related Experiment Video

Updated: Jun 19, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Human reinforcement learning subdivides structured action spaces by learning effector-specific values.

Samuel J Gershman1, Bijan Pesaran, Nathaniel D Daw

  • 1Center for Neural Science, New York University, New York, New York 10003, USA. sjgershm@princeton.edu

The Journal of Neuroscience : the Official Journal of the Society for Neuroscience
|October 30, 2009
PubMed
Summary
This summary is machine-generated.

The human brain tackles complex choices by breaking down action values into separate components for each effector, improving reinforcement learning in high-dimensional spaces.

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Related Experiment Videos

Last Updated: Jun 19, 2026

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Area of Science:

  • Neuroscience
  • Cognitive Science
  • Reinforcement Learning

Background:

  • High-dimensional action spaces pose a challenge for reinforcement learning.
  • Neural systems must efficiently learn action values despite numerous effectors.

Purpose of the Study:

  • To investigate if the brain decomposes action values into effector-specific components.
  • To test a reinforcement learning framework for handling high-dimensional action spaces.

Main Methods:

  • Subjects performed a multieffector choice task with simultaneous left and right hand decisions.
  • Behavioral data and blood oxygen-level-dependent (BOLD) activity were analyzed.
  • Computational models compared decomposed versus unitary action value representations.

Main Results:

  • Human behavior favored a learning model with decomposed action values.
  • BOLD signaling showed effector-specific value decomposition in the striatum and intraparietal sulcus.
  • Lateralized biases reflected prediction errors and value correlates.

Conclusions:

  • The brain employs decomposed value representations to manage reinforcement learning in high-dimensional action spaces.
  • This 'divide and conquer' strategy enhances behavioral flexibility and learning efficiency.