Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

Optimal Foraging

Optimal Foraging

How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.

Introduction to Structures

Introduction to Structures

A structure is defined as a system of interconnected members designed to support or transfer forces and successfully withstand the loads acting on them. The internal forces of a structure can be determined by decomposing the structure and analyzing the free-body diagrams of the individual members or of a combination of members. This helps in understanding the structural elements' behavior and ensuring that the structure is stable and can withstand the subjected loads.
There are three main...

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Collisions in Multiple Dimensions: Problem Solving

Collisions in Multiple Dimensions: Problem Solving

In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...

Randomized Experiments

Randomized Experiments

The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A reporting checklist for large language models in behavioural science.

Nature human behaviour·2026

Same author

Fast efficient coding and sensory adaptation in gain-adaptive recurrent networks.

Nature communications·2026

Same author

Human-level learning of complex novel tasks as theory-based modelling, exploration and planning.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026

Same author

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules.

Advances in neural information processing systems·2026

Same author

Probabilistic forecasting guides dynamic decisions.

Psychological review·2026

Same author

Phasic dopamine drives conditioned responding beyond its role in learning.

bioRxiv : the preprint server for biology·2026

Same journal

Sublexical semantic decoding during incidental novel word learning in natural Chinese reading.

Cognitive psychology·2026

Same journal

Seeing, hearing, and feeling causation.

Cognitive psychology·2026

Same journal

Separating decision and motor contributions to behavioral biases induced by manipulating stimulus probability.

Cognitive psychology·2026

Same journal

Congruency drives "conflict adaptation" independent of conflict: Converging evidence from behavior and computational modeling.

Cognitive psychology·2026

Same journal

Corrigendum to "Network analyses identify critical factors for facilitating future-oriented decision-making" [Cogn. Psychol. 165 (2026) 101815].

Cognitive psychology·2026

Same journal

The time course of local coherence effects in German: Evidence from self-paced reading times and event-related potentials.

Cognitive psychology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 28, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

Finding structure in multi-armed bandits.

Eric Schulz¹, Nicholas T Franklin¹, Samuel J Gershman¹

¹Harvard University, United States.

Cognitive Psychology

|February 15, 2020

Summary

This summary is machine-generated.

Humans explore rewards by using underlying functional structures, not just independent options. They improve exploration efficiency through function learning, clustering, and uncertainty, demonstrating sophisticated reinforcement learning.

Keywords:

Decision making Exploration-exploitation Function learning Gaussian process Generalization Latent structure Learning Learning-to-learn Reinforcement learning Structure learning

More Related Videos

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

Related Experiment Videos

Last Updated: Dec 28, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

Area of Science:

Cognitive Science
Neuroscience
Reinforcement Learning

Background:

Human reward search is often modeled using multi-armed bandit tasks.
Standard models assume independent reward distributions for each option, which is unrealistic.
Real-world options often share underlying structures that influence learning.

Purpose of the Study:

To investigate how generalization guides exploration in structured environments.
To understand if humans utilize latent functional structure in reward-seeking tasks.
To examine learning-to-learn effects across repeated exploration rounds.

Main Methods:

Utilized structured multi-armed bandit tasks with options correlated by a latent function.
Focused on bandits where rewards were linear functions of spatial position.
Conducted 5 experiments to observe human exploration strategies and compared computational models.

Main Results:

Participants demonstrated the use of functional structure to guide exploration.
Evidence of a learning-to-learn effect was observed, with faster function identification over time.
Findings held true for both linear and non-linear reward functions, ruling out heuristic explanations.

Conclusions:

Human reinforcement learning effectively leverages latent structure for efficient exploration.
Optimal models of human behavior integrate function learning, reward distribution clustering, and uncertainty-guided exploration.
This suggests sophisticated cognitive mechanisms underlie human decision-making in complex reward landscapes.