Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

335
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
335
Optimal Foraging00:48

Optimal Foraging

13.3K
How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.
13.3K
Introduction to Structures01:30

Introduction to Structures

1.6K
A structure is defined as a system of interconnected members designed to support or transfer forces and successfully withstand the loads acting on them. The internal forces of a structure can be determined by decomposing the structure and analyzing the free-body diagrams of the individual members or of a combination of members. This helps in understanding the structural elements' behavior and ensuring that the structure is stable and can withstand the subjected loads.
There are three main...
1.6K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

235
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
235
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

5.1K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
5.1K
Randomized Experiments01:13

Randomized Experiments

8.8K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A reporting checklist for large language models in behavioural science.

Nature human behaviour·2026
Same author

Fast efficient coding and sensory adaptation in gain-adaptive recurrent networks.

Nature communications·2026
Same author

Human-level learning of complex novel tasks as theory-based modelling, exploration and planning.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

Gradient Descent as Loss Landscape Navigation: a Normative Framework for Deriving Learning Rules.

Advances in neural information processing systems·2026
Same author

Probabilistic forecasting guides dynamic decisions.

Psychological review·2026
Same author

Phasic dopamine drives conditioned responding beyond its role in learning.

bioRxiv : the preprint server for biology·2026
Same journal

Sublexical semantic decoding during incidental novel word learning in natural Chinese reading.

Cognitive psychology·2026
Same journal

Seeing, hearing, and feeling causation.

Cognitive psychology·2026
Same journal

Separating decision and motor contributions to behavioral biases induced by manipulating stimulus probability.

Cognitive psychology·2026
Same journal

Congruency drives "conflict adaptation" independent of conflict: Converging evidence from behavior and computational modeling.

Cognitive psychology·2026
Same journal

Corrigendum to "Network analyses identify critical factors for facilitating future-oriented decision-making" [Cogn. Psychol. 165 (2026) 101815].

Cognitive psychology·2026
Same journal

The time course of local coherence effects in German: Evidence from self-paced reading times and event-related potentials.

Cognitive psychology·2026
See all related articles

Related Experiment Video

Updated: Dec 28, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

6.4K

Finding structure in multi-armed bandits.

Eric Schulz1, Nicholas T Franklin1, Samuel J Gershman1

  • 1Harvard University, United States.

Cognitive Psychology
|February 15, 2020
PubMed
Summary
This summary is machine-generated.

Humans explore rewards by using underlying functional structures, not just independent options. They improve exploration efficiency through function learning, clustering, and uncertainty, demonstrating sophisticated reinforcement learning.

Keywords:
Decision makingExploration-exploitationFunction learningGaussian processGeneralizationLatent structureLearningLearning-to-learnReinforcement learningStructure learning

More Related Videos

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm
06:18

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

2.5K
Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods
13:04

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

12.4K

Related Experiment Videos

Last Updated: Dec 28, 2025

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

6.4K
The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm
06:18

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

2.5K
Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods
13:04

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

12.4K

Area of Science:

  • Cognitive Science
  • Neuroscience
  • Reinforcement Learning

Background:

  • Human reward search is often modeled using multi-armed bandit tasks.
  • Standard models assume independent reward distributions for each option, which is unrealistic.
  • Real-world options often share underlying structures that influence learning.

Purpose of the Study:

  • To investigate how generalization guides exploration in structured environments.
  • To understand if humans utilize latent functional structure in reward-seeking tasks.
  • To examine learning-to-learn effects across repeated exploration rounds.

Main Methods:

  • Utilized structured multi-armed bandit tasks with options correlated by a latent function.
  • Focused on bandits where rewards were linear functions of spatial position.
  • Conducted 5 experiments to observe human exploration strategies and compared computational models.

Main Results:

  • Participants demonstrated the use of functional structure to guide exploration.
  • Evidence of a learning-to-learn effect was observed, with faster function identification over time.
  • Findings held true for both linear and non-linear reward functions, ruling out heuristic explanations.

Conclusions:

  • Human reinforcement learning effectively leverages latent structure for efficient exploration.
  • Optimal models of human behavior integrate function learning, reward distribution clustering, and uncertainty-guided exploration.
  • This suggests sophisticated cognitive mechanisms underlie human decision-making in complex reward landscapes.