Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Timing and Consequences on Behavior01:08

Timing and Consequences on Behavior

88
In operant conditioning, the timing of reinforcement is crucial. For animals like rats and cats, immediate reinforcement (within a few seconds) is much more effective than delayed reinforcement. For example, a food reward for a rat needs to follow within 30 seconds of pressing a bar to be effective. 
Humans, however, can respond to delayed reinforcers. We often make decisions between immediate small rewards and delayed larger rewards. This ability to delay gratification is a significant...
88
Neural Regulation01:37

Neural Regulation

39.4K
Digestion begins with a cephalic phase that prepares the digestive system to receive food. When our brain processes visual or olfactory information about food, it triggers impulses in the cranial nerves innervating the salivary glands and stomach to prepare for food.
39.4K
Law of Effect01:06

Law of Effect

1.4K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Compressed sensing-based approach identifies modular neural circuitry driving learned pathogen avoidance.

eLife·2026
Same author

Single-neuron encoding of rapidly learned visual information reshapes human perception.

bioRxiv : the preprint server for biology·2025
Same author

Arrayed single-gene perturbations identify drivers of human anterior neural tube closure.

bioRxiv : the preprint server for biology·2025
Same author

Adversarial testing of global neuronal workspace and integrated information theories of consciousness.

Nature·2025
Same author

Theta phase precession supports memory formation and retrieval of naturalistic experience in humans.

Nature human behaviour·2024
Same author

The Impact of Scene Context on Visual Object Recognition: Comparing Humans, Monkeys, and Computational Models.

bioRxiv : the preprint server for biology·2024
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jun 24, 2025

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.4K

Neuron-level Prediction and Noise can Implement Flexible Reward-Seeking Behavior.

Chenguang Li1, Jonah Brenner2, Adam Boesky3

  • 1Biophysics Program, Harvard College, Cambridge, MA 02138.

Biorxiv : the Preprint Server for Biology
|June 3, 2024
PubMed
Summary
This summary is machine-generated.

Neural networks exhibit autonomous reward-seeking behavior using internal noise and local updates, adapting to environments without external signals. This biologically plausible approach enables flexible, self-governed exploration and exploitation strategies.

More Related Videos

Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

11.0K
A Conflict Model of Reward-seeking Behavior in Male Rats
06:11

A Conflict Model of Reward-seeking Behavior in Male Rats

Published on: February 20, 2019

7.4K

Related Experiment Videos

Last Updated: Jun 24, 2025

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.4K
Pavlovian Conditioned Approach Training in Rats
06:57

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

11.0K
A Conflict Model of Reward-seeking Behavior in Male Rats
06:11

A Conflict Model of Reward-seeking Behavior in Male Rats

Published on: February 20, 2019

7.4K

Area of Science:

  • Computational Neuroscience
  • Artificial Intelligence
  • Machine Learning

Background:

  • Traditional reinforcement learning often relies on explicit environmental reward functions.
  • Autonomous agents require mechanisms for adaptive behavior and decision-making.
  • Understanding biologically plausible learning rules is crucial for advancing AI.

Purpose of the Study:

  • To demonstrate neural networks can achieve reward-seeking behavior without external rewards.
  • To investigate the role of internal noise and local updates in autonomous behavior.
  • To explore how networks adapt to environmental and architectural changes.

Main Methods:

  • Development of neural networks utilizing local predictive updates and internal noise.
  • Analysis of attractor dynamics governing explore-exploit switching.
  • Testing network adaptability to modifications in architecture, environment, and motor interfaces.
  • Investigating task preference formation and bias mechanisms.

Main Results:

  • Neural networks successfully implemented reward-seeking behavior autonomously.
  • Internal noise and local updates were sufficient for adaptive interaction.
  • Networks demonstrated plasticity, adapting to changes without external control.
  • Task preferences were shown to be influenced by noise, initialization, and network architecture.

Conclusions:

  • A novel, biologically plausible algorithm enables autonomous, adaptable interaction with environments.
  • The approach removes the need for explicit environmental reward functions.
  • This work offers a flexible framework for developing self-governed intelligent agents.