Neuron-level Prediction and Noise can Implement Flexible Reward-Seeking Behavior.

Chenguang Li¹, Jonah Brenner², Adam Boesky³

¹Biophysics Program, Harvard College, Cambridge, MA 02138.

bioRxiv: the preprint server for biology

June 3, 2024

Summary

This summary is machine-generated.

Neural networks can exhibit autonomous reward-seeking behavior using internal noise and local updates, adapting to their environments without explicit external reward signals. This biologically plausible approach enables flexible, self-governed exploration and exploitation strategies.

Area of Science:

Computational Neuroscience
Artificial Intelligence
Machine Learning

Background:

Traditional reinforcement learning often relies on explicit environmental reward functions.
Autonomous agents require mechanisms for adaptive behavior and decision-making.
Understanding biologically plausible learning rules is crucial for advancing AI.
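
For contrast with the approach summarized here, the sketch below shows what "relying on an explicit environmental reward function" can look like in a conventional setting: an epsilon-greedy multi-armed bandit. It is a generic textbook example, not code from the study. The function name `run_bandit`, the reward probabilities, and all parameter values are illustrative assumptions.

```python
import random

def run_bandit(reward_probs, steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy bandit: learning is driven by an explicit external reward."""
    rng = random.Random(seed)
    n_arms = len(reward_probs)
    values = [0.0] * n_arms   # running estimate of each arm's value
    counts = [0] * n_arms     # number of times each arm was pulled
    for _ in range(steps):
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)                       # explore
        else:
            arm = max(range(n_arms), key=values.__getitem__)  # exploit
        # The environment, not the agent, supplies the reward signal.
        reward = 1.0 if rng.random() < reward_probs[arm] else 0.0
        counts[arm] += 1
        # Incremental mean update of the pulled arm's value estimate.
        values[arm] += (reward - values[arm]) / counts[arm]
    return values, counts

values, counts = run_bandit([0.2, 0.8])
print(values, counts)
```

Every value update in this sketch depends on `reward`, a quantity handed to the agent from outside. That dependence is what the study summarized here aims to remove.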

Purpose of the Study:

To demonstrate that neural networks can achieve reward-seeking behavior without external rewards.
To investigate the role of internal noise and local updates in autonomous behavior.
To explore how networks adapt to environmental and architectural changes.

Main Methods:

Development of neural networks utilizing local predictive updates and internal noise.
Analysis of attractor dynamics governing explore-exploit switching.
Testing network adaptability to modifications in architecture, environment, and motor interfaces.
Investigating task preference formation and bias mechanisms.
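
To make the phrase "local predictive updates and internal noise" more concrete, here is a minimal toy sketch. It assumes a single linear unit that predicts its next input and adjusts its weight with a delta rule that uses only its own input and prediction error. This is not the algorithm from the study, whose details are not given in this summary. The function `train_local_predictor`, the learning rate, and the noise level are all hypothetical choices, and in this toy the noise merely perturbs the prediction rather than driving exploration.

```python
import random

def train_local_predictor(signal, lr=0.05, noise_std=0.1, seed=0):
    """One linear unit predicts its next input; the update uses only local terms."""
    rng = random.Random(seed)
    w = 0.0       # the unit's single weight (hypothetical toy model)
    errors = []
    for x_now, x_next in zip(signal, signal[1:]):
        noise = rng.gauss(0.0, noise_std)   # internally generated noise
        prediction = w * x_now + noise      # noisy prediction of the next input
        error = x_next - prediction         # prediction error available at the unit
        w += lr * error * x_now             # delta rule: input times local error
        errors.append(abs(error))
    return w, errors

# An alternating input (+1, -1, +1, ...) is predicted exactly by w = -1.
signal = [(-1.0) ** t for t in range(500)]
w, errors = train_local_predictor(signal)
print(w)
```

No reward term appears anywhere in the update. The weight changes only in response to the mismatch between what the unit predicted and what it then received, which is the sense of "local" assumed in this sketch.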

Main Results:

Neural networks successfully implemented reward-seeking behavior autonomously.
Internal noise and local updates were sufficient for adaptive interaction.
Networks demonstrated plasticity, adapting to changes without external control.
Task preferences were shown to be influenced by noise, initialization, and network architecture.

Conclusions:

A novel, biologically plausible algorithm enables autonomous, adaptable interaction with environments.
The approach removes the need for explicit environmental reward functions.
This work offers a flexible framework for developing self-governed intelligent agents.