Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Instinctive Drift

Instinctive Drift

Instinctive drift refers to the tendency of animals to revert to their innate behaviors despite repeated reinforcement. Breland and Breland demonstrated this concept in an experiment with a raccoon. The raccoon was trained to pick up two coins and place them in a container in exchange for food. Initially, the raccoon learned to associate the coins with food, making them a conditioned stimulus or a substitute for food. However, over time, the raccoon became less willing to put the coins into the...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Primary and Secondary Reinforcers

Primary and Secondary Reinforcers

In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Distributed control circuits across a brain-and-cord connectome.

Nature·2026

Same author

Lean mass to soft tissue ratio in Chinese fetuses: A novel predictor of congenital myopathy.

Early human development·2026

Same author

Dopamine in the ventral and tail of striatum supports global and local evaluation in reward-threat conflict.

bioRxiv : the preprint server for biology·2026

Same author

Dual Roles and Therapeutic Prospects of Proximal Tubular Epithelial Cell Senescence in Acute Kidney Injury.

Biomolecules·2026

Same author

Spectral envelopes of facial movements predict intention, cortical representations, and neural prosthetic control.

bioRxiv : the preprint server for biology·2026

Same author

Continuously graded-doped SnO<sub>2</sub> for efficient n-i-p perovskite solar cells.

Nature·2026

Same journal

A large brain adds new types of neurons: Molecular and functional signatures of spindle neurons in the human neocortex.

Trends in neurosciences·2026

Same journal

Exercise as a regulator of glymphatic function.

Trends in neurosciences·2026

Same journal

The neural basis of laughter.

Trends in neurosciences·2026

Same journal

Enteric neuroimmune interactions in health and disease.

Trends in neurosciences·2026

Same journal

Atomic insights into the physiological and functional diversity of NMDA receptors.

Trends in neurosciences·2026

Same journal

Cognitive functions of the GPe.

Trends in neurosciences·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 4, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

Distributional Reinforcement Learning in the Brain.

Adam S Lowet¹, Qiao Zheng², Sara Matias¹

¹Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, MA 02138, USA.

Trends in Neurosciences

|October 23, 2020

Summary

This summary is machine-generated.

Understanding reward distributions is key for survival. This study explores biologically plausible machine learning algorithms for reconstructing reward distributions and their neurobiological implementation.

Keywords:

artificial intelligence deep neural networks dopamine machine learning population coding reward

More Related Videos

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

Related Experiment Videos

Last Updated: Dec 4, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

Area of Science:

Neuroscience
Machine Learning
Computational Neuroscience

Background:

Survival depends on learning from rewards and punishments.
Dopamine neuron firing correlates with reward prediction errors in reinforcement learning.
Learning the full reward distribution, not just the mean, offers survival advantages.

Purpose of the Study:

To review the mathematical foundations of machine learning algorithms for reconstructing reward distributions.
To examine the neurobiological evidence supporting these algorithms.
To identify future research directions in distributional reinforcement learning.

Main Methods:

Review of mathematical frameworks for distributional reinforcement learning.
Analysis of existing neurobiological evidence for distributional coding.
Synthesis of current understanding and future outlook.

Main Results:

Biologically plausible machine learning algorithms exist for reconstructing reward distributions.
Initial evidence suggests neurobiological implementation of these distributional codes.
The study highlights the potential for a more complete understanding of reward learning.

Conclusions:

Distributional reinforcement learning offers a more comprehensive model of reward processing.
Further research is needed to understand the neural circuits and behavioral outputs of distributional codes.
This approach advances our understanding of decision-making and adaptive behavior.