Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

662
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
662
Instinctive Drift01:05

Instinctive Drift

497
Instinctive drift refers to the tendency of animals to revert to their innate behaviors despite repeated reinforcement. Breland and Breland demonstrated this concept in an experiment with a raccoon. The raccoon was trained to pick up two coins and place them in a container in exchange for food. Initially, the raccoon learned to associate the coins with food, making them a conditioned stimulus or a substitute for food. However, over time, the raccoon became less willing to put the coins into the...
497
Reinforcement Schedules01:24

Reinforcement Schedules

358
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
358
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.2K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.2K
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

703
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
703
Associative Learning01:27

Associative Learning

930
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
930

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Distributed control circuits across a brain-and-cord connectome.

Nature·2026
Same author

Lean mass to soft tissue ratio in Chinese fetuses: A novel predictor of congenital myopathy.

Early human development·2026
Same author

Dopamine in the ventral and tail of striatum supports global and local evaluation in reward-threat conflict.

bioRxiv : the preprint server for biology·2026
Same author

Dual Roles and Therapeutic Prospects of Proximal Tubular Epithelial Cell Senescence in Acute Kidney Injury.

Biomolecules·2026
Same author

Spectral envelopes of facial movements predict intention, cortical representations, and neural prosthetic control.

bioRxiv : the preprint server for biology·2026
Same author

Continuously graded-doped SnO<sub>2</sub> for efficient n-i-p perovskite solar cells.

Nature·2026
Same journal

A large brain adds new types of neurons: Molecular and functional signatures of spindle neurons in the human neocortex.

Trends in neurosciences·2026
Same journal

Exercise as a regulator of glymphatic function.

Trends in neurosciences·2026
Same journal

The neural basis of laughter.

Trends in neurosciences·2026
Same journal

Enteric neuroimmune interactions in health and disease.

Trends in neurosciences·2026
Same journal

Atomic insights into the physiological and functional diversity of NMDA receptors.

Trends in neurosciences·2026
Same journal

Cognitive functions of the GPe.

Trends in neurosciences·2026
See all related articles

Related Experiment Video

Updated: Dec 4, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning
09:22

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

14.9K

Distributional Reinforcement Learning in the Brain.

Adam S Lowet1, Qiao Zheng2, Sara Matias1

  • 1Department of Molecular and Cellular Biology, Center for Brain Science, Harvard University, Cambridge, MA 02138, USA.

Trends in Neurosciences
|October 23, 2020
PubMed
Summary
This summary is machine-generated.

Understanding reward distributions is key for survival. This study explores biologically plausible machine learning algorithms for reconstructing reward distributions and their neurobiological implementation.

Keywords:
artificial intelligencedeep neural networksdopaminemachine learningpopulation codingreward

More Related Videos

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
09:01

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

12.9K
A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.7K

Related Experiment Videos

Last Updated: Dec 4, 2025

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning
09:22

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

14.9K
The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents
09:01

The Double-H Maze: A Robust Behavioral Test for Learning and Memory in Rodents

Published on: July 8, 2015

12.9K
A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents
09:13

A Fully Automated and Highly Versatile System for Testing Multi-cognitive Functions and Recording Neuronal Activities in Rodents

Published on: May 3, 2012

14.7K

Area of Science:

  • Neuroscience
  • Machine Learning
  • Computational Neuroscience

Background:

  • Survival depends on learning from rewards and punishments.
  • Dopamine neuron firing correlates with reward prediction errors in reinforcement learning.
  • Learning the full reward distribution, not just the mean, offers survival advantages.

Purpose of the Study:

  • To review the mathematical foundations of machine learning algorithms for reconstructing reward distributions.
  • To examine the neurobiological evidence supporting these algorithms.
  • To identify future research directions in distributional reinforcement learning.

Main Methods:

  • Review of mathematical frameworks for distributional reinforcement learning.
  • Analysis of existing neurobiological evidence for distributional coding.
  • Synthesis of current understanding and future outlook.

Main Results:

  • Biologically plausible machine learning algorithms exist for reconstructing reward distributions.
  • Initial evidence suggests neurobiological implementation of these distributional codes.
  • The study highlights the potential for a more complete understanding of reward learning.

Conclusions:

  • Distributional reinforcement learning offers a more comprehensive model of reward processing.
  • Further research is needed to understand the neural circuits and behavioral outputs of distributional codes.
  • This approach advances our understanding of decision-making and adaptive behavior.