Updated: Jul 6, 2025

Distributional reinforcement learning in prefrontal cortex.

Timothy H Muller^1,2, James L Butler^3,4, Sebastijan Veselic^3,4,5

¹Department of Experimental Psychology, University of Oxford, Oxford, UK. timothymuller127@gmail.com.

Nature Neuroscience

January 10, 2024

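Summary

This summary is machine-generated.

Distributional reinforcement learning (RL) explains brain activity related to reward-guided learning in the anterior cingulate cortex better than classical RL theory. This points to a common mechanism for how the brain learns from rewards.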

Scientific fields:

Neuroscience.
Computational neuroscience.
Cognitive science.

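Background:

The prefrontal cortex is essential for learning and decision-making.
Classical reinforcement learning (RL) theory focuses on the expected reward and has been used to explain neural data from the prefrontal cortex.
Distributional RL accounts for the full distribution of rewards and better explains dopamine responses.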
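
The contrast between the two theories can be illustrated with a minimal sketch. It is not the model used in the paper. It assumes one common formulation of distributional RL in which each value estimate scales positive and negative reward prediction errors differently, so that it settles on an expectile of the reward distribution rather than its mean. The function names, the asymmetry parameter `tau`, the learning rate, and the two-outcome reward distribution are all hypothetical and chosen only for illustration.

```python
import random


def classical_td(rewards, alpha=0.005):
    """Classical RL: a single value estimate that tracks the mean reward."""
    value = 0.0
    for r in rewards:
        delta = r - value        # reward prediction error
        value += alpha * delta   # same learning rate for both error signs
    return value


def distributional_td(rewards, taus, alpha=0.005):
    """Distributional RL sketch: one value estimate per asymmetry level tau.

    Positive errors are scaled by tau and negative errors by (1 - tau), so
    optimistic units (high tau) settle above the mean and pessimistic units
    (low tau) settle below it.
    """
    values = [0.0 for _ in taus]
    for r in rewards:
        for i, tau in enumerate(taus):
            delta = r - values[i]
            rate = alpha * tau if delta > 0 else alpha * (1.0 - tau)
            values[i] += rate * delta
    return values


def sample_rewards(n, seed=0):
    """Hypothetical task: reward is 1 or 9 with equal probability."""
    rng = random.Random(seed)
    return [1.0 if rng.random() < 0.5 else 9.0 for _ in range(n)]


rewards = sample_rewards(40000)
mean_estimate = classical_td(rewards)
expectiles = distributional_td(rewards, taus=[0.1, 0.5, 0.9])

print("classical estimate:", round(mean_estimate, 2))
print("distributional estimates:", [round(v, 2) for v in expectiles])
```

For this two-outcome distribution the expectile at level τ is 1 + 8τ, so the three distributional estimates should settle near 1.8, 5.0 and 8.2, while the classical estimate hovers around the mean of 5. The single classical number cannot distinguish this task from one that always pays 5, whereas the spread of the distributional estimates can.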

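Purpose of the study:

To investigate whether distributional RL explains neuronal responses in the anterior cingulate cortex better than classical RL.
To determine whether distributional RL represents a common mechanism of reward-guided learning across brain regions.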

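Main methods:

Analysis of neuronal recordings from the anterior cingulate cortex.
Comparison of how well classical RL and distributional RL models explain the neural data.
Modeling of reward-based learning processes.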

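Main results:

Distributional RL models explained neuronal responses in the anterior cingulate cortex better than classical RL models.
These findings indicate that distributional RL captures key aspects of reward processing in this brain region.
This suggests that learning the full distribution of outcomes is a fundamental aspect of reward-guided behavior.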

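Conclusions:

Distributional reinforcement learning provides a more comprehensive framework for understanding the neural mechanisms of reward-guided learning.
The anterior cingulate cortex makes use of distributional RL principles, which suggests that this is a widespread mechanism in the brain.
These findings advance our understanding of decision-making and learning processes in the brain.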