Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Reinforcement01:23

Reinforcement

343
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
343
Observational Learning01:12

Observational Learning

314
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
314
Law of Effect01:06

Law of Effect

1.6K
B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...
1.6K
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

410
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
410
Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

802
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
802
Reinforcement Schedules01:24

Reinforcement Schedules

242
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
242

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Decoding species coexistence: A reinforcement learning perspective.

Physical review. E·2026
Same author

Study on the ecotoxicological effects of airplane tire wear particles versus automobile tire wear particles on earthworms.

Ecotoxicology and environmental safety·2026
Same author

Effect of esketamine-based patient-controlled intravenous analgesia on postoperative pain and quality of recovery after video-assisted thoracoscopic lobectomy: A prospective, double-blind, randomized controlled trial.

PloS one·2026
Same author

Optimal coordination of resource: A solution from reinforcement learning.

Physical review. E·2026
Same author

Evolution of cooperation in a dual-mode mixture of conditional cooperators.

Physical review. E·2025
Same author

Promethazine Combined with Metoclopramide for the Prevention of Postoperative Nausea and Vomiting in Patients Undergoing Laparoscopic Colorectal Cancer Surgery: A Randomized Controlled Trial.

Drug design, development and therapy·2025
Same journal

Erratum: Low-dimensional model for adaptive networks of spiking neurons [Phys. Rev. E 111, 014422 (2025)].

Physical review. E·2026
Same journal

Disentangling the effects of many-body forces on depletion interactions.

Physical review. E·2026
Same journal

Charge transport and mode transition in dual-energy electron beam diodes.

Physical review. E·2026
Same journal

Optimization of multisite reactions in complex compartmentalized media.

Physical review. E·2026
Same journal

Origin of geometric cohesion in nonconvex granular materials: Interplay between interdigitation and rotational constraints enhancing frictional stability.

Physical review. E·2026
Same journal

Interaction of walkers with a standing Faraday wave.

Physical review. E·2026
查看所有相关文章

相关实验视频

Updated: Sep 13, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.8K

解码公平:一个强化学习的观点.

Guozhong Zheng1, Jiqiang Zhang2, Xin Ou1

  • 1Shaanxi Normal University, School of Physics and Information Technology, Xi'an 710061, People's Republic of China.

Physical review. E
|August 1, 2025
PubMed
概括
此摘要是机器生成的。

在最后通游戏 (UG) 中的人类行为显示出对公平的偏好,挑战传统的经济模式. 这项研究表明,内部动机,而不是外部因素,通过强化学习推动公平的决策.

更多相关视频

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies
08:24

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

Published on: August 25, 2023

803
Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

6.1K

相关实验视频

Last Updated: Sep 13, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

8.8K
The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies
08:24

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

Published on: August 25, 2023

803
Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
07:05

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

6.1K

科学领域:

  • 行为经济学是一种行为经济学.
  • 计算神经科学是一种神经科学.
  • 游戏理论 游戏理论

背景情况:

  • 关于最后通游戏 (UG) 的行为实验表明人类更喜欢公平,这与正统的经济预测相矛盾.
  • 现有的解释往往依赖于模仿学习框架内的外部 (外源) 因素.

研究的目的:

  • 通过强化学习 (RL) 范式,调查最后通游戏中公平的出现.
  • 确定内生激励单独是否可以解释没有外部因素的公平性.

主要方法:

  • 应用Q学习到最后通游戏中,为每个玩家分配两个Q表,用于提出者和响应者的角色.
  • 分析了一个双人场景,扩展到具有各种角色分配 (随机,固定,旋转) 的格子化群体.

主要成果:

  • 公平在UG中显著出现,当时玩家重视经验和未来的奖励.
  • 成功交易的概率随着更高的报价而增加,与经验观察一致.
  • 该系统表现出两个阶段,稳定到公平或合理的策略中,在不同的条件下强大.

结论:

  • 强化学习框架中的内生激励措施足以解释最后通游戏中公平的出现.
  • 在经济决策中观察到的公平行为不需要外部因素来解释.