Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

Primary and Secondary Reinforcers

Primary and Secondary Reinforcers

In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Decoding species coexistence: A reinforcement learning perspective.

Physical review. E·2026

Same author

Study on the ecotoxicological effects of airplane tire wear particles versus automobile tire wear particles on earthworms.

Ecotoxicology and environmental safety·2026

Same author

Effect of esketamine-based patient-controlled intravenous analgesia on postoperative pain and quality of recovery after video-assisted thoracoscopic lobectomy: A prospective, double-blind, randomized controlled trial.

PloS one·2026

Same author

Optimal coordination of resource: A solution from reinforcement learning.

Physical review. E·2026

Same author

Evolution of cooperation in a dual-mode mixture of conditional cooperators.

Physical review. E·2025

Same author

Promethazine Combined with Metoclopramide for the Prevention of Postoperative Nausea and Vomiting in Patients Undergoing Laparoscopic Colorectal Cancer Surgery: A Randomized Controlled Trial.

Drug design, development and therapy·2025

Same journal

Erratum: Low-dimensional model for adaptive networks of spiking neurons [Phys. Rev. E 111, 014422 (2025)].

Physical review. E·2026

Same journal

Disentangling the effects of many-body forces on depletion interactions.

Physical review. E·2026

Same journal

Charge transport and mode transition in dual-energy electron beam diodes.

Physical review. E·2026

Same journal

Optimization of multisite reactions in complex compartmentalized media.

Physical review. E·2026

Same journal

Origin of geometric cohesion in nonconvex granular materials: Interplay between interdigitation and rotational constraints enhancing frictional stability.

Physical review. E·2026

Same journal

Interaction of walkers with a standing Faraday wave.

Physical review. E·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Sep 13, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

解码公平:一个强化学习的观点.

Guozhong Zheng¹, Jiqiang Zhang², Xin Ou¹

¹Shaanxi Normal University, School of Physics and Information Technology, Xi'an 710061, People's Republic of China.

Physical review. E

|August 1, 2025

概括

此摘要是机器生成的。

在最后通游戏 (UG) 中的人类行为显示出对公平的偏好,挑战传统的经济模式. 这项研究表明,内部动机,而不是外部因素,通过强化学习推动公平的决策.

更多相关视频

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

Published on: August 25, 2023

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

相关实验视频

Last Updated: Sep 13, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

The Joint Effect of Social Comparison and Social Distance on Evaluation of Intertemporal Choice Outcomes in Event-related Potential Studies

Published on: August 25, 2023

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

Published on: September 10, 2018

科学领域:

行为经济学是一种行为经济学.
计算神经科学是一种神经科学.
游戏理论游戏理论

背景情况:

关于最后通游戏 (UG) 的行为实验表明人类更喜欢公平,这与正统的经济预测相矛盾.
现有的解释往往依赖于模仿学习框架内的外部 (外源) 因素.

研究的目的:

通过强化学习 (RL) 范式,调查最后通游戏中公平的出现.
确定内生激励单独是否可以解释没有外部因素的公平性.

主要方法:

应用Q学习到最后通游戏中,为每个玩家分配两个Q表,用于提出者和响应者的角色.
分析了一个双人场景,扩展到具有各种角色分配 (随机,固定,旋转) 的格子化群体.

主要成果:

公平在UG中显著出现,当时玩家重视经验和未来的奖励.
成功交易的概率随着更高的报价而增加,与经验观察一致.
该系统表现出两个阶段,稳定到公平或合理的策略中,在不同的条件下强大.

结论:

强化学习框架中的内生激励措施足以解释最后通游戏中公平的出现.
在经济决策中观察到的公平行为不需要外部因素来解释.