Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Reinforcement Schedules01:24

Reinforcement Schedules

436
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
436
Reinforcement01:23

Reinforcement

791
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
791
Observational Learning01:12

Observational Learning

795
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
795
Steps in the Modeling Process01:14

Steps in the Modeling Process

601
Albert Bandura's theory of observational learning identifies four critical processes: attention, retention, motor reproduction, and reinforcement or motivation.
Attention is the first necessary component for observational learning. It involves focusing on what the model is doing and saying. For example, if you decide to take a drawing class to enhance your skills, you need to pay close attention to the instructor's words and hand movements. The characteristics of the model significantly...
601
Comparison between RL and RC circuits01:24

Comparison between RL and RC circuits

5.9K
An RC circuit consists of resistance and capacitance, while in an RL circuit, capacitance is replaced by an inductor. RL and RC circuits are first-order differential circuits that store energy. An RC circuit stores energy in the electric field, while an RL circuit stores energy in the magnetic field. When connected to a battery, an RC circuit charges the capacitor, causing the current to decrease from maximum to zero upon being fully charged. This increases the voltage across the capacitor from...
5.9K
Stereotype Content Model02:16

Stereotype Content Model

15.3K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.3K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Sparse identification of nonlinear dynamics and Koopman operators with Shallow Recurrent Decoder Networks.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

T-SHRED: symbolic regression for regularization and model discovery with transformer shallow recurrent decoders.

Philosophical transactions. Series A, Mathematical, physical, and engineering sciences·2026
Same author

ECG-Based Prediction of Shock-Refractory Ventricular Fibrillation During Resuscitation Without Interrupting CPR.

Circulation. Arrhythmia and electrophysiology·2026
Same author

Reduced order modeling with shallow recurrent decoder networks.

Nature communications·2025
Same author

Arousal as a universal embedding for spatiotemporal brain dynamics.

Nature·2025
Same author

Lagrangian gradient regression for the detection of coherent structures from sparse trajectory data.

Royal Society open science·2024
Same journal

Demonstration of a quantum C-NOT gate in a time-multiplexed fully reconfigurable photonic processor.

Nature communications·2026
Same journal

Nonlinear quantum light source with van der Waals ferroelectric NbOX<sub>2</sub> (X = Br, I).

Nature communications·2026
Same journal

Antagonistic histone H2A variants and autonomous heterochromatin formation shape epigenomic patterns in Arabidopsis.

Nature communications·2026
Same journal

The long tail of nitrate pollution in groundwater challenges governance of global water quality.

Nature communications·2026
Same journal

Select microbial metabolites promote tau aggregation in a murine tauopathy model.

Nature communications·2026
Same journal

Warming climate has lengthened global intense tropical cyclone seasons.

Nature communications·2026
查看所有相关文章

相关实验视频

Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K

SINDy-RL对于可解释和高效的基于模型的强化学习.

Nicholas Zolman1,2, Christian Lagemann3, Urban Fasel4

  • 1Department of Mechanical Engineering, University of Washington, Seattle, WA, USA. nzolman@uw.edu.

Nature communications
|November 28, 2025
PubMed
概括
此摘要是机器生成的。

本研究介绍了SINDy-RL,这是一个结合稀疏字典学习和深度强化学习 (DRL) 的新框架. 与传统的DRL相比,SINDy-RL使用的培训示例要少得多,以创建高效,可解释的控制政策.

更多相关视频

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K

相关实验视频

Last Updated: Jan 10, 2026

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.1K
Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task
11:18

Quantifying Learning in Young Infants: Tracking Leg Actions During a Discovery-learning Task

Published on: June 1, 2015

11.1K
The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

9.0K

科学领域:

  • 控制理论 控制理论
  • 机器学习 机器学习
  • 流体动力学 流体动力学

背景情况:

  • 深度强化学习 (DRL) 在复杂的控制方面表现出色,但需要大量的数据,并产生黑子政策.
  • 像SINDy这样稀缺的词典学习方法提供了高效的,可解释的模型,特别是在数据不足的情况下.

研究的目的:

  • 引入SINDy-RL,一个整合SINDy和DRL的统一框架.
  • 为动态,奖励和控制政策开发高效,可解释和可靠的数据驱动模型.
  • 为了解决传统DRL的数据低效和解释性限制.

主要方法:

  • 非线性动态的稀疏识别 (SINDy) 与深度强化学习 (DRL) 的整合.
  • 关于学习动态,奖励功能和控制政策的统一框架 (SINDy-RL) 的开发.
  • 用于对控制任务和流量控制问题进行基准测试的应用,包括在气形上减轻风暴.

主要成果:

  • SINDy-RL的性能与最先进的DRL算法相美.
  • 与传统的DRL相比,该框架要求培训的环境相互作用要少得多.
  • 由此产生的控制政策比DRL衍生的政策要小很多次,更易于解释.

结论:

  • 对于控制任务,SINDy-RL提供了一个比标准DRL更高效,更易于解释的数据替代方案.
  • 该框架提供了可靠和计算效率高的模型,适合各种应用,包括嵌入式系统.
  • 这种方法提高了强化学习在复杂的动态环境中的实际应用性.