SINDy-RL for interpretable and efficient model-based reinforcement learning

Nicholas Zolman¹,², Christian Lagemann³, Urban Fasel⁴

¹Department of Mechanical Engineering, University of Washington, Seattle, WA, USA. nzolman@uw.edu.

Nature Communications | November 28, 2025

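Summary

This summary is machine-generated.

This study introduces SINDy-RL, a new framework that combines sparse dictionary learning with deep reinforcement learning (DRL). Compared with conventional DRL, SINDy-RL needs far fewer training examples to produce efficient, interpretable control policies.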

Scientific fields:

Control theory
Machine learning
Fluid dynamics

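Background:

Deep reinforcement learning (DRL) performs well on complex control problems, but it requires large amounts of data and produces black-box policies.
Sparse dictionary learning methods such as SINDy yield efficient, interpretable models, particularly in the low-data regime.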

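Purpose of the study:

Introduce SINDy-RL, a unified framework that integrates SINDy and DRL.
Develop efficient, interpretable, and reliable data-driven models of the dynamics, the reward, and the control policy.
Address the data inefficiency and limited interpretability of conventional DRL.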

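Main methods:

Integration of sparse identification of nonlinear dynamics (SINDy) with deep reinforcement learning (DRL).
Development of a unified framework (SINDy-RL) for learning the dynamics, the reward function, and the control policy.
Application to benchmark control tasks and flow control problems, including gust mitigation on an airfoil.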
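
The sparse-regression idea behind SINDy can be illustrated with a minimal sketch. It is not the authors' implementation: the candidate library, the threshold, the toy dynamics, and the names `library` and `stlsq` are all assumptions chosen for illustration. The sketch fits a controlled one-dimensional system by sequentially thresholded least squares, a standard way to obtain sparse coefficients, using only NumPy.

```python
import numpy as np

def library(x, u):
    """Hypothetical candidate library: 1, x, u, x^2, x*u, u^2."""
    return np.column_stack([np.ones_like(x), x, u, x**2, x * u, u**2])

def stlsq(theta, dxdt, threshold=0.1, n_iter=10):
    """Sequentially thresholded least squares (illustrative settings)."""
    xi = np.linalg.lstsq(theta, dxdt, rcond=None)[0]
    for _ in range(n_iter):
        small = np.abs(xi) < threshold
        xi[small] = 0.0               # prune negligible terms
        big = ~small
        if not big.any():
            break
        # Refit using only the terms that survived the threshold.
        xi[big] = np.linalg.lstsq(theta[:, big], dxdt, rcond=None)[0]
    return xi

# Toy, noise-free data from the assumed dynamics dx/dt = -2 x + 0.5 u.
rng = np.random.default_rng(0)
x = rng.uniform(-2.0, 2.0, 200)
u = rng.uniform(-1.0, 1.0, 200)
dxdt = -2.0 * x + 0.5 * u

xi = stlsq(library(x, u), dxdt)
terms = ["1", "x", "u", "x^2", "x*u", "u^2"]
print({t: round(float(c), 3) for t, c in zip(terms, xi) if c != 0.0})
```

Only the `x` and `u` terms survive, so the fitted model can be read directly as an equation. That readability is the sense in which such models are called interpretable.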

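Main results:

SINDy-RL achieves performance comparable to state-of-the-art DRL algorithms.
The framework requires far fewer environment interactions for training than conventional DRL.
The resulting control policies are orders of magnitude smaller than DRL-derived policies and easier to interpret.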
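
To give a feel for the size gap between a neural-network policy and a polynomial one, the sketch below counts parameters for two hypothetical policies. The network shape (4 inputs, two hidden layers of 64 units, 1 output) and the quadratic library are assumptions for illustration, not figures from the paper.

```python
from math import comb

def mlp_param_count(layer_sizes):
    """Weights plus biases of a fully connected network."""
    return sum(n_in * n_out + n_out
               for n_in, n_out in zip(layer_sizes, layer_sizes[1:]))

def poly_library_size(n_vars, degree):
    """Number of monomials of total degree <= degree in n_vars variables."""
    return comb(n_vars + degree, degree)

# Hypothetical sizes: 4 observations, 1 action.
mlp_params = mlp_param_count([4, 64, 64, 1])
poly_terms = poly_library_size(4, 2)   # dense upper bound; a sparse fit keeps fewer

print(mlp_params, poly_terms)
```

Under these assumptions the dense quadratic library already has a few hundred times fewer coefficients than the network, and a sparse fit would drop more of them.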

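Conclusions:

For control tasks, SINDy-RL offers a more data-efficient and more interpretable alternative to standard DRL.
The framework yields reliable, computationally efficient models suited to a range of applications, including embedded systems.
The approach improves the practical applicability of reinforcement learning in complex dynamic environments.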