Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026

Same author

Inverse Reinforcement Learning for Disturbed Networked Nonlinear Systems With Data Dropouts.

IEEE transactions on neural networks and learning systems·2025

Same author

Distributed FilterNet Reinforcement Learning for Achieving Output Consensus in Heterogeneous Multiplayer Multiagent Systems.

IEEE transactions on neural networks and learning systems·2025

Same author

Neuroadaptive Control With Enhanced Stability and Reliability.

IEEE transactions on neural networks and learning systems·2025

Same author

Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts.

IEEE transactions on cybernetics·2025

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jul 12, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

对复杂的非线性系统进行数据高效的强化学习.

Vrushabh S Donge, Bosen Lian, Frank L Lewis

IEEE transactions on cybernetics

|October 31, 2023

概括

此摘要是机器生成的。

本研究介绍了一种数据效率高的强化学习 (RL) 算法,用于非线性系统使用库普曼运算符. 它通过利用线性模型表示方式,以较少的数据实现最佳控制.

更多相关视频

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

相关实验视频

Last Updated: Jul 12, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

科学领域:

控制理论控制理论
机器学习机器学习
动态系统动态系统

背景情况:

复杂的非线性系统对传统的控制方法构成挑战.
强化学习 (RL) 为最佳控制提供了一个强大的框架.
数据效率对于现实世界系统中的实际RL应用至关重要.

研究的目的:

为非线性系统开发一个数据效率高的无模型强化学习 (RL) 算法.
通过将非线性动态提升到线性模型中,实现基于高维数据的最佳控制.
为了减少学习最佳控制策略的数据要求.

主要方法:

使用库普曼运算符在线性框架中表示非线性动态.
采用数据驱动型,基于模型的RL方法来推导出非政策的贝尔曼方程.
推导出一种新的数据效率高的RL算法,它绕过了基于库普曼的明确线性模型的需求.
分析库普曼固有函数对数据集截断效应的分析.

主要成果:

拟议的算法实现了非线性系统的数据效率最佳控制.
它有效地保留了基本的动态信息,同时最大限度地减少了数据需求.
该框架证明了对电力系统激发控制的成功验证.
理论和数值分析证实了库普曼特有函数在数据集截断中的有效性.

结论:

开发的无模型RL算法在有效控制复杂的非线性系统方面取得了重大进展.
这种方法减少了数据依赖,使最佳控制更容易获得.
库普曼运算符框架为分析和控制非线性动态提供了一个强大的方法.
对电力系统的成功应用凸显了拟议方法的实际实用性.