Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026

Same author

Inverse Reinforcement Learning for Disturbed Networked Nonlinear Systems With Data Dropouts.

IEEE transactions on neural networks and learning systems·2025

Same author

Distributed FilterNet Reinforcement Learning for Achieving Output Consensus in Heterogeneous Multiplayer Multiagent Systems.

IEEE transactions on neural networks and learning systems·2025

Same author

Neuroadaptive Control With Enhanced Stability and Reliability.

IEEE transactions on neural networks and learning systems·2025

Same author

Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts.

IEEE transactions on cybernetics·2025

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 12, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Data-Efficient Reinforcement Learning for Complex Nonlinear Systems.

Vrushabh S Donge, Bosen Lian, Frank L Lewis

IEEE Transactions on Cybernetics

|October 31, 2023

Summary

This summary is machine-generated.

This study introduces a data-efficient reinforcement learning (RL) algorithm using Koopman operators for nonlinear systems. It enables optimal control with less data by leveraging a linear model representation.

More Related Videos

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

Related Experiment Videos

Last Updated: Jul 12, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

Published on: March 8, 2024

Area of Science:

Control Theory
Machine Learning
Dynamical Systems

Background:

Complex nonlinear systems pose challenges for traditional control methods.
Reinforcement learning (RL) offers a powerful framework for optimal control.
Data efficiency is crucial for practical RL applications in real-world systems.

Purpose of the Study:

To develop a data-efficient model-free reinforcement learning (RL) algorithm for nonlinear systems.
To enable high-dimensional data-driven optimal control by lifting nonlinear dynamics into a linear model.
To reduce the data requirements for learning optimal control strategies.

Main Methods:

Utilizing Koopman operators to represent nonlinear dynamics in a linear framework.
Employing a data-driven, model-based RL approach to derive an off-policy Bellman equation.
Deducing a novel data-efficient RL algorithm that bypasses the need for an explicit Koopman-based linear model.
Analyzing Koopman eigenfunctions for dataset truncation effects.

Main Results:

The proposed algorithm achieves data-efficient optimal control for nonlinear systems.
It effectively preserves essential dynamic information while minimizing data needs.
The framework demonstrates successful validation on power system excitation control.
Theoretical and numerical analyses confirm the efficacy of Koopman eigenfunctions in dataset truncation.

Conclusions:

The developed model-free RL algorithm offers a significant advancement in controlling complex nonlinear systems efficiently.
This approach reduces data dependency, making optimal control more accessible.
The Koopman operator framework provides a robust method for analyzing and controlling nonlinear dynamics.
The successful application to power systems highlights the practical utility of the proposed method.