Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Feedback control systems01:26

Feedback control systems

319
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
319
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

110
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
110
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

85
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
85
Classification of Systems-I01:26

Classification of Systems-I

191
Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
191
Observational Learning01:12

Observational Learning

188
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
188
Associative Learning01:27

Associative Learning

412
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
412

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026
Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Inverse Reinforcement Learning for Disturbed Networked Nonlinear Systems With Data Dropouts.

IEEE transactions on neural networks and learning systems·2025
Same author

Distributed FilterNet Reinforcement Learning for Achieving Output Consensus in Heterogeneous Multiplayer Multiagent Systems.

IEEE transactions on neural networks and learning systems·2025
Same author

Neuroadaptive Control With Enhanced Stability and Reliability.

IEEE transactions on neural networks and learning systems·2025
Same author

Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts.

IEEE transactions on cybernetics·2025
Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026
Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026
Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026
Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026
Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026
Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Video

Updated: Jul 12, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.0K

Data-Efficient Reinforcement Learning for Complex Nonlinear Systems.

Vrushabh S Donge, Bosen Lian, Frank L Lewis

    IEEE Transactions on Cybernetics
    |October 31, 2023
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a data-efficient reinforcement learning (RL) algorithm using Koopman operators for nonlinear systems. It enables optimal control with less data by leveraging a linear model representation.

    More Related Videos

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.4K
    Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
    09:44

    Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

    Published on: March 8, 2024

    4.8K

    Related Experiment Videos

    Last Updated: Jul 12, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.0K
    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
    05:41

    A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

    Published on: February 6, 2020

    9.4K
    Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology
    09:44

    Author Spotlight: Advancing Large-Scale Neural Dynamics Through HD-MEA Technology

    Published on: March 8, 2024

    4.8K

    Area of Science:

    • Control Theory
    • Machine Learning
    • Dynamical Systems

    Background:

    • Complex nonlinear systems pose challenges for traditional control methods.
    • Reinforcement learning (RL) offers a powerful framework for optimal control.
    • Data efficiency is crucial for practical RL applications in real-world systems.

    Purpose of the Study:

    • To develop a data-efficient model-free reinforcement learning (RL) algorithm for nonlinear systems.
    • To enable high-dimensional data-driven optimal control by lifting nonlinear dynamics into a linear model.
    • To reduce the data requirements for learning optimal control strategies.

    Main Methods:

    • Utilizing Koopman operators to represent nonlinear dynamics in a linear framework.
    • Employing a data-driven, model-based RL approach to derive an off-policy Bellman equation.
    • Deducing a novel data-efficient RL algorithm that bypasses the need for an explicit Koopman-based linear model.
    • Analyzing Koopman eigenfunctions for dataset truncation effects.

    Main Results:

    • The proposed algorithm achieves data-efficient optimal control for nonlinear systems.
    • It effectively preserves essential dynamic information while minimizing data needs.
    • The framework demonstrates successful validation on power system excitation control.
    • Theoretical and numerical analyses confirm the efficacy of Koopman eigenfunctions in dataset truncation.

    Conclusions:

    • The developed model-free RL algorithm offers a significant advancement in controlling complex nonlinear systems efficiently.
    • This approach reduces data dependency, making optimal control more accessible.
    • The Koopman operator framework provides a robust method for analyzing and controlling nonlinear dynamics.
    • The successful application to power systems highlights the practical utility of the proposed method.