Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Feedback control systems01:26

Feedback control systems

411
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
411
Reinforcement Schedules01:24

Reinforcement Schedules

235
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
235
Reinforcement01:23

Reinforcement

319
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
319
Observational Learning01:12

Observational Learning

296
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
296
PD Controller: Design01:26

PD Controller: Design

339
In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...
339
Open and closed-loop control systems01:17

Open and closed-loop control systems

971
Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...
971

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026
Same author

PID-Optimized Deep Learning for Adaptive Time-Frequency Forecasting in Dynamic Systems: Coal Calorific Value Prediction.

IEEE transactions on cybernetics·2026
Same author

Adaptive Sensor Fault-Tolerant Control for Distributed Parameter Systems.

IEEE transactions on cybernetics·2026
Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

ADR-DMOEA: A Dynamic Multiobjective Optimization Evolutionary Algorithm Based on Adaptive Dynamic Response Strategy.

IEEE transactions on cybernetics·2026
Same author

DuaDiff: Dual-Conditional Diffusion Model for Guided Thermal Image Super-Resolution.

IEEE transactions on neural networks and learning systems·2025
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Sep 5, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.0K

Data-Driven Inverse Reinforcement Learning Control for Linear Multiplayer Games.

Bosen Lian, Vrushabh S Donge, Frank L Lewis

    IEEE Transactions on Neural Networks and Learning Systems
    |July 5, 2022
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a data-driven inverse reinforcement learning (RL) algorithm for multiplayer games. The method reconstructs expert cost functions from demonstrated data, enabling control without knowing system dynamics.

    More Related Videos

    Automated Interactive Video Playback for Studies of Animal Communication
    07:21

    Automated Interactive Video Playback for Studies of Animal Communication

    Published on: February 9, 2011

    13.6K
    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.7K

    Related Experiment Videos

    Last Updated: Sep 5, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.0K
    Automated Interactive Video Playback for Studies of Animal Communication
    07:21

    Automated Interactive Video Playback for Studies of Animal Communication

    Published on: February 9, 2011

    13.6K
    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.7K

    Area of Science:

    • Control Theory
    • Game Theory
    • Machine Learning

    Background:

    • Multilayer games in linear systems present complex control challenges.
    • Inverse reinforcement learning (RL) aims to infer cost functions from expert behavior.
    • Existing methods often require knowledge of system dynamics or expert gains.

    Purpose of the Study:

    • To propose a data-driven inverse RL control algorithm for nonzero-sum multiplayer games.
    • To develop a method for reconstructing unknown expert cost functions using demonstrated trajectories.
    • To enable control policy acquisition without prior knowledge of system dynamics.

    Main Methods:

    • A model-based inverse RL policy iteration framework is proposed, involving policy evaluation, state-reward weight improvement, and policy improvement.
    • An online, data-driven, off-policy inverse RL algorithm is developed based on the model-based approach.
    • Lyapunov functions and inverse optimal control (IOC) are utilized for cost function reconstruction.

    Main Results:

    • The proposed algorithms allow a learner to obtain expert control feedback gains and trajectories using only system trajectory data.
    • The online off-policy algorithm guarantees unbiased solutions, even with added probing noise satisfying the persistence of excitation (PE) condition.
    • Convergence and stability analyses confirm the theoretical soundness of the developed algorithms.

    Conclusions:

    • The data-driven inverse RL approach effectively solves control problems in multiplayer games without system dynamics knowledge.
    • The developed algorithms provide a robust and stable method for learning control policies from expert demonstrations.
    • Simulation examples validate the practical effectiveness of the proposed inverse RL techniques.