Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

PD Controller: Design

PD Controller: Design

In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...

Open and closed-loop control systems

Open and closed-loop control systems

Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

Same author

PID-Optimized Deep Learning for Adaptive Time-Frequency Forecasting in Dynamic Systems: Coal Calorific Value Prediction.

IEEE transactions on cybernetics·2026

Same author

Adaptive Sensor Fault-Tolerant Control for Distributed Parameter Systems.

IEEE transactions on cybernetics·2026

Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026

Same author

ADR-DMOEA: A Dynamic Multiobjective Optimization Evolutionary Algorithm Based on Adaptive Dynamic Response Strategy.

IEEE transactions on cybernetics·2026

Same author

DuaDiff: Dual-Conditional Diffusion Model for Guided Thermal Image Super-Resolution.

IEEE transactions on neural networks and learning systems·2025

Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026

Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026

Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026

Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Sep 5, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Data-Driven Inverse Reinforcement Learning Control for Linear Multiplayer Games.

Bosen Lian, Vrushabh S Donge, Frank L Lewis

IEEE Transactions on Neural Networks and Learning Systems

|July 5, 2022

Summary

This summary is machine-generated.

This study introduces a data-driven inverse reinforcement learning (RL) algorithm for multiplayer games. The method reconstructs expert cost functions from demonstrated data, enabling control without knowing system dynamics.

More Related Videos

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Related Experiment Videos

Last Updated: Sep 5, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Area of Science:

Control Theory
Game Theory
Machine Learning

Background:

Multilayer games in linear systems present complex control challenges.
Inverse reinforcement learning (RL) aims to infer cost functions from expert behavior.
Existing methods often require knowledge of system dynamics or expert gains.

Purpose of the Study:

To propose a data-driven inverse RL control algorithm for nonzero-sum multiplayer games.
To develop a method for reconstructing unknown expert cost functions using demonstrated trajectories.
To enable control policy acquisition without prior knowledge of system dynamics.

Main Methods:

A model-based inverse RL policy iteration framework is proposed, involving policy evaluation, state-reward weight improvement, and policy improvement.
An online, data-driven, off-policy inverse RL algorithm is developed based on the model-based approach.
Lyapunov functions and inverse optimal control (IOC) are utilized for cost function reconstruction.

Main Results:

The proposed algorithms allow a learner to obtain expert control feedback gains and trajectories using only system trajectory data.
The online off-policy algorithm guarantees unbiased solutions, even with added probing noise satisfying the persistence of excitation (PE) condition.
Convergence and stability analyses confirm the theoretical soundness of the developed algorithms.

Conclusions:

The data-driven inverse RL approach effectively solves control problems in multiplayer games without system dynamics knowledge.
The developed algorithms provide a robust and stable method for learning control policies from expert demonstrations.
Simulation examples validate the practical effectiveness of the proposed inverse RL techniques.