Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.

Xindi Yang, Hao Zhang, Zhuping Wang

IEEE Transactions on Neural Networks and Learning Systems

February 15, 2021

Summary

This summary is machine-generated.

This study introduces a data-driven distributed control algorithm for multiagent systems with unknown dynamics. It ensures real-time performance and stability, even with varying agent computational abilities.

Area of Science:

Control Systems Engineering
Artificial Intelligence
Distributed Computing

Background:

Multiagent systems often face challenges with unknown dynamics and differing computational capabilities among agents.
Achieving consensus control in these complex systems requires robust and adaptive learning algorithms.
Existing methods may struggle with real-time performance and asynchronous learning scenarios.

Purpose of the Study:

To develop a data-based distributed control algorithm for discrete-time multiagent systems with unknown dynamics.
To address challenges posed by computational ability differences and ensure asynchronous learning.
To guarantee the convergence, stability, and optimality of the proposed control strategies.
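For orientation, the consensus problem named above is often written along the following lines. This is a generic textbook-style formulation, not taken from the paper: the symbols $f_i$, $a_{ij}$, $\mathcal{N}_i$, $Q_i$, and $R_i$ are assumptions introduced here for illustration, and the paper's own dynamics and cost may differ.

```latex
% Assumed notation (not from the source): N agents, unknown dynamics f_i,
% neighbour set \mathcal{N}_i with edge weights a_{ij} \ge 0.
\begin{align}
  x_i(k+1) &= f_i\bigl(x_i(k), u_i(k)\bigr), \qquad i = 1, \dots, N
    && \text{(discrete-time agent, $f_i$ unknown)} \\
  e_i(k) &= \sum_{j \in \mathcal{N}_i} a_{ij}\,\bigl(x_i(k) - x_j(k)\bigr)
    && \text{(local neighbourhood error)} \\
  \lim_{k \to \infty} \bigl\| x_i(k) - x_j(k) \bigr\| &= 0 \quad \text{for all } i, j
    && \text{(consensus)} \\
  J_i &= \sum_{k=0}^{\infty} \Bigl( e_i(k)^{\top} Q_i\, e_i(k) + u_i(k)^{\top} R_i\, u_i(k) \Bigr)
    && \text{(one possible local cost)}
\end{align}
```

Under this reading, "optimal consensus control" means each agent chooses $u_i$ from locally available information so that the errors $e_i$ vanish while its cost $J_i$ is kept small.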

Main Methods:

A data-based distributed control algorithm using offline system interaction data sets.
Distributed policy gradient reinforcement learning (RL) for policy improvement.
Functional analysis and Lyapunov method for convergence and stability guarantees.
An asynchronous extension to handle varying computational abilities.
An actor-critic neural network structure with the method of weighted residuals.
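The general idea of improving a distributed control policy from interaction data alone can be illustrated with a toy sketch. The example below is not the paper's algorithm: it swaps in a finite-difference gradient estimate on scalar feedback gains in place of the actor-critic networks and the method of weighted residuals, and it queries a simulator rather than an offline data set. The graph, the single-integrator plant, the quadratic cost, and every name (`rollout_cost`, `fd_gradient`, `learn_gains`) are assumptions made for illustration.

```python
import numpy as np

# Hypothetical fully connected 3-agent graph (assumption for illustration).
A = np.array([[0., 1., 1.],
              [1., 0., 1.],
              [1., 1., 0.]])
L = np.diag(A.sum(axis=1)) - A  # graph Laplacian


def step(x, u):
    """Black-box plant (toy single integrators); the learner never inspects it."""
    return x + u


def rollout_cost(theta, x0, horizon=30, rho=1.0):
    """Sum of squared local consensus errors plus control effort over one rollout."""
    x = x0.copy()
    cost = 0.0
    for _ in range(horizon):
        e = L @ x          # e_i = sum_j a_ij (x_i - x_j): neighbour information only
        u = -theta * e     # agent i applies its own gain theta_i to its own error
        cost += float(e @ e + rho * (u @ u))
        x = step(x, u)
    return cost


def fd_gradient(theta, x0, eps=1e-4):
    """Central finite-difference estimate of d(cost)/d(theta) from rollouts."""
    grad = np.zeros_like(theta)
    for i in range(theta.size):
        d = np.zeros_like(theta)
        d[i] = eps
        grad[i] = (rollout_cost(theta + d, x0) - rollout_cost(theta - d, x0)) / (2 * eps)
    return grad


def learn_gains(x0, theta0, lr=1e-3, iters=200):
    """Gradient descent on the gains; a step is accepted only if the cost drops."""
    theta = theta0.copy()
    cost = rollout_cost(theta, x0)
    for _ in range(iters):
        g = fd_gradient(theta, x0)
        s = lr
        while s > 1e-10:
            cand = theta - s * g
            c = rollout_cost(cand, x0)
            if c < cost:
                theta, cost = cand, c
                break
            s *= 0.5       # halve the step until the cost improves
        else:
            break          # no improving step found: stop early
    return theta, cost


x0 = np.array([1.0, -1.0, 0.5])
theta0 = np.full(3, 0.1)
initial_cost = rollout_cost(theta0, x0)
theta, final_cost = learn_gains(x0, theta0)
print(f"cost before: {initial_cost:.3f}, after: {final_cost:.3f}, gains: {theta}")
```

Each agent's control law uses only its neighbours' states, which is the "distributed" part. The cost evaluation in this sketch is global for brevity; a faithful distributed scheme would have each agent evaluate and improve its own local cost.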

Main Results:

The proposed algorithm ensures real-time performance and uses interaction data to improve system performance.
Convergence and stability are mathematically guaranteed for the distributed control system.
The asynchronous version effectively handles differing agent computational speeds.
The actor-critic networks demonstrate convergence and optimality, with approximation errors tending to zero.
Simulations validate the effectiveness of the developed algorithm.
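One simple way to picture learning under differing computational speeds is to let each agent refresh its policy parameter on its own schedule. The sketch below is a hypothetical illustration, not the paper's asynchronous algorithm: the update periods, the toy single-integrator plant, the quadratic cost, and the names `rollout_cost` and `async_learn` are all assumptions.

```python
import numpy as np

# Laplacian of a hypothetical fully connected 3-agent graph (assumption).
L = np.array([[ 2., -1., -1.],
              [-1.,  2., -1.],
              [-1., -1.,  2.]])


def rollout_cost(theta, x0, horizon=30, rho=1.0):
    """Consensus error plus control effort for gains theta on a toy plant."""
    x, cost = x0.copy(), 0.0
    for _ in range(horizon):
        e = L @ x
        u = -theta * e
        cost += float(e @ e + rho * (u @ u))
        x = x + u          # toy single-integrator agents (assumed)
    return cost


def async_learn(x0, theta0, periods=(1, 2, 4), lr=1e-3, ticks=200, eps=1e-4):
    """Agent i attempts an update only every periods[i] ticks (slower agents lag)."""
    theta = theta0.copy()
    attempts = np.zeros(theta.size, dtype=int)
    for t in range(ticks):
        for i, p in enumerate(periods):
            if t % p != 0:
                continue   # agent i has not finished computing at this tick
            attempts[i] += 1
            d = np.zeros_like(theta)
            d[i] = eps
            g_i = (rollout_cost(theta + d, x0) - rollout_cost(theta - d, x0)) / (2 * eps)
            base = rollout_cost(theta, x0)
            s = lr
            for _ in range(20):            # halve the step until the cost improves
                cand = theta.copy()
                cand[i] -= s * g_i
                if rollout_cost(cand, x0) < base:
                    theta = cand
                    break
                s *= 0.5
    return theta, attempts


x0 = np.array([1.0, -1.0, 0.5])
theta0 = np.full(3, 0.1)
initial_cost = rollout_cost(theta0, x0)
theta, attempts = async_learn(x0, theta0)
final_cost = rollout_cost(theta, x0)
print(f"attempts per agent: {attempts}, cost {initial_cost:.3f} -> {final_cost:.3f}")
```

Because an update is accepted only when it lowers the cost, the cost never increases in this toy setting even though the agents update at different rates. That monotonicity comes from the acceptance test used here, not from the convergence analysis reported in the paper.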

Conclusions:

The data-based distributed reinforcement learning approach provides an effective solution for consensus control in complex multiagent systems.
The algorithm is robust to unknown system dynamics and computational heterogeneity.
The work advances the field of distributed control by enabling stable and optimal performance in asynchronous learning environments.