Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

258
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
258
Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

238
Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
238
Controller Configurations01:22

Controller Configurations

222
Controller configurations are crucial in a car's cruise control system because they manage speed over time to maintain a consistent pace regardless of road conditions, thereby meeting design goals. In traditional control systems, fixed-configuration design involves predetermined controller placement. System performance modifications are known as compensation.
Control-system compensation involves various configurations, most commonly series or cascade compensation, in which the controller...
222
Feedback control systems01:26

Feedback control systems

556
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
556
Reinforcement01:23

Reinforcement

597
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
597
Open and closed-loop control systems01:17

Open and closed-loop control systems

1.3K
Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...
1.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Error analysis of Cm measurement under the whole-cell patch-clamp recording.

Journal of neuroscience methods·2009
Same author

Understanding the self-assembly of charged nanoparticles at the water/oil interface.

Physical chemistry chemical physics : PCCP·2009
Same author

[Development of new SSR markers from EST of SSH cDNA libraries on rose fragrance].

Yi chuan = Hereditas·2009
Same author

Crocin and geniposide profiles and radical scavenging activity of gardenia fruits (Gardenia jasminoides Ellis) from different cultivars and at the various stages of maturation.

Fitoterapia·2009
Same author

Small-molecule screening using a human primary cell model of HIV latency identifies compounds that reverse latency without cellular activation.

The Journal of clinical investigation·2009
Same author

Berberine lowers blood glucose in type 2 diabetes mellitus patients through increasing insulin receptor expression.

Metabolism: clinical and experimental·2009
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Nov 17, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.2K

Data-Based Optimal Consensus Control for Multiagent Systems With Policy Gradient Reinforcement Learning.

Xindi Yang, Hao Zhang, Zhuping Wang

    IEEE Transactions on Neural Networks and Learning Systems
    |February 15, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a data-driven distributed control algorithm for multiagent systems with unknown dynamics. It ensures real-time performance and stability, even with varying agent computational abilities.

    More Related Videos

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.9K
    Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
    11:53

    Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

    Published on: December 9, 2012

    13.2K

    Related Experiment Videos

    Last Updated: Nov 17, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.2K
    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.9K
    Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
    11:53

    Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

    Published on: December 9, 2012

    13.2K

    Area of Science:

    • Control Systems Engineering
    • Artificial Intelligence
    • Distributed Computing

    Background:

    • Multiagent systems often face challenges with unknown dynamics and differing computational capabilities among agents.
    • Achieving consensus control in these complex systems requires robust and adaptive learning algorithms.
    • Existing methods may struggle with real-time performance and asynchronous learning scenarios.

    Purpose of the Study:

    • To develop a data-based distributed control algorithm for discrete-time multiagent systems with unknown dynamics.
    • To address challenges posed by computational ability differences and ensure asynchronous learning.
    • To guarantee the convergence, stability, and optimality of the proposed control strategies.

    Main Methods:

    • A data-based distributed control algorithm using offline system interaction data sets.
    • Distributed policy gradient reinforcement learning (RL) for policy improvement.
    • Functional analysis and Lyapunov method for convergence and stability guarantees.
    • An asynchronous extension to handle varying computational abilities.
    • An actor-critic neural network structure with the method of weighted residuals.

    Main Results:

    • The proposed algorithm ensures real-time performance and improves system performance using interactive data.
    • Convergence and stability are mathematically guaranteed for the distributed control system.
    • The asynchronous version effectively handles differing agent computational speeds.
    • The actor-critic networks demonstrate convergence and optimality, with approximation errors tending to zero.
    • Simulations validate the effectiveness of the developed algorithm.

    Conclusions:

    • The data-based distributed reinforcement learning approach provides an effective solution for consensus control in complex multiagent systems.
    • The algorithm is robust to unknown system dynamics and computational heterogeneity.
    • The work advances the field of distributed control by enabling stable and optimal performance in asynchronous learning environments.