Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

State Space Representation

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...

Time-Domain Interpretation of PD Control

Time-Domain Interpretation of PD Control

Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can have a...

PD Controller: Design

PD Controller: Design

In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Corrigendum to "Variable pore size of mesoporous silica in improving physical stability and oral bioavailability of insoluble drugs" [Int. J. Pharm. 674 (2025) 125394].

International journal of pharmaceutics·2026

Same author

A Toolkit for Targeted Neuromodulation of Striatal Direct Pathway Neurons Rescues Parkinsonian Motor Deficits in Mice.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026

Same author

Plasma proteomics reveal SERPINA1 and CD59 as candidate biomarkers for COVID-19 severity stratification and prognosis prediction.

Annals of medicine·2026

Same author

Structural basis for LMBD1-dependent trafficking and cobalamin export of ABCD4.

Nature communications·2026

Same author

Precise Multi-Site Borylation Engineering of Indolo[2,3-c]Carbazole-Bridged Narrowband MR-TADF Emitters.

Angewandte Chemie (International ed. in English)·2026

Same author

Contaminative Data-Driven Koopman Resilient Distributed Filtering for Unknown Stochastic Nonlinear Systems.

IEEE transactions on neural networks and learning systems·2026

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Videos

Safe Reinforcement Learning for Nonlinear Multiagent Systems Based on Min-Max DMPC.

Yang Peng, Huaicheng Yan, Qiwei Liu

IEEE Transactions on Cybernetics

|May 12, 2026

Summary

This summary is machine-generated.

This study introduces a safe reinforcement learning (RL) framework for nonlinear multiagent systems (MASs). It enhances control strategies by adaptively updating parameters, ensuring safety and stability during learning.

Related Experiment Videos

Area of Science:

Control Theory
Artificial Intelligence
Robotics

Background:

Nonlinear multiagent systems (MASs) require robust control strategies to handle disturbances and ensure safety.
Traditional robust distributed model predictive control (DMPC) can be overly conservative due to fixed disturbance bounds and precise model requirements.

Purpose of the Study:

To develop a safe reinforcement learning (RL) framework for nonlinear MASs.
To enhance the performance and safety of control strategies by mitigating the conservatism of traditional robust DMPC.
To ensure recursive feasibility and closed-loop stability during the adaptive learning process.

Main Methods:

Integration of min-max DMPC as a baseline for robust control strategy generation.
Application of safe RL to adaptively update controller parameters and disturbance sets online.
Formal guarantee of recursive feasibility for the DMPC algorithm throughout the learning phase.
Theoretical analysis of closed-loop stability.

Main Results:

The proposed safe RL framework successfully mitigates the conservatism of robust DMPC.
Adaptive online updates of controller parameters and disturbance sets are achieved.
Recursive feasibility of the DMPC algorithm is formally guaranteed during learning.
Theoretical stability analyses confirm the robustness of the approach.

Conclusions:

The developed framework offers a safe and effective approach for controlling nonlinear MASs.
The method enhances performance by adaptively managing uncertainties and ensuring stability.
Validated through simulations, the approach demonstrates effectiveness and scalability for complex systems.