Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
Reinforcement Schedules01:24

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
State Space Representation01:27

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...
Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
Decision Making: P-value Method01:09

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim  is also stated. These statements can act as null and alternative hypotheses:  a null hypothesis would be a neutral statement while the alternative hypothesis can have a...
PD Controller: Design01:26

PD Controller: Design

In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Corrigendum to "Variable pore size of mesoporous silica in improving physical stability and oral bioavailability of insoluble drugs" [Int. J. Pharm. 674 (2025) 125394].

International journal of pharmaceutics·2026
Same author

A Toolkit for Targeted Neuromodulation of Striatal Direct Pathway Neurons Rescues Parkinsonian Motor Deficits in Mice.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

Plasma proteomics reveal SERPINA1 and CD59 as candidate biomarkers for COVID-19 severity stratification and prognosis prediction.

Annals of medicine·2026
Same author

Structural basis for LMBD1-dependent trafficking and cobalamin export of ABCD4.

Nature communications·2026
Same author

Precise Multi-Site Borylation Engineering of Indolo[2,3-c]Carbazole-Bridged Narrowband MR-TADF Emitters.

Angewandte Chemie (International ed. in English)·2026
Same author

Contaminative Data-Driven Koopman Resilient Distributed Filtering for Unknown Stochastic Nonlinear Systems.

IEEE transactions on neural networks and learning systems·2026
Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026
Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026
Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026
Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026
Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026
Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Videos

Safe Reinforcement Learning for Nonlinear Multiagent Systems Based on Min-Max DMPC.

Yang Peng, Huaicheng Yan, Qiwei Liu

    IEEE Transactions on Cybernetics
    |May 12, 2026
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a safe reinforcement learning (RL) framework for nonlinear multiagent systems (MASs). It enhances control strategies by adaptively updating parameters, ensuring safety and stability during learning.

    Related Experiment Videos

    Area of Science:

    • Control Theory
    • Artificial Intelligence
    • Robotics

    Background:

    • Nonlinear multiagent systems (MASs) require robust control strategies to handle disturbances and ensure safety.
    • Traditional robust distributed model predictive control (DMPC) can be overly conservative due to fixed disturbance bounds and precise model requirements.

    Purpose of the Study:

    • To develop a safe reinforcement learning (RL) framework for nonlinear MASs.
    • To enhance the performance and safety of control strategies by mitigating the conservatism of traditional robust DMPC.
    • To ensure recursive feasibility and closed-loop stability during the adaptive learning process.

    Main Methods:

    • Integration of min-max DMPC as a baseline for robust control strategy generation.
    • Application of safe RL to adaptively update controller parameters and disturbance sets online.
    • Formal guarantee of recursive feasibility for the DMPC algorithm throughout the learning phase.
    • Theoretical analysis of closed-loop stability.

    Main Results:

    • The proposed safe RL framework successfully mitigates the conservatism of robust DMPC.
    • Adaptive online updates of controller parameters and disturbance sets are achieved.
    • Recursive feasibility of the DMPC algorithm is formally guaranteed during learning.
    • Theoretical stability analyses confirm the robustness of the approach.

    Conclusions:

    • The developed framework offers a safe and effective approach for controlling nonlinear MASs.
    • The method enhances performance by adaptively managing uncertainties and ensuring stability.
    • Validated through simulations, the approach demonstrates effectiveness and scalability for complex systems.