Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Open and closed-loop control systems

Open and closed-loop control systems

Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

State Space Representation

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

PI Controller: Design

PI Controller: Design

Proportional Integral (PI) controllers are a fundamental component in modern control systems, widely used to enhance performance and mitigate steady-state errors. They are particularly effective in applications such as automatic brightness adjustment on smartphones, where they excel at mitigating steady-state errors for step-function inputs. Unlike PD controllers, which require time-varying errors to function optimally, PI controllers leverage their integral component to address residual...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Predefined-time distributed optimal formation control for constrained UAV-UGV systems.

ISA transactions·2026

Same author

GA-Enhanced Control for Autonomous Vehicles: Coordinating FlexRay Protocol Under Randomly Perturbed Sampling Periods.

IEEE transactions on cybernetics·2026

Same author

Dual Event-Triggered Polynomial Dynamic Output Control for Positive Fuzzy Systems via an IT2 Membership Function Relaxation Method.

IEEE transactions on cybernetics·2026

Same author

Extended Dissipative Analysis for Uncertain Delayed Genetic Regulatory Networks via Interval Type-2 T-S Fuzzy Framework.

IEEE transactions on cybernetics·2026

Same author

A Hybrid GCN-LSTM Model for Ventricular Arrhythmia Classification Based on ECG Pattern Similarity.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same author

Stochastic Neural Network Control for Stochastic Nonlinear Systems With Quadratic Local Asymmetric Prescribed Performance.

IEEE transactions on cybernetics·2025

Same journal

Relaxed Stability Conditions for Model Predictive Control of Hybrid Dynamical Systems Using Hybrid Recurrent Neural Networks.

IEEE transactions on cybernetics·2026

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 28, 2026

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

Reinforcement Learning-Based Fuzzy Control for Nonlinear Systems With Unknown Dynamics via Parallel Composite Policy

Yiqun Liu, Lifei Dai, Changzhu Zhang

IEEE Transactions on Cybernetics

|February 25, 2026

Summary

This summary is machine-generated.

This study introduces a novel Parallel Composite Policy Iteration (PCPI) algorithm for reinforcement learning (RL)-based fuzzy control in nonlinear systems. The PCPI algorithm overcomes limitations of traditional methods, enabling efficient control even with unknown system dynamics.

More Related Videos

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

Related Experiment Videos

Last Updated: Jun 28, 2026

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

Area of Science:

Control Systems Engineering
Artificial Intelligence
Fuzzy Logic Systems

Background:

Reinforcement learning (RL) and fuzzy control are crucial for nonlinear systems.
Traditional policy iteration (PI) and value iteration (VI) methods face challenges like initial stabilizing policies and persistent excitation (PE) conditions.
Solving fuzzy algebraic Riccati equations (FARE) for complex nonlinear systems is difficult with conventional approaches.

Purpose of the Study:

To develop a novel Parallel Composite Policy Iteration (PCPI) algorithm for RL-based fuzzy control.
To address limitations of existing PI/VI algorithms, including the need for initial stabilizing control policies and PE conditions.
To solve the complex fuzzy algebraic Riccati equation (FARE) in nonlinear systems with unknown dynamics.

Main Methods:

A novel PCPI algorithm is proposed, incorporating adaptive parameters to remove the need for an initial stabilizing control policy.
An online, model-free PCPI variant is introduced for systems with difficult-to-obtain dynamic information.
The PE condition is relaxed to an initial excitation (IE) condition by utilizing online data, and algorithms run concurrently per fuzzy rule.

Main Results:

The proposed PCPI algorithm effectively alleviates drawbacks of traditional RL-based fuzzy control methods.
The adaptive parameters eliminate the requirement for an initial stabilizing control policy.
The online, model-free PCPI relaxes the PE condition to IE, enhancing applicability to systems with unknown dynamics.

Conclusions:

The developed PCPI algorithm offers an effective solution for RL-based fuzzy control of nonlinear systems with unknown dynamics.
The algorithm's ability to relax excitation conditions and operate model-free enhances its practical applicability.
Experimental validation on a robot arm and active suspension system confirms the algorithm's effectiveness.