Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

BIBO stability of continuous and discrete -time systems

BIBO stability of continuous and discrete -time systems

System stability is a fundamental concept in signal processing, often assessed using convolution. For a system to be considered bounded-input bounded-output (BIBO) stable, any bounded input signal must produce a bounded output signal. A bounded input signal is one where the modulus does not exceed a certain constant at any point in time.
To determine the BIBO stability, the convolution integral is utilized when a bounded continuous-time input is applied to a Linear Time-Invariant (LTI) system....

Control System Problem

Control System Problem

In an open-loop system, such as a basic thermostat, the poles of the transfer function influence the system's response but do not determine its stability. However, when feedback is introduced to form a closed-loop system, such as an advanced thermostat that adjusts heating based on room temperature, stability is governed by the new poles of the closed-loop transfer function.
When forming a closed-loop system, issues can arise if the poles cross into the unstable region, leading to potential...

Linear time-invariant Systems

Linear time-invariant Systems

A system is linear if it displays the characteristics of homogeneity and additivity, together termed the superposition property. This principle is fundamental in all linear systems. Linear time-invariant (LTI) systems include systems with linear elements and constant parameters.
The input-output behavior of an LTI system can be fully defined by its response to an impulsive excitation at its input. Once this impulse response is known, the system's reaction to any other input can be...

Pole and System Stability

Pole and System Stability

The transfer function is a fundamental concept representing the ratio of two polynomials. The numerator and denominator encapsulate the system's dynamics. The zeros and poles of this transfer function are critical in determining the system's behavior and stability.
Simple poles are unique roots of the denominator polynomial. Each simple pole corresponds to a distinct solution to the system's characteristic equation, typically resulting in exponential decay terms in the system's...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

Same author

PainFedMVL: A Federated Multi-View Learning Approach for Multi-Level Pain Recognition.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2026

Same author

Wavelet-Transformer Attention Network for Accurate Fetal ECG Estimation from Multi-Channel Abdominal Signals.

IEEE journal of biomedical and health informatics·2026

Same author

An Efficient Regenerated Cross-Modal Hashing: Improving Existing Hash Codes with the Arbitrary Length.

IEEE transactions on pattern analysis and machine intelligence·2026

Same author

Lung microbiota analysis in early-stage lung adenocarcinoma.

Microbiology spectrum·2026

Same author

Adapting Domain-Aware Knowledge to Vision-Language Model for Zero-Shot Anomaly Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

Relaxed Stability Conditions for Model Predictive Control of Hybrid Dynamical Systems Using Hybrid Recurrent Neural Networks.

IEEE transactions on cybernetics·2026

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 20, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Policy Iteration-Based Learning Design for Linear Continuous-Time Systems Under Initial Stabilizing OPFB Policy.

Chengye Zhang, Ci Chen, Frank L Lewis

IEEE Transactions on Cybernetics

|July 22, 2024

Summary

This summary is machine-generated.

This study introduces a novel policy iteration (PI) method for output-feedback (OPFB) systems, overcoming limitations of initial full state-feedback (FSFB) policies. The new approach effectively learns optimal control laws directly from OPFB policies.

More Related Videos

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Interactive and Visualized Online Experimentation System for Engineering Education and Research

Interactive and Visualized Online Experimentation System for Engineering Education and Research

Published on: November 24, 2021

Related Experiment Videos

Last Updated: Jun 20, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Interactive and Visualized Online Experimentation System for Engineering Education and Research

Interactive and Visualized Online Experimentation System for Engineering Education and Research

Published on: November 24, 2021

Area of Science:

Reinforcement Learning
Control Theory
Continuous-Time Systems

Background:

Policy iteration (PI) is valuable for learning decision laws in unknown environments.
Existing PI methods for output-feedback (OPFB) continuous-time systems require an initial stabilizing full state-feedback (FSFB) policy, violating the OPFB principle.
This limitation hinders the direct application of PI in scenarios where only output feedback is available.

Purpose of the Study:

To establish policy iteration (PI) under an initial stabilizing output-feedback (OPFB) policy for continuous-time systems.
To address the violation of the OPFB principle in existing PI-based control methods.
To develop an efficient PI algorithm that approximates optimal control using only OPFB information.

Main Methods:

An off-policy Bellman equation is utilized to transform any OPFB policy into an FSFB policy.
The traditional PI algorithm is revised by incorporating an additional iteration based on this transformation.
Theoretical analysis and a case study are employed to demonstrate the method's effectiveness.

Main Results:

The proposed method successfully establishes policy iteration (PI) under an initial stabilizing OPFB policy.
The transformation property of the off-policy Bellman equation enables the use of OPFB policies.
The revised PI algorithm efficiently approximates the optimal control law under OPFB constraints.

Conclusions:

The developed PI method effectively overcomes the reliance on initial FSFB policies for continuous-time systems.
This work provides a theoretical foundation and practical demonstration for PI in OPFB settings.
The proposed approach enhances the applicability of reinforcement learning for control in real-world systems with limited feedback.