Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Feedback control systems01:26

Feedback control systems

303
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
303
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

81
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
81
BIBO stability of continuous and discrete -time systems01:24

BIBO stability of continuous and discrete -time systems

381
System stability is a fundamental concept in signal processing, often assessed using convolution. For a system to be considered bounded-input bounded-output (BIBO) stable, any bounded input signal must produce a bounded output signal. A bounded input signal is one where the modulus does not exceed a certain constant at any point in time.
To determine the BIBO stability, the convolution integral is utilized when a bounded continuous-time input is applied to a Linear Time-Invariant (LTI) system....
381
Control System Problem01:21

Control System Problem

110
In an open-loop system, such as a basic thermostat, the poles of the transfer function influence the system's response but do not determine its stability. However, when feedback is introduced to form a closed-loop system, such as an advanced thermostat that adjusts heating based on room temperature, stability is governed by the new poles of the closed-loop transfer function.
When forming a closed-loop system, issues can arise if the poles cross into the unstable region, leading to potential...
110
Linear time-invariant Systems01:23

Linear time-invariant Systems

242
A system is linear if it displays the characteristics of homogeneity and additivity, together termed the superposition property. This principle is fundamental in all linear systems. Linear time-invariant (LTI) systems include systems with linear elements and constant parameters.
The input-output behavior of an LTI system can be fully defined by its response to an impulsive excitation at its input. Once this impulse response is known, the system's reaction to any other input can be...
242
Pole and System Stability01:24

Pole and System Stability

268
The transfer function is a fundamental concept representing the ratio of two polynomials. The numerator and denominator encapsulate the system's dynamics. The zeros and poles of this transfer function are critical in determining the system's behavior and stability.
Simple poles are unique roots of the denominator polynomial. Each simple pole corresponds to a distinct solution to the system's characteristic equation, typically resulting in exponential decay terms in the system's...
268

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026
Same author

PainFedMVL: A Federated Multi-View Learning Approach for Multi-Level Pain Recognition.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2026
Same author

Wavelet-Transformer Attention Network for Accurate Fetal ECG Estimation from Multi-Channel Abdominal Signals.

IEEE journal of biomedical and health informatics·2026
Same author

An Efficient Regenerated Cross-Modal Hashing: Improving Existing Hash Codes with the Arbitrary Length.

IEEE transactions on pattern analysis and machine intelligence·2026
Same author

Lung microbiota analysis in early-stage lung adenocarcinoma.

Microbiology spectrum·2026
Same author

Adapting Domain-Aware Knowledge to Vision-Language Model for Zero-Shot Anomaly Detection.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

Relaxed Stability Conditions for Model Predictive Control of Hybrid Dynamical Systems Using Hybrid Recurrent Neural Networks.

IEEE transactions on cybernetics·2026
Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026
Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026
Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026
Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026
Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Video

Updated: Jun 20, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.0K

Policy Iteration-Based Learning Design for Linear Continuous-Time Systems Under Initial Stabilizing OPFB Policy.

Chengye Zhang, Ci Chen, Frank L Lewis

    IEEE Transactions on Cybernetics
    |July 22, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a novel policy iteration (PI) method for output-feedback (OPFB) systems, overcoming limitations of initial full state-feedback (FSFB) policies. The new approach effectively learns optimal control laws directly from OPFB policies.

    More Related Videos

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    1.6K
    Interactive and Visualized Online Experimentation System for Engineering Education and Research
    08:35

    Interactive and Visualized Online Experimentation System for Engineering Education and Research

    Published on: November 24, 2021

    2.4K

    Related Experiment Videos

    Last Updated: Jun 20, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.0K
    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    1.6K
    Interactive and Visualized Online Experimentation System for Engineering Education and Research
    08:35

    Interactive and Visualized Online Experimentation System for Engineering Education and Research

    Published on: November 24, 2021

    2.4K

    Area of Science:

    • Reinforcement Learning
    • Control Theory
    • Continuous-Time Systems

    Background:

    • Policy iteration (PI) is valuable for learning decision laws in unknown environments.
    • Existing PI methods for output-feedback (OPFB) continuous-time systems require an initial stabilizing full state-feedback (FSFB) policy, violating the OPFB principle.
    • This limitation hinders the direct application of PI in scenarios where only output feedback is available.

    Purpose of the Study:

    • To establish policy iteration (PI) under an initial stabilizing output-feedback (OPFB) policy for continuous-time systems.
    • To address the violation of the OPFB principle in existing PI-based control methods.
    • To develop an efficient PI algorithm that approximates optimal control using only OPFB information.

    Main Methods:

    • An off-policy Bellman equation is utilized to transform any OPFB policy into an FSFB policy.
    • The traditional PI algorithm is revised by incorporating an additional iteration based on this transformation.
    • Theoretical analysis and a case study are employed to demonstrate the method's effectiveness.

    Main Results:

    • The proposed method successfully establishes policy iteration (PI) under an initial stabilizing OPFB policy.
    • The transformation property of the off-policy Bellman equation enables the use of OPFB policies.
    • The revised PI algorithm efficiently approximates the optimal control law under OPFB constraints.

    Conclusions:

    • The developed PI method effectively overcomes the reliance on initial FSFB policies for continuous-time systems.
    • This work provides a theoretical foundation and practical demonstration for PI in OPFB settings.
    • The proposed approach enhances the applicability of reinforcement learning for control in real-world systems with limited feedback.