Output-Feedback Control of Linear Continuous-Time Systems Using Discounted Inverse Reinforcement Learning.

Han Wu, Qinglei Hu, Jianying Zheng

IEEE Transactions on Cybernetics

January 23, 2026

Summary

This summary is machine-generated.

This study introduces a new discounted inverse reinforcement learning (DIRL) algorithm for controlling unknown systems using only output data. The method reconstructs states and learns optimal control policies efficiently, outperforming existing techniques.

Area of Science:

Control Systems Engineering
Machine Learning
Robotics

Background:

Discounted inverse reinforcement learning (DIRL) typically requires full-state feedback, limiting its use in real-world applications with only input-output data.
Unknown continuous-time (CT) systems with partially observable states present significant control challenges.
Learning unknown discounted value functions is crucial for optimal control policy derivation.
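
The summary does not spell out the cost function, so the following is only a sketch of a standard discounted LQ formulation that the points above would fit. The symbols A, B, C, Q, R, P and the discount rate \gamma are assumptions introduced here for illustration, not notation taken from the paper.

```latex
% Assumed plant: linear, continuous-time, only y is measured
\dot{x}(t) = A x(t) + B u(t), \qquad y(t) = C x(t)

% Assumed discounted quadratic value function, with \gamma > 0
V\bigl(x(t)\bigr) = \int_{t}^{\infty} e^{-\gamma(\tau - t)}
  \left( y(\tau)^{\top} Q\, y(\tau) + u(\tau)^{\top} R\, u(\tau) \right) d\tau

% With V(x) = x^{\top} P x, the optimal P satisfies a discounted Riccati equation
A^{\top} P + P A - \gamma P + C^{\top} Q C - P B R^{-1} B^{\top} P = 0

% and the corresponding full-state optimal policy is
u^{*}(t) = -R^{-1} B^{\top} P\, x(t)
```

Under these assumptions the policy needs the full state x, which is why an output-only setting calls for some form of state reconstruction.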

Purpose of the Study:

To develop a novel model-free, output-feedback (OPFB) DIRL algorithm for linear quadratic (LQ) control of unknown CT systems.
To address the limitations of existing DIRL methods by enabling learning from input-output data.
To reconstruct system states from expert control inputs and measured output data for policy learning.
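
To make the state-reconstruction idea concrete, here is a minimal sketch that is not the paper's method. It assumes a discrete-time, two-state, single-input, single-output system with a known model (the paper treats unknown continuous-time systems), and all names (`reconstruct_state`, `A`, `b`, `c`) are hypothetical. It only illustrates that, for an observable system, the current state is determined by a short window of past inputs and outputs.

```python
def dot(a, b):
    """Inner product of two equal-length lists."""
    return sum(x * y for x, y in zip(a, b))


def step(A, b, x, u):
    """One step of the assumed model x_next = A x + b u (single input)."""
    return [dot(row, x) + b_i * u for row, b_i in zip(A, b)]


def reconstruct_state(A, b, c, u_hist, y_hist):
    """Recover the current state from the last two inputs and outputs.

    u_hist = [u0, u1] and y_hist = [y0, y1] are ordered oldest first,
    with y_i = c . x_i measured before u_i is applied.
    """
    u0, u1 = u_hist
    y0, y1 = y_hist
    # Row vector c A, so that y1 = (c A) x0 + (c . b) u0.
    cA = [dot(c, [A[0][j], A[1][j]]) for j in range(2)]
    # Solve [c; cA] x0 = [y0; y1 - (c . b) u0] by Cramer's rule.
    rhs = [y0, y1 - dot(c, b) * u0]
    det = c[0] * cA[1] - c[1] * cA[0]
    if abs(det) < 1e-12:
        raise ValueError("system is not observable from this window")
    x0 = [(rhs[0] * cA[1] - c[1] * rhs[1]) / det,
          (c[0] * rhs[1] - cA[0] * rhs[0]) / det]
    # Propagate the recovered old state forward through the known inputs.
    return step(A, b, step(A, b, x0, u0), u1)


if __name__ == "__main__":
    # Hypothetical example system; only the first state is measured.
    A = [[0.9, 0.2], [0.0, 0.8]]
    b = [0.0, 1.0]
    c = [1.0, 0.0]
    x = [1.0, -1.0]
    us, ys = [0.5, -0.3], []
    for u in us:
        ys.append(dot(c, x))
        x = step(A, b, x, u)
    print("true state:         ", x)
    print("reconstructed state:", reconstruct_state(A, b, c, us, ys))
```

A model-free method cannot form this map from known matrices; the sketch only shows why input-output history can stand in for the unmeasured state.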

Main Methods:

A state reconstruction method is designed utilizing expert control and measured output data.
A model-free OPFB DIRL algorithm is presented to iteratively learn the unknown value function and optimal control policy.
Rigorous analysis of algorithm convergence and solution uniqueness is performed.
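
The iterative learning step can be illustrated with classical policy iteration for a discounted LQ problem. This is a stand-in, not the paper's algorithm: it is model-based, uses full-state feedback, and is restricted to a scalar system for readability, whereas the paper's method is model-free and uses output feedback. The function names and the parameters `a`, `b`, `q`, `r`, `gamma`, `k0` are hypothetical.

```python
import math


def discounted_are_solution(a, b, q, r, gamma):
    """Closed-form positive root of the scalar discounted Riccati equation
    2*(a - gamma/2)*p + q - (b*p)**2 / r = 0."""
    a_d = a - gamma / 2.0
    return r * (a_d + math.sqrt(a_d ** 2 + b ** 2 * q / r)) / b ** 2


def discounted_policy_iteration(a, b, q, r, gamma, k0, max_iters=50, tol=1e-12):
    """Alternate policy evaluation and policy improvement for u = -k x.

    k0 must make the discounted closed loop a - gamma/2 - b*k0 negative,
    otherwise the discounted cost of the initial policy is not finite.
    """
    a_d = a - gamma / 2.0  # the discount acts like a shift of the dynamics
    k, p = k0, None
    for _ in range(max_iters):
        a_cl = a_d - b * k
        if a_cl >= 0.0:
            raise ValueError("gain does not give a finite discounted cost")
        # Policy evaluation: solve 2*a_cl*p + q + r*k**2 = 0 for p.
        p_new = (q + r * k * k) / (-2.0 * a_cl)
        # Policy improvement: k = b*p / r.
        k = b * p_new / r
        if p is not None and abs(p_new - p) < tol:
            p = p_new
            break
        p = p_new
    return p, k


if __name__ == "__main__":
    p, k = discounted_policy_iteration(a=1.0, b=1.0, q=1.0, r=1.0,
                                       gamma=0.5, k0=3.0)
    print("policy iteration: p =", p, " k =", k)
    print("closed form:      p =", discounted_are_solution(1.0, 1.0, 1.0, 1.0, 0.5))
```

In an inverse setting, a loop of this kind would presumably be driven by expert data rather than by a known cost and model; those details are not given in the summary.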

Main Results:

The proposed algorithm effectively recovers the expert control policy.
Simulations demonstrate superior computational efficiency compared to state-of-the-art methods.
The algorithm successfully handles partially observable states and unknown value functions.

Conclusions:

The novel OPFB DIRL algorithm provides an effective solution for controlling unknown CT systems with limited state information.
The method enhances the applicability of DIRL in practical scenarios by utilizing only input-output data.
The algorithm offers a computationally efficient and robust approach to learning optimal control policies.