Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Classification of Systems-II

Classification of Systems-II

Continuous-time systems have continuous input and output signals, with time measured continuously. These systems are generally defined by differential or algebraic equations. For instance, in an RC circuit, the relationship between input and output voltage is expressed through a differential equation derived from Ohm's law and the capacitor relation,

Second Order systems II

Second Order systems II

In an underdamped second-order system, where the damping ratio ζ is between 0 and 1, a unit-step input results in a transfer function that, when transformed using the inverse Laplace method, reveals the output response. The output exhibits a damped sinusoidal oscillation, and the difference between the input and output is termed the error signal. This error signal also demonstrates damped oscillatory behavior. Eventually, as the system reaches a steady state, the error diminishes to zero.

Sampling Continuous Time Signal

Sampling Continuous Time Signal

In signal processing, a continuous-time signal can be sampled using an impulse-train sampling technique, followed by the zero-order hold method. Impulse-train sampling involves the use of a periodic impulse train, which consists of a series of delta functions spaced at regular intervals determined by the sampling period. When a continuous-time signal is multiplied by this impulse train, it generates impulses with amplitudes corresponding to the signal's values at the sampling points.
In the...

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

BIBO stability of continuous and discrete -time systems

BIBO stability of continuous and discrete -time systems

System stability is a fundamental concept in signal processing, often assessed using convolution. For a system to be considered bounded-input bounded-output (BIBO) stable, any bounded input signal must produce a bounded output signal. A bounded input signal is one where the modulus does not exceed a certain constant at any point in time.
To determine the BIBO stability, the convolution integral is utilized when a bounded continuous-time input is applied to a Linear Time-Invariant (LTI) system....

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A FRET-Mediated AIE Biosensor Based on Functionalized Peptide-Stabilized Gold Nanoclusters for Sensitive Detection of Matrix Metalloproteinase-2.

Analytical chemistry·2026

Same author

Two cases of proteasome inhibitors related thrombotic microangiopathies and literature review.

Thrombosis journal·2026

Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

Same author

<i>De novo</i> design of NIR-II thioxanthene dye and phosphate-driven charge transfer-coupled <i>J</i>-aggregates for high resolution tumor angiography and type I phototherapy against hypoxic tumors.

Chemical science·2026

Same author

The evolution of China's youth sport policies-a systematic analysis based on national policy texts 2013-2023.

Frontiers in sports and active living·2026

Same author

Combined healthy lifestyles and osteoarthritis among middle-aged and older adults: A cross-sectional study in US adults.

Science progress·2026

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 24, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Inverse Reinforcement Learning for Discrete-Time Systems With Data Dropouts.

Jialu Fan, Pengfei Shi, Wenqian Xue

IEEE Transactions on Cybernetics

|March 4, 2025

Summary

This summary is machine-generated.

This study introduces inverse reinforcement learning (IRL) algorithms to enable control systems to track target systems effectively, even with data loss during wireless transmission. The methods allow systems to learn unknown target behaviors for improved tracking performance.

More Related Videos

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Related Experiment Videos

Last Updated: May 24, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

A Method for Remotely Silencing Neural Activity in Rodents During Discrete Phases of Learning

Published on: June 22, 2015

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Area of Science:

Control Systems Engineering
Machine Learning
Wireless Communication

Background:

Networked control systems (NCS) face challenges with data loss during wireless transmission.
Tracking control requires understanding an unknown target system's behavior, often defined by an unknown cost function.

Purpose of the Study:

To develop inverse reinforcement learning (IRL) algorithms for tracking control of linear NCS with random state dropouts.
To enable a controlled system to infer the unknown cost function and optimal policy of a target system.

Main Methods:

A model-based IRL algorithm integrating a Smith predictor for state estimation was developed.
A state-dropout-aware inverse Q-learning algorithm was proposed, requiring only accessible system data.

Main Results:

The proposed algorithms effectively infer the target's cost function and optimal control policy.
Theoretical validity was rigorously established.
Numerical simulations confirmed the practical effectiveness of the algorithms.

Conclusions:

The developed IRL algorithms provide a robust solution for tracking control in NCS with random state dropouts.
These methods enhance tracking performance by enabling systems to learn and adapt to unknown target dynamics despite communication uncertainties.