Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

PD Controller: Design

PD Controller: Design

In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...

Time-Domain Interpretation of PD Control

Time-Domain Interpretation of PD Control

Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...

PID Controller

PID Controller

Proportional-Integral-Derivative (PID) controllers are widely used in various control systems to enhance stability and performance. In a thermostat, it adjusts heating or cooling based on the temperature difference between the actual and desired levels. They are often used in automotive speed systems, effectively managing sudden speed changes while maintaining a constant speed under varying conditions. On the other hand, PI controllers, commonly employed in voltage regulation, enhance stability...

PI Controller: Design

PI Controller: Design

Proportional Integral (PI) controllers are a fundamental component in modern control systems, widely used to enhance performance and mitigate steady-state errors. They are particularly effective in applications such as automatic brightness adjustment on smartphones, where they excel at mitigating steady-state errors for step-function inputs. Unlike PD controllers, which require time-varying errors to function optimally, PI controllers leverage their integral component to address residual...

Controller Configurations

Controller Configurations

Controller configurations are crucial in a car's cruise control system because they manage speed over time to maintain a consistent pace regardless of road conditions, thereby meeting design goals. In traditional control systems, fixed-configuration design involves predetermined controller placement. System performance modifications are known as compensation.
Control-system compensation involves various configurations, most commonly series or cascade compensation, in which the controller...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Reinforcement Learning-Based Fuzzy Control for Nonlinear Systems With Unknown Dynamics via Parallel Composite Policy Iteration Scheme.

IEEE transactions on cybernetics·2026

Same author

Pan-cancer analysis of TASL: a novel immune infiltration-related biomarker for tumor prognosis and immunotherapy response prediction.

BMC cancer·2023

Same author

Output Consensus of Heterogeneous Linear Multiagent Systems With Directed Graphs via Adaptive Dynamic Event-Triggered Mechanism.

IEEE transactions on cybernetics·2021

Same author

Determination of single-kidney glomerular filtration rate (GFR) with CT urography versus renal dynamic imaging Gates method.

European radiology·2017

Same author

Reliable Output Feedback Control for T-S Fuzzy Systems With Decentralized Event Triggering Communication and Actuator Failures.

IEEE transactions on cybernetics·2017

Same author

Berberine lowers blood glucose in type 2 diabetes mellitus patients through increasing insulin receptor expression.

Metabolism: clinical and experimental·2009

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 14, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Data-Based Predictive Control via Multistep Policy Gradient Reinforcement Learning.

Xindi Yang, Hao Zhang, Zhuping Wang

IEEE Transactions on Cybernetics

|November 9, 2021

Summary

This summary is machine-generated.

This study introduces a model-free predictive control algorithm using reinforcement learning. It enhances real-time system performance by learning from data, eliminating the need for system dynamics knowledge.

More Related Videos

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

Related Experiment Videos

Last Updated: Oct 14, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Pavlovian Conditioned Approach Training in Rats

Pavlovian Conditioned Approach Training in Rats

Published on: February 4, 2016

Area of Science:

Control Systems Engineering
Artificial Intelligence
Machine Learning

Background:

Model-free predictive control (MFPC) offers an alternative to traditional methods by avoiding explicit system models.
Data-driven approaches are increasingly important for real-time systems.
Reinforcement learning (RL) provides a powerful framework for learning optimal control policies.

Purpose of the Study:

To present a novel model-free predictive control algorithm for real-time systems.
To leverage multistep policy gradient reinforcement learning for performance improvement.
To develop a data-driven control strategy that does not require prior knowledge of system dynamics.

Main Methods:

The algorithm utilizes multistep policy gradient reinforcement learning.
Cooperative games model predictive control as multiagent optimization problems.
Neural networks approximate the action-state value function and control policy.
Weighted residual methods determine network weights.

Main Results:

The proposed algorithm effectively improves real-time system performance.
Model-free design is achieved by learning from offline and real-time data.
Optimality of the predictive control policy is guaranteed through the game-theoretic approach.

Conclusions:

The developed model-free predictive control algorithm is effective for real-time systems.
The data-driven, reinforcement learning-based approach eliminates the need for system dynamics knowledge.
The integration of neural networks and cooperative games ensures efficient and optimal control.