Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

PD Controller: Design01:26

PD Controller: Design

385
In automotive engineering, car suspension systems often employ Proportional Derivative (PD) controllers to enhance performance. PD controllers are utilized to adjust the damping force in response to road conditions. A controller, acting as an amplifier with a constant gain, demonstrates proportional control, with output directly mirroring input.
Designing a continuous-data controller requires selecting and linking components like adders and integrators, which are fundamental in Proportional,...
385
Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

200
Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
200
PID Controller01:19

PID Controller

271
Proportional-Integral-Derivative (PID) controllers are widely used in various control systems to enhance stability and performance. In a thermostat, it adjusts heating or cooling based on the temperature difference between the actual and desired levels. They are often used in automotive speed systems, effectively managing sudden speed changes while maintaining a constant speed under varying conditions. On the other hand, PI controllers, commonly employed in voltage regulation, enhance stability...
271
PI Controller: Design01:24

PI Controller: Design

580
Proportional Integral (PI) controllers are a fundamental component in modern control systems, widely used to enhance performance and mitigate steady-state errors. They are particularly effective in applications such as automatic brightness adjustment on smartphones, where they excel at mitigating steady-state errors for step-function inputs. Unlike PD controllers, which require time-varying errors to function optimally, PI controllers leverage their integral component to address residual...
580
Controller Configurations01:22

Controller Configurations

184
Controller configurations are crucial in a car's cruise control system because they manage speed over time to maintain a consistent pace regardless of road conditions, thereby meeting design goals. In traditional control systems, fixed-configuration design involves predetermined controller placement. System performance modifications are known as compensation.
Control-system compensation involves various configurations, most commonly series or cascade compensation, in which the controller...
184
Reinforcement Schedules01:24

Reinforcement Schedules

263
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
263

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reinforcement Learning-Based Fuzzy Control for Nonlinear Systems With Unknown Dynamics via Parallel Composite Policy Iteration Scheme.

IEEE transactions on cybernetics·2026
Same author

Pan-cancer analysis of TASL: a novel immune infiltration-related biomarker for tumor prognosis and immunotherapy response prediction.

BMC cancer·2023
Same author

Output Consensus of Heterogeneous Linear Multiagent Systems With Directed Graphs via Adaptive Dynamic Event-Triggered Mechanism.

IEEE transactions on cybernetics·2021
Same author

Determination of single-kidney glomerular filtration rate (GFR) with CT urography versus renal dynamic imaging Gates method.

European radiology·2017
Same author

Reliable Output Feedback Control for T-S Fuzzy Systems With Decentralized Event Triggering Communication and Actuator Failures.

IEEE transactions on cybernetics·2017
Same author

Berberine lowers blood glucose in type 2 diabetes mellitus patients through increasing insulin receptor expression.

Metabolism: clinical and experimental·2009
Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026
Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026
Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026
Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026
Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026
Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026
See all related articles

Related Experiment Video

Updated: Oct 14, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.1K

Data-Based Predictive Control via Multistep Policy Gradient Reinforcement Learning.

Xindi Yang, Hao Zhang, Zhuping Wang

    IEEE Transactions on Cybernetics
    |November 9, 2021
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a model-free predictive control algorithm using reinforcement learning. It enhances real-time system performance by learning from data, eliminating the need for system dynamics knowledge.

    More Related Videos

    Pavlovian Conditioned Approach Training in Rats
    06:57

    Pavlovian Conditioned Approach Training in Rats

    Published on: February 4, 2016

    11.1K

    Related Experiment Videos

    Last Updated: Oct 14, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.1K
    Pavlovian Conditioned Approach Training in Rats
    06:57

    Pavlovian Conditioned Approach Training in Rats

    Published on: February 4, 2016

    11.1K

    Area of Science:

    • Control Systems Engineering
    • Artificial Intelligence
    • Machine Learning

    Background:

    • Model-free predictive control (MFPC) offers an alternative to traditional methods by avoiding explicit system models.
    • Data-driven approaches are increasingly important for real-time systems.
    • Reinforcement learning (RL) provides a powerful framework for learning optimal control policies.

    Purpose of the Study:

    • To present a novel model-free predictive control algorithm for real-time systems.
    • To leverage multistep policy gradient reinforcement learning for performance improvement.
    • To develop a data-driven control strategy that does not require prior knowledge of system dynamics.

    Main Methods:

    • The algorithm utilizes multistep policy gradient reinforcement learning.
    • Cooperative games model predictive control as multiagent optimization problems.
    • Neural networks approximate the action-state value function and control policy.
    • Weighted residual methods determine network weights.

    Main Results:

    • The proposed algorithm effectively improves real-time system performance.
    • Model-free design is achieved by learning from offline and real-time data.
    • Optimality of the predictive control policy is guaranteed through the game-theoretic approach.

    Conclusions:

    • The developed model-free predictive control algorithm is effective for real-time systems.
    • The data-driven, reinforcement learning-based approach eliminates the need for system dynamics knowledge.
    • The integration of neural networks and cooperative games ensures efficient and optimal control.