Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

105
Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
105
Control Systems01:10

Control Systems

1.1K
Control systems are everywhere in contemporary society, influencing diverse applications from aerospace to automated manufacturing. These systems can be found naturally within biological processes, such as blood sugar regulation and heart rate adjustment in response to stress, as well as in man-made systems like elevators and automated vehicles. A control system is essentially a network of subsystems and processes that collaboratively convert specific inputs into desired outputs.
At the heart...
1.1K
Controller Configurations01:22

Controller Configurations

96
Controller configurations are crucial in a car's cruise control system because they manage speed over time to maintain a consistent pace regardless of road conditions, thereby meeting design goals. In traditional control systems, fixed-configuration design involves predetermined controller placement. System performance modifications are known as compensation.
Control-system compensation involves various configurations, most commonly series or cascade compensation, in which the controller...
96
Feedback control systems01:26

Feedback control systems

307
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
307
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

53
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
53
Open and closed-loop control systems01:17

Open and closed-loop control systems

737
Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...
737

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Lung-Protective Ventilation Strategies for Relief from Ventilator-Associated Lung Injury in Patients Undergoing Craniotomy: A Bicenter Randomized, Parallel, and Controlled Trial.

Oxidative medicine and cellular longevity·2017
Same author

Electrochemical Oxidation of EDTA in Nuclear Wastewater Using Platinum Supported on Activated Carbon Fibers.

International journal of environmental research and public health·2017
Same author

Novel biomimetic enzyme for sensitive detection of superoxide anions.

Talanta·2017
Same author

The Expression of Formyl Peptide Receptor 1 is Correlated with Tumor Invasion of Human Colorectal Cancer.

Scientific reports·2017
Same author

An excited state underlies gene regulation of a transcriptional riboswitch.

Nature chemical biology·2017
Same author

(-)-Epigallocatechin-3-gallate attenuates myocardial injury induced by ischemia/reperfusion in diabetic rats and in H9c2 cells under hyperglycemic conditions.

International journal of molecular medicine·2017
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Jun 29, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.0K

Optimal Learning Output Tracking Control: A Model-Free Policy Optimization Method With Convergence Analysis.

Mingduo Lin, Bo Zhao, Derong Liu

    IEEE Transactions on Neural Networks and Learning Systems
    |March 26, 2024
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new model-free policy optimization algorithm for optimal learning output tracking control in unknown linear systems. The method directly improves the control policy using gradient-based techniques, achieving effective tracking control.

    More Related Videos

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.4K
    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    1.6K

    Related Experiment Videos

    Last Updated: Jun 29, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.0K
    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
    11:54

    Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

    Published on: May 8, 2021

    4.4K
    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    1.6K

    Area of Science:

    • Intelligent Control
    • Reinforcement Learning
    • Control Systems

    Background:

    • Model-free optimal learning output tracking control (OLOTC) is gaining traction in intelligent control and reinforcement learning (RL).
    • While off-policy learning and Q-learning have enabled model-free tracking control, direct policy learning (PL) methods remain underexplored despite their ease of implementation.

    Purpose of the Study:

    • To develop a novel model-free policy optimization (PO) algorithm for achieving OLOTC in unknown linear discrete-time (DT) systems.
    • To address the gap in utilizing direct policy learning for model-free tracking control.

    Main Methods:

    • An iterative control policy is parameterized to directly enhance the augmented system's discounted value function using a gradient-based approach.
    • A model-free two-point policy gradient (PG) algorithm is proposed to estimate the discounted value function's gradient from sampled states and reference trajectories.

    Main Results:

    • The proposed model-free PO algorithm demonstrates global convergence to the optimal value function under sufficient samples and appropriate conditions.
    • Numerical simulations validate the effectiveness of the developed model-free PO algorithm for OLOTC.

    Conclusions:

    • The study successfully presents a novel model-free PO algorithm for OLOTC in unknown linear DT systems.
    • The findings highlight the potential of direct policy learning approaches in model-free reinforcement learning for control applications.