Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Feedback control systems01:26

Feedback control systems

564
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
564
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

180
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
180
Open and closed-loop control systems01:17

Open and closed-loop control systems

1.3K
Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...
1.3K
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

208
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
208
Control Systems01:10

Control Systems

1.6K
Control systems are everywhere in contemporary society, influencing diverse applications from aerospace to automated manufacturing. These systems can be found naturally within biological processes, such as blood sugar regulation and heart rate adjustment in response to stress, as well as in man-made systems like elevators and automated vehicles. A control system is essentially a network of subsystems and processes that collaboratively convert specific inputs into desired outputs.
At the heart...
1.6K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

267
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
267

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Engine on/off management with consideration of thermal dynamics for HEVs.

Heliyon·2025
Same author

Traffic data imputation <i>via</i> knowledge graph-enhanced generative adversarial network.

PeerJ. Computer science·2024
Same author

Publisher Correction: Patients with Moderate-to-Severe Atopic Dermatitis Maintain Stable Response with No or Minimal Fluctuations with 1 Year of Lebrikizumab Treatment.

Dermatology and therapy·2024
Same author

Asymmetric Counteranion-Directed Electrocatalysis for Enantioselective Control of Radical Cation.

Angewandte Chemie (International ed. in English)·2024
Same author

Patients with Moderate-to-Severe Atopic Dermatitis Maintain Stable Response with No or Minimal Fluctuations with 1 Year of Lebrikizumab Treatment.

Dermatology and therapy·2024
Same author

A Novel BCC-Structure Zr-Nb-Ti Medium-Entropy Alloys (MEAs) with Excellent Structure and Irradiation Resistance.

Materials (Basel, Switzerland)·2022
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Nov 24, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.2K

Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems.

Zhenhui Xu, Tielong Shen, Daizhan Cheng

    IEEE Transactions on Neural Networks and Learning Systems
    |December 21, 2020
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a new integral reinforcement learning (IRL) algorithm to solve optimal control for nonlinear systems. The method uses an auxiliary trajectory to manage probing noise, ensuring stable learning and system performance.

    More Related Videos

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    2.0K
    The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
    11:53

    The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

    Published on: October 14, 2017

    12.0K

    Related Experiment Videos

    Last Updated: Nov 24, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.2K
    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    2.0K
    The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
    11:53

    The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

    Published on: October 14, 2017

    12.0K

    Area of Science:

    • Control Systems Engineering
    • Machine Learning
    • Nonlinear Dynamics

    Background:

    • Optimal control for continuous-time nonlinear systems with unknown dynamics presents significant challenges.
    • Externally added probing noise can induce oscillations, complicating the learning process in reinforcement learning algorithms.

    Purpose of the Study:

    • To propose a novel integral reinforcement learning (IRL) algorithm for optimal control of continuous-time nonlinear systems.
    • To address the challenge of rejecting oscillations caused by probing noise during the learning phase.

    Main Methods:

    • An auxiliary trajectory is embedded as an exciting signal to decompose the state trajectory.
    • A model-free policy iteration (PI) algorithm is developed using decoupled trajectories, alternating policy evaluation and improvement.
    • An external input is introduced in the policy improvement step to bypass the need for input-to-state dynamics.

    Main Results:

    • The algorithm is implemented using an actor-critic structure with sequential updates of critic and actor neural network weights via least-squares methods.
    • Convergence of the IRL algorithm and stability of the closed-loop system are mathematically guaranteed.
    • Effectiveness is demonstrated through two illustrative examples.

    Conclusions:

    • The proposed IRL algorithm effectively solves the optimal control problem for continuous-time nonlinear systems with unknown dynamics.
    • The novel approach successfully rejects oscillations from probing noise, ensuring stable learning and system performance.
    • The method offers a robust solution for complex control challenges in dynamic systems.