Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Open and closed-loop control systems01:17

Open and closed-loop control systems

1.9K
Control systems are foundational elements in automation and engineering. They are broadly categorized into open-loop and closed-loop systems. These classifications hinge on the presence or absence of feedback mechanisms, significantly influencing the system's performance, complexity, and application.
An open-loop control system operates without feedback from the output. It consists of two primary elements: the controller and the controlled process. The controller receives an input signal...
1.9K
Observational Learning01:12

Observational Learning

1.2K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.2K
Feedback control systems01:26

Feedback control systems

767
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
767
Control Systems01:10

Control Systems

2.0K
Control systems are everywhere in contemporary society, influencing diverse applications from aerospace to automated manufacturing. These systems can be found naturally within biological processes, such as blood sugar regulation and heart rate adjustment in response to stress, as well as in man-made systems like elevators and automated vehicles. A control system is essentially a network of subsystems and processes that collaboratively convert specific inputs into desired outputs.
At the heart...
2.0K
Root-Locus Method01:19

Root-Locus Method

550
A cruise control system in a car is designed to maintain a specified speed automatically by adjusting the gas pedal. The system continuously measures the vehicle's speed and makes fine adjustments to the pedal to achieve this goal. The root locus method is particularly useful for understanding how the cruise control system's behavior changes under varying conditions, such as when the car goes uphill, downhill, or faces strong wind resistance.
This system can be represented by a block...
550
Relative Motion Analysis using Rotating Axes-Problem Solving01:29

Relative Motion Analysis using Rotating Axes-Problem Solving

817
Consider a crane whose telescopic boom rotates with an angular velocity of 0.04 rad/s and angular acceleration of 0.02 rad/s2. Along with the rotation, the boom also extends linearly with a uniform speed of 5 m/s. The extension of the boom is measured at point D, which is measured with respect to the fixed point C on the other end of the boom. For the given instant, the distance between points C and D is 60 meters.
Here, in order to determine the magnitude of velocity and acceleration for point...
817

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cross-sector deep learning scales life cycle assessment using unified textual descriptions.

Environmental science and ecotechnology·2026
Same author

[Retracted] Acaricidal activity of extracts from <i>Ligularia virgaurea</i> against the <i>Sarcoptes scabiei</i> mite <i>in vitro</i>.

Experimental and therapeutic medicine·2026
Same author

Recent Advances on Off-Policy Reinforcement Learning for Optimization Control.

IEEE transactions on cybernetics·2026
Same author

Optimal cooperative output regulation with norm-based performance specifications.

ISA transactions·2026
Same author

DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same author

Time-Varying HJBE-Based Adaptive Safe Critic Control Design for Stochastic Asymmetric Constrained Multiagent Systems.

IEEE transactions on cybernetics·2026
Same journal

Granular Ball-Based Noise-Resistant Fuzzy Multineighborhood Feature Selection via Label Enhancement and Feature Graph.

IEEE transactions on neural networks and learning systems·2026
Same journal

Fighting Evolving Spam With ARTMAP Models: A Noise-Resilient Online Detection Framework.

IEEE transactions on neural networks and learning systems·2026
Same journal

HyperSAT: Unsupervised Hypergraph Neural Networks for Weighted MaxSAT Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

Negation of Basic Belief Assignment in Multisource Information Fusion on Dempster-Shafer Theory With Applications in Pattern Classification.

IEEE transactions on neural networks and learning systems·2026
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Mar 17, 2026

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.5K

Model-Free Optimal Tracking Control via Critic-Only Q-Learning.

Biao Luo, Derong Liu, Tingwen Huang

    IEEE Transactions on Neural Networks and Learning Systems
    |July 15, 2016
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a critic-only Q-learning (CoQL) method for model-free optimal tracking control in nonaffine nonlinear discrete-time systems. The CoQL approach effectively learns optimal control policies from data, simplifying implementation and enhancing exploration.

    More Related Videos

    Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut
    08:32

    Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

    Published on: June 15, 2020

    13.6K

    Related Experiment Videos

    Last Updated: Mar 17, 2026

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.5K
    Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut
    08:32

    Tracking Rats in Operant Conditioning Chambers Using a Versatile Homemade Video Camera and DeepLabCut

    Published on: June 15, 2020

    13.6K

    Area of Science:

    • Control Theory
    • Machine Learning
    • Nonlinear Systems

    Background:

    • Model-free control is crucial for systems where dynamics are unknown.
    • Optimal tracking control for nonaffine nonlinear discrete-time systems presents significant challenges.
    • Existing methods often require complex system models or solving intricate equations.

    Purpose of the Study:

    • To develop a novel model-free optimal tracking control method for nonaffine nonlinear discrete-time systems.
    • To introduce a critic-only Q-learning (CoQL) approach that avoids solving the tracking Hamilton-Jacobi-Bellman equation.
    • To ensure the convergence and effectiveness of the proposed CoQL method, even with neural network approximation errors.

    Main Methods:

    • Development of a critic-only Q-learning (CoQL) algorithm using a single neural network for Q-function approximation.
    • Establishment of Q-learning algorithm convergence based on an augmented system.
    • Proof of CoQL method convergence considering neural network approximation errors.
    • Design of adaptive optimal tracking control using a gradient descent scheme based on the learned Q-function.

    Main Results:

    • The proposed CoQL method successfully learns optimal tracking control policies from real system data.
    • Convergence of the Q-learning algorithm and the CoQL method was theoretically established.
    • Simulation studies demonstrated the effectiveness of the CoQL method in achieving optimal tracking control.
    • The CoQL method was shown to be easy to implement and overcome exploration issues.

    Conclusions:

    • The critic-only Q-learning (CoQL) method provides an effective solution for model-free optimal tracking control of nonaffine nonlinear discrete-time systems.
    • The CoQL approach simplifies implementation by using a critic-only structure and off-policy learning.
    • The method addresses the challenge of inadequate exploration, making it a promising technique for real-world applications.