Model-Free Optimal Tracking Control via Critic-Only Q-Learning.

Biao Luo, Derong Liu, Tingwen Huang

IEEE Transactions on Neural Networks and Learning Systems

July 15, 2016

Summary

This summary is machine-generated.

This study introduces a critic-only Q-learning (CoQL) method for model-free optimal tracking control in nonaffine nonlinear discrete-time systems. The CoQL approach effectively learns optimal control policies from data, simplifying implementation and enhancing exploration.

Area of Science:

Control Theory
Machine Learning
Nonlinear Systems

Background:

Model-free control is crucial for systems where dynamics are unknown.
Optimal tracking control for nonaffine nonlinear discrete-time systems presents significant challenges.
Existing methods often require complex system models or solving intricate equations.

Purpose of the Study:

To develop a novel model-free optimal tracking control method for nonaffine nonlinear discrete-time systems.
To introduce a critic-only Q-learning (CoQL) approach that avoids solving the tracking Hamilton-Jacobi-Bellman equation.
To ensure the convergence and effectiveness of the proposed CoQL method, even with neural network approximation errors.
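For orientation, the tracking problem can be written as a Q-function Bellman equation on an augmented state. The block below is a generic textbook-style sketch, not the paper's own formulation: the symbols $e_k$, $r_k$, $z_k$, $U$, and $\gamma$ are assumptions introduced here, and the paper's exact cost and notation may differ.

```latex
% Assumed notation: x_k is the system state, r_k the reference, u_k the control.
\begin{aligned}
e_k &= x_k - r_k, \qquad z_k = \begin{bmatrix} e_k \\ r_k \end{bmatrix}
  && \text{(tracking error and augmented state)} \\
Q^{*}(z_k, u_k) &= U(z_k, u_k) + \gamma \min_{u} Q^{*}(z_{k+1}, u)
  && \text{(Bellman optimality equation for } Q^{*}\text{)} \\
u^{*}(z_k) &= \arg\min_{u} Q^{*}(z_k, u)
  && \text{(optimal tracking control)}
\end{aligned}
```

Here $U(z_k, u_k) \ge 0$ is a stage cost and $\gamma \in (0, 1]$ a discount factor. A Q-learning method approximates $Q^{*}$ from measured transitions $(z_k, u_k, z_{k+1})$, which is why no explicit model of the dynamics is needed.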

Main Methods:

Development of a critic-only Q-learning (CoQL) algorithm using a single neural network for Q-function approximation.
Establishment of Q-learning algorithm convergence based on an augmented system.
Proof of CoQL method convergence considering neural network approximation errors.
Design of adaptive optimal tracking control using a gradient descent scheme based on the learned Q-function.
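The steps above can be illustrated with a minimal sketch. It is not the authors' implementation: to stay short and checkable it swaps the neural network critic for a linear-in-weights quadratic critic fitted by least squares, and it uses a toy scalar linear plant with a constant reference rather than a nonaffine nonlinear system. The names `plant`, `features`, `greedy_u`, `train_critic`, and `rollout`, as well as the cost weights and discount factor, are all hypothetical choices made for this example.

```python
import numpy as np

GAMMA = 0.95  # discount factor (assumed for this sketch)
R_U = 0.1     # control-effort weight in the stage cost (assumed)

def plant(x, u):
    """Toy plant used only to generate data; the learner never reads this model."""
    return x + 0.5 * u

def features(e, r, u):
    """Quadratic features of the augmented state (e, r) and the control u."""
    return np.stack([e * e, e * r, e * u, r * r, r * u, u * u], axis=-1)

def q_value(w, e, r, u):
    """Critic output Q(e, r, u) for weight vector w."""
    return features(e, r, u) @ w

def greedy_u(w, e, r, steps=100, lr=0.5):
    """Approximate argmin_u Q(e, r, u) by gradient descent on u.

    The gradient is analytic for quadratic features; a neural network
    critic would use automatic differentiation instead.
    """
    u = np.zeros_like(e, dtype=float)
    for _ in range(steps):
        dq_du = w[2] * e + w[4] * r + 2.0 * w[5] * u
        u = u - lr * dq_du
    return u

def train_critic(n_samples=500, n_iters=60, seed=0):
    """Off-policy Q-iteration on a fixed batch collected with random controls."""
    rng = np.random.default_rng(seed)
    e = rng.uniform(-1.0, 1.0, n_samples)   # tracking error x - r
    r = rng.uniform(-1.0, 1.0, n_samples)   # reference value
    u = rng.uniform(-2.0, 2.0, n_samples)   # exploratory (behavior) control
    e_next = plant(e + r, u) - r            # reference is held constant
    cost = e**2 + R_U * u**2                # stage cost
    phi = features(e, r, u)
    w = np.zeros(phi.shape[1])
    for _ in range(n_iters):
        u_next = greedy_u(w, e_next, r)     # minimizing control at next state
        target = cost + GAMMA * q_value(w, e_next, r, u_next)
        w, *_ = np.linalg.lstsq(phi, target, rcond=None)
    return w

def rollout(w, x0=0.0, ref=1.0, n_steps=20):
    """Apply the control obtained from the learned critic; return tracking errors."""
    x, errors = x0, []
    for _ in range(n_steps):
        e = np.array(x - ref)
        errors.append(float(e))
        u = greedy_u(w, e, np.array(ref))
        x = plant(x, float(u))
    errors.append(x - ref)
    return errors

w = train_critic()
errors = rollout(w)
print("initial error:", errors[0], "final error:", errors[-1])
```

Only the critic is learned; the control is recovered from it by descending the gradient of Q with respect to u, so no separate actor is trained. Because the batch is generated with random controls, the learning is off-policy: the policy being evaluated never has to be run on the plant during training.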

Main Results:

The proposed CoQL method successfully learns optimal tracking control policies from real system data.
Convergence of the Q-learning algorithm and the CoQL method was theoretically established.
Simulation studies demonstrated the effectiveness of the CoQL method in achieving optimal tracking control.
The CoQL method was shown to be easy to implement and to overcome the problem of inadequate exploration.

Conclusions:

The critic-only Q-learning (CoQL) method provides an effective solution for model-free optimal tracking control of nonaffine nonlinear discrete-time systems.
The CoQL approach simplifies implementation by using a critic-only structure and off-policy learning.
The method addresses the challenge of inadequate exploration, making it a promising technique for real-world applications.