Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Differential Equations: Problem Solving

Differential Equations: Problem Solving

When analyzing the motion of falling objects, it is essential to consider not only the force of gravity but also the opposing force of air resistance. A practical example involves releasing a heavy test weight during a safety check on a ship. As the weight falls from rest, gravity accelerates it downward while air resistance exerts an upward force that increases with velocity. This dynamic interplay of forces is well described by differential equations, which provide a mathematical framework...

Stability of Equilibrium Configuration: Problem Solving

Stability of Equilibrium Configuration: Problem Solving

The stability of equilibrium configurations is an important concept in physics, engineering, and other related fields. In simple terms, it refers to the tendency of an object or system to return to its equilibrium position after being disturbed. The stability of an equilibrium configuration can be analyzed by considering the potential energy function of the system and examining its behavior near the equilibrium point.
Problem-solving in the context of the stability of equilibrium configuration...

Kinematic Equations: Problem Solving

Kinematic Equations: Problem Solving

When analyzing one-dimensional motion with constant acceleration, the problem-solving strategy involves identifying the known quantities and choosing the appropriate kinematic equations to solve for the unknowns. Either one or two kinematic equations are needed to solve for the unknowns, depending on the known and unknown quantities. Generally, the number of equations required is the same as the number of unknown quantities in the given example. Two-body pursuit problems always require two...

Bernoulli's Equation: Problem Solving

Bernoulli's Equation: Problem Solving

A Venturi meter is essential for measuring fluid flow rates in pipelines. It utilizes the relationship between fluid velocity and pressure described by Bernoulli's equation. When installed in a sewage system, the Venturi meter accurately determines the wastewater flow rate by measuring pressure differences.
The first step is to compute the cross-sectional areas of the pipe and the Venturi throat to analyze the pressure difference indicated by the pressure gauge. Next, the continuity equation is...

Equation of Motion: General Plane motion - Problem Solving

Equation of Motion: General Plane motion - Problem Solving

Consider a lawn roller with a mass of 100 kg, a radius of 0.2 meters, and a radius of gyration of 0.15 meters. A force of 200 N is applied to this roller, angled at 60 degrees from the horizontal plane. What will be the angular acceleration of the lawn roller?
The friction between the roller and the ground is characterized by two coefficients. The static friction coefficient is 0.15, while the kinetic friction coefficient is 0.1. These values are crucial in understanding the interaction between...

Gaussian Elimination: Problem Solving

Gaussian Elimination: Problem Solving

Systems of linear equations in several variables are pivotal in modeling complex scenarios involving multiple unknowns and constraints. Such systems are widely used in various fields to represent relationships where several conditions must be simultaneously satisfied. Each variable in the system corresponds to an unknown quantity, while each equation imposes a linear constraint, leading to a structured approach for analyzing and solving real-world problems.A system of three equations with three...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Cross-sector deep learning scales life cycle assessment using unified textual descriptions.

Environmental science and ecotechnology·2026

Same author

[Retracted] Acaricidal activity of extracts from <i>Ligularia virgaurea</i> against the <i>Sarcoptes scabiei</i> mite <i>in vitro</i>.

Experimental and therapeutic medicine·2026

Same author

Approximate Optimal Control for Morphing Aircraft via Attention Meta-Learning and Continual Learning.

IEEE transactions on neural networks and learning systems·2026

Same author

Recent Advances on Off-Policy Reinforcement Learning for Optimization Control.

IEEE transactions on cybernetics·2026

Same author

Optimal cooperative output regulation with norm-based performance specifications.

ISA transactions·2026

Same author

DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026

Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 4, 2026

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Reinforcement learning solution for HJB equation arising in constrained optimal control problem.

Biao Luo¹, Huai-Ning Wu², Tingwen Huang³

¹The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.

Neural Networks : the Official Journal of the International Neural Network Society

|September 11, 2015

Summary

This summary is machine-generated.

This study introduces a data-based reinforcement learning (RL) method to solve complex Hamilton-Jacobi-Bellman equations (HJBE) for optimal control. The approach uses off-policy RL to learn from real system data, overcoming exploration challenges.

Keywords:

Constrained optimal control Data-based Hamilton–Jacobi–Bellman equation Off-policy reinforcement learning The method of weighted residuals

More Related Videos

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Published on: August 15, 2016

Related Experiment Videos

Last Updated: Apr 4, 2026

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Published on: August 15, 2016

Area of Science:

Control Theory
Machine Learning
Applied Mathematics

Background:

Constrained optimal control problems necessitate solving the complex Hamilton-Jacobi-Bellman equation (HJBE).
Traditional methods often face challenges with data efficiency and exploration.

Purpose of the Study:

To propose a novel data-based, off-policy reinforcement learning (RL) method for solving HJBE and deriving optimal control policies.
To address the insufficient exploration problem inherent in RL through off-policy learning.

Main Methods:

Utilizing an off-policy reinforcement learning (RL) framework to learn from real system data.
Employing an actor-critic neural network architecture with linearly independent basis functions for function approximation.
Proving convergence by establishing equivalence to the successive approximation approach.

Main Results:

Demonstrated the convergence of the proposed off-policy RL method.
Proved the convergence of the implementation procedure incorporating function approximation.
Verified the effectiveness of the method through computer simulations.

Conclusions:

The proposed data-based off-policy RL method effectively learns HJBE solutions and optimal control policies from real system data.
The method successfully addresses exploration limitations and shows convergence guarantees.
Simulation results confirm the practical applicability and effectiveness of the approach.