Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Differential Equations: Problem Solving01:21

Differential Equations: Problem Solving

192
When analyzing the motion of falling objects, it is essential to consider not only the force of gravity but also the opposing force of air resistance. A practical example involves releasing a heavy test weight during a safety check on a ship. As the weight falls from rest, gravity accelerates it downward while air resistance exerts an upward force that increases with velocity. This dynamic interplay of forces is well described by differential equations, which provide a mathematical framework...
192
Stability of Equilibrium Configuration: Problem Solving01:13

Stability of Equilibrium Configuration: Problem Solving

1.2K
The stability of equilibrium configurations is an important concept in physics, engineering, and other related fields. In simple terms, it refers to the tendency of an object or system to return to its equilibrium position after being disturbed. The stability of an equilibrium configuration can be analyzed by considering the potential energy function of the system and examining its behavior near the equilibrium point.
Problem-solving in the context of the stability of equilibrium configuration...
1.2K
Kinematic Equations: Problem Solving01:15

Kinematic Equations: Problem Solving

29.7K
When analyzing one-dimensional motion with constant acceleration, the problem-solving strategy involves identifying the known quantities and choosing the appropriate kinematic equations to solve for the unknowns. Either one or two kinematic equations are needed to solve for the unknowns, depending on the known and unknown quantities. Generally, the number of equations required is the same as the number of unknown quantities in the given example. Two-body pursuit problems always require two...
29.7K
Bernoulli's Equation: Problem Solving01:16

Bernoulli's Equation: Problem Solving

2.1K
A Venturi meter is essential for measuring fluid flow rates in pipelines. It utilizes the relationship between fluid velocity and pressure described by Bernoulli's equation. When installed in a sewage system, the Venturi meter accurately determines the wastewater flow rate by measuring pressure differences.
The first step is to compute the cross-sectional areas of the pipe and the Venturi throat to analyze the pressure difference indicated by the pressure gauge. Next, the continuity equation is...
2.1K
Equation of Motion: General Plane motion - Problem Solving01:16

Equation of Motion: General Plane motion - Problem Solving

593
Consider a lawn roller with a mass of 100 kg, a radius of 0.2 meters, and a radius of gyration of 0.15 meters. A force of 200 N is applied to this roller, angled at 60 degrees from the horizontal plane. What will be the angular acceleration of the lawn roller?
The friction between the roller and the ground is characterized by two coefficients. The static friction coefficient is 0.15, while the kinetic friction coefficient is 0.1. These values are crucial in understanding the interaction between...
593
Gaussian Elimination: Problem Solving01:30

Gaussian Elimination: Problem Solving

281
Systems of linear equations in several variables are pivotal in modeling complex scenarios involving multiple unknowns and constraints. Such systems are widely used in various fields to represent relationships where several conditions must be simultaneously satisfied. Each variable in the system corresponds to an unknown quantity, while each equation imposes a linear constraint, leading to a structured approach for analyzing and solving real-world problems.A system of three equations with three...
281

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Cross-sector deep learning scales life cycle assessment using unified textual descriptions.

Environmental science and ecotechnology·2026
Same author

[Retracted] Acaricidal activity of extracts from <i>Ligularia virgaurea</i> against the <i>Sarcoptes scabiei</i> mite <i>in vitro</i>.

Experimental and therapeutic medicine·2026
Same author

Approximate Optimal Control for Morphing Aircraft via Attention Meta-Learning and Continual Learning.

IEEE transactions on neural networks and learning systems·2026
Same author

Recent Advances on Off-Policy Reinforcement Learning for Optimization Control.

IEEE transactions on cybernetics·2026
Same author

Optimal cooperative output regulation with norm-based performance specifications.

ISA transactions·2026
Same author

DACESR: Degradation-Aware Conditional Embedding for Real-World Image Super-Resolution.

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society·2026
Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Apr 4, 2026

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.5K

Reinforcement learning solution for HJB equation arising in constrained optimal control problem.

Biao Luo1, Huai-Ning Wu2, Tingwen Huang3

  • 1The State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China.

Neural Networks : the Official Journal of the International Neural Network Society
|September 11, 2015
PubMed
Summary
This summary is machine-generated.

This study introduces a data-based reinforcement learning (RL) method to solve complex Hamilton-Jacobi-Bellman equations (HJBE) for optimal control. The approach uses off-policy RL to learn from real system data, overcoming exploration challenges.

Keywords:
Constrained optimal controlData-basedHamilton–Jacobi–Bellman equationOff-policy reinforcement learningThe method of weighted residuals

More Related Videos

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
06:45

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

2.2K
Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms
10:32

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Published on: August 15, 2016

16.2K

Related Experiment Videos

Last Updated: Apr 4, 2026

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.5K
Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
06:45

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

2.2K
Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms
10:32

Robotic Mirror Therapy System for Functional Recovery of Hemiplegic Arms

Published on: August 15, 2016

16.2K

Area of Science:

  • Control Theory
  • Machine Learning
  • Applied Mathematics

Background:

  • Constrained optimal control problems necessitate solving the complex Hamilton-Jacobi-Bellman equation (HJBE).
  • Traditional methods often face challenges with data efficiency and exploration.

Purpose of the Study:

  • To propose a novel data-based, off-policy reinforcement learning (RL) method for solving HJBE and deriving optimal control policies.
  • To address the insufficient exploration problem inherent in RL through off-policy learning.

Main Methods:

  • Utilizing an off-policy reinforcement learning (RL) framework to learn from real system data.
  • Employing an actor-critic neural network architecture with linearly independent basis functions for function approximation.
  • Proving convergence by establishing equivalence to the successive approximation approach.

Main Results:

  • Demonstrated the convergence of the proposed off-policy RL method.
  • Proved the convergence of the implementation procedure incorporating function approximation.
  • Verified the effectiveness of the method through computer simulations.

Conclusions:

  • The proposed data-based off-policy RL method effectively learns HJBE solutions and optimal control policies from real system data.
  • The method successfully addresses exploration limitations and shows convergence guarantees.
  • Simulation results confirm the practical applicability and effectiveness of the approach.