Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Linearization and Approximation

Linearization and Approximation

Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...

Application of Linearization and Approximation

Application of Linearization and Approximation

A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...

Approximate Integration

Approximate Integration

In many practical and theoretical contexts, the exact value of a definite integral may be inaccessible. This limitation typically arises when the antiderivative of a function is either unknown or cannot be expressed in a closed mathematical form. Alternatively, it can occur when a function is defined not by a formula but by a finite set of empirical data points, such as those collected during experiments. In these cases, approximate integration techniques provide a valuable solution.One of the...

Time-Domain Interpretation of PD Control

Time-Domain Interpretation of PD Control

Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Effect of Medicaid coverage of tobacco-dependence treatments on smoking cessation.

International journal of environmental research and public health·2010

Same author

Cytokine and autoantibody patterns in acute liver failure.

Journal of immunotoxicology·2009

Same author

A novel scoring system for prognostic prediction in d-galactosamine/lipopolysaccharide-induced fulminant hepatic failure BALB/c mice.

BMC gastroenterology·2009

Same author

Mammalian target of rapamycin signaling pathway contributes to glioma progression and patients' prognosis.

The Journal of surgical research·2009

Same author

Estrogen receptor neurobiology and its potential for translation into broad spectrum therapeutics for CNS disorders.

Current molecular pharmacology·2009

Same author

Transcriptional and post-translational regulation of adiponectin.

The Biochemical journal·2009

Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026

Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 28, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems.

Wentao Guo, Jennie Si, Feng Liu

IEEE Transactions on Neural Networks and Learning Systems

|June 11, 2017

Summary

This summary is machine-generated.

Policy iteration approximate dynamic programming (DP) ensures control policy stability and value function boundedness for nonlinear systems. This research introduces a new condition for value function convergence, enhancing practical applications.

Related Experiment Videos

Last Updated: Feb 28, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Area of Science:

Control Theory
Optimization
Machine Learning

Background:

Policy iteration approximate dynamic programming (DP) is crucial for optimal control.
Challenges exist in policy approximation for discrete-time nonlinear systems with infinite-horizon undiscounted value functions.

Purpose of the Study:

To address policy approximation errors in policy iteration DP for nonlinear systems.
To demonstrate asymptotic stability of the control policy and boundedness of the value function.
To introduce a novel sufficient condition for value function convergence.

Main Methods:

Analysis of policy approximation error in DP.
Demonstration of asymptotic stability and value function boundedness.
Development of a new convergence condition for value functions.
Application of Volterra series for practical policy implementation.

Main Results:

Asymptotic stability of the control policy is proven.
Boundedness of the value function is shown during policy iteration.
A new sufficient condition for value function convergence to a bounded neighborhood of the optimal is introduced.
Effectiveness illustrated with examples, including hydrogenerator excitation control.

Conclusions:

The proposed methods enhance the stability and convergence properties of policy iteration DP for nonlinear systems.
Volterra series offer a practical approach for implementing approximate policies.
The findings have implications for optimal control problems in various engineering applications.