Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

386
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
386
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

407
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
407
Linearization and Approximation01:26

Linearization and Approximation

110
Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...
110
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

121
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
121
Approximate Integration01:24

Approximate Integration

76
In many practical and theoretical contexts, the exact value of a definite integral may be inaccessible. This limitation typically arises when the antiderivative of a function is either unknown or cannot be expressed in a closed mathematical form. Alternatively, it can occur when a function is defined not by a formula but by a finite set of empirical data points, such as those collected during experiments. In these cases, approximate integration techniques provide a valuable solution.One of the...
76
Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

421
Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
421

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Effect of Medicaid coverage of tobacco-dependence treatments on smoking cessation.

International journal of environmental research and public health·2010
Same author

Cytokine and autoantibody patterns in acute liver failure.

Journal of immunotoxicology·2009
Same author

A novel scoring system for prognostic prediction in d-galactosamine/lipopolysaccharide-induced fulminant hepatic failure BALB/c mice.

BMC gastroenterology·2009
Same author

Mammalian target of rapamycin signaling pathway contributes to glioma progression and patients' prognosis.

The Journal of surgical research·2009
Same author

Estrogen receptor neurobiology and its potential for translation into broad spectrum therapeutics for CNS disorders.

Current molecular pharmacology·2009
Same author

Transcriptional and post-translational regulation of adiponectin.

The Biochemical journal·2009
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Feb 28, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
06:45

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

2.2K

Policy Approximation in Policy Iteration Approximate Dynamic Programming for Discrete-Time Nonlinear Systems.

Wentao Guo, Jennie Si, Feng Liu

    IEEE Transactions on Neural Networks and Learning Systems
    |June 11, 2017
    PubMed
    Summary
    This summary is machine-generated.

    Policy iteration approximate dynamic programming (DP) ensures control policy stability and value function boundedness for nonlinear systems. This research introduces a new condition for value function convergence, enhancing practical applications.

    Related Experiment Videos

    Last Updated: Feb 28, 2026

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    2.2K

    Area of Science:

    • Control Theory
    • Optimization
    • Machine Learning

    Background:

    • Policy iteration approximate dynamic programming (DP) is crucial for optimal control.
    • Challenges exist in policy approximation for discrete-time nonlinear systems with infinite-horizon undiscounted value functions.

    Purpose of the Study:

    • To address policy approximation errors in policy iteration DP for nonlinear systems.
    • To demonstrate asymptotic stability of the control policy and boundedness of the value function.
    • To introduce a novel sufficient condition for value function convergence.

    Main Methods:

    • Analysis of policy approximation error in DP.
    • Demonstration of asymptotic stability and value function boundedness.
    • Development of a new convergence condition for value functions.
    • Application of Volterra series for practical policy implementation.

    Main Results:

    • Asymptotic stability of the control policy is proven.
    • Boundedness of the value function is shown during policy iteration.
    • A new sufficient condition for value function convergence to a bounded neighborhood of the optimal is introduced.
    • Effectiveness illustrated with examples, including hydrogenerator excitation control.

    Conclusions:

    • The proposed methods enhance the stability and convergence properties of policy iteration DP for nonlinear systems.
    • Volterra series offer a practical approach for implementing approximate policies.
    • The findings have implications for optimal control problems in various engineering applications.