Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Linearization and Approximation01:26

Linearization and Approximation

108
Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...
108
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

384
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
384
Newton’s Method01:30

Newton’s Method

81
Newton’s Method is a powerful iterative technique for approximating the roots of real-valued, differentiable functions, particularly when analytical solutions are impractical. This approach is widely used in scientific computing, engineering, and finance, where equations may be too complex for traditional algebraic methods to handle. The method relies on an iterative process that refines an initial estimate using the function’s derivative to approach the true solution progressively.
81
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

407
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
407
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

120
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
120
Gauss's Law: Problem-Solving01:10

Gauss's Law: Problem-Solving

2.7K
Gauss's law helps determine electric fields even though the law is not directly about electric fields but electric flux. In situations with certain symmetries (spherical, cylindrical, or planar) in the charge distribution, the electric field can be deduced based on the knowledge of the electric flux. In these systems, we can find a Gaussian surface S over which the electric field has a constant magnitude. Furthermore, suppose the electric field is parallel (or antiparallel) to the area vector...
2.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Therapeutic targeting of AREL1 in hepatic stellate cells attenuates MASH-related liver fibrosis.

Nature communications·2026
Same author

Shenfu Injection Inhibits Cardiac Fibroblast Activation and Improves Myocardial Fibrosis by Down-regulating M1 Macrophage-Derived Exosomal miR-155-5p.

Cardiovascular drugs and therapy·2026
Same author

A rare collision tumor of lymphoepithelioma-like carcinoma and t-cell lymphoma: a case report.

Frontiers in oncology·2026
Same author

Laser-induced glare effect and image reconstruction via a frequency-aware transformer.

Optics express·2026
Same author

Knowledge, attitudes, and practices of anesthesiology healthcare professionals regarding crisis resource management in Shanghai, China: a cross-sectional study.

Scientific reports·2026
Same author

Research on anomaly detection and operational status evaluation methods for smart electricity meters based on hybrid deep learning.

PloS one·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hierarchical Semantic Concept Modeling for Generalizable Myocardial Pathology Segmentation on Multisequence CMR Images.

IEEE transactions on neural networks and learning systems·2026
Same journal

Stability of Time-Varying Impulsive Systems With State-Dependent Delay and Its Application in Complex Networks.

IEEE transactions on neural networks and learning systems·2026
Same journal

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Feb 27, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
06:45

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

2.2K

Gauss-Newton Temporal Difference Learning With Nonlinear Function Approximation.

Zhifa Ke, Junyu Zhang, Zaiwen Wen

    IEEE Transactions on Neural Networks and Learning Systems
    |February 25, 2026
    PubMed
    Summary
    This summary is machine-generated.

    A new Gauss-Newton temporal difference (GNTD) learning method improves Q-learning with nonlinear approximations. GNTD offers better sample complexity and faster convergence in reinforcement learning benchmarks.

    Related Experiment Videos

    Last Updated: Feb 27, 2026

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator
    06:45

    Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

    Published on: October 28, 2022

    2.2K

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Reinforcement Learning

    Background:

    • Q-learning is a fundamental reinforcement learning algorithm.
    • Nonlinear function approximations are crucial for handling complex state spaces in Q-learning.
    • Existing temporal difference (TD) methods face challenges with sample complexity and convergence rates.

    Purpose of the Study:

    • To propose a novel Gauss-Newton temporal difference (GNTD) learning method.
    • To address the limitations of existing methods in terms of sample complexity and convergence.
    • To enable efficient and stable Q-learning with nonlinear function approximations.

    Main Methods:

    • The proposed GNTD method utilizes Gauss-Newton (GN) steps to optimize a variant of mean-squared Bellman error (MSBE).
    • Target networks are employed to mitigate issues related to double sampling.
    • Inexact GN steps are analyzed for efficient computation using matrix iterations.
    • Nonasymptotic finite-sample convergence guarantees are derived under mild conditions.

    Main Results:

    • GNTD achieves an improved sample complexity of $\tilde {\mathcal {O}}(\varepsilon ^{-1})$ for neural networks with ReLU activation, outperforming existing TD methods.
    • For general smooth function approximations, GNTD establishes a sample complexity of $\tilde {\mathcal {O}}(\varepsilon ^{-1.5})$ .
    • Extensive experiments on reinforcement learning benchmarks demonstrate that GNTD yields higher rewards and faster convergence compared to TD-type methods.

    Conclusions:

    • The GNTD learning method provides a theoretically sound and empirically effective approach for Q-learning with nonlinear function approximations.
    • GNTD significantly enhances sample efficiency and convergence speed, particularly in deep reinforcement learning settings.
    • The proposed method offers a promising advancement for tackling complex reinforcement learning problems.