Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Linearization and Approximation

Linearization and Approximation

Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...

Linear Approximation in Time Domain

Linear Approximation in Time Domain

Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...

Newton’s Method

Newton’s Method

Newton’s Method is a powerful iterative technique for approximating the roots of real-valued, differentiable functions, particularly when analytical solutions are impractical. This approach is widely used in scientific computing, engineering, and finance, where equations may be too complex for traditional algebraic methods to handle. The method relies on an iterative process that refines an initial estimate using the function’s derivative to approach the true solution progressively.

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Application of Linearization and Approximation

Application of Linearization and Approximation

A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...

Gauss's Law: Problem-Solving

Gauss's Law: Problem-Solving

Gauss's law helps determine electric fields even though the law is not directly about electric fields but electric flux. In situations with certain symmetries (spherical, cylindrical, or planar) in the charge distribution, the electric field can be deduced based on the knowledge of the electric flux. In these systems, we can find a Gaussian surface S over which the electric field has a constant magnitude. Furthermore, suppose the electric field is parallel (or antiparallel) to the area vector...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Therapeutic targeting of AREL1 in hepatic stellate cells attenuates MASH-related liver fibrosis.

Nature communications·2026

Same author

Shenfu Injection Inhibits Cardiac Fibroblast Activation and Improves Myocardial Fibrosis by Down-regulating M1 Macrophage-Derived Exosomal miR-155-5p.

Cardiovascular drugs and therapy·2026

Same author

A rare collision tumor of lymphoepithelioma-like carcinoma and t-cell lymphoma: a case report.

Frontiers in oncology·2026

Same author

Laser-induced glare effect and image reconstruction via a frequency-aware transformer.

Optics express·2026

Same author

Knowledge, attitudes, and practices of anesthesiology healthcare professionals regarding crisis resource management in Shanghai, China: a cross-sectional study.

Scientific reports·2026

Same author

Research on anomaly detection and operational status evaluation methods for smart electricity meters based on hybrid deep learning.

PloS one·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

Same journal

Hierarchical Semantic Concept Modeling for Generalizable Myocardial Pathology Segmentation on Multisequence CMR Images.

IEEE transactions on neural networks and learning systems·2026

Same journal

Stability of Time-Varying Impulsive Systems With State-Dependent Delay and Its Application in Complex Networks.

IEEE transactions on neural networks and learning systems·2026

Same journal

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Feb 27, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Gauss-Newton Temporal Difference Learning With Nonlinear Function Approximation.

Zhifa Ke, Junyu Zhang, Zaiwen Wen

IEEE Transactions on Neural Networks and Learning Systems

|February 25, 2026

Summary

This summary is machine-generated.

A new Gauss-Newton temporal difference (GNTD) learning method improves Q-learning with nonlinear approximations. GNTD offers better sample complexity and faster convergence in reinforcement learning benchmarks.

Related Experiment Videos

Last Updated: Feb 27, 2026

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Design and Application of a Fault Detection Method Based on Adaptive Filters and Rotational Speed Estimation for an Electro-Hydrostatic Actuator

Published on: October 28, 2022

Area of Science:

Artificial Intelligence
Machine Learning
Reinforcement Learning

Background:

Q-learning is a fundamental reinforcement learning algorithm.
Nonlinear function approximations are crucial for handling complex state spaces in Q-learning.
Existing temporal difference (TD) methods face challenges with sample complexity and convergence rates.

Purpose of the Study:

To propose a novel Gauss-Newton temporal difference (GNTD) learning method.
To address the limitations of existing methods in terms of sample complexity and convergence.
To enable efficient and stable Q-learning with nonlinear function approximations.

Main Methods:

The proposed GNTD method utilizes Gauss-Newton (GN) steps to optimize a variant of mean-squared Bellman error (MSBE).
Target networks are employed to mitigate issues related to double sampling.
Inexact GN steps are analyzed for efficient computation using matrix iterations.
Nonasymptotic finite-sample convergence guarantees are derived under mild conditions.

Main Results:

GNTD achieves an improved sample complexity of $\tilde {\mathcal {O}}(\varepsilon ^{-1})$ for neural networks with ReLU activation, outperforming existing TD methods.
For general smooth function approximations, GNTD establishes a sample complexity of $\tilde {\mathcal {O}}(\varepsilon ^{-1.5})$ .
Extensive experiments on reinforcement learning benchmarks demonstrate that GNTD yields higher rewards and faster convergence compared to TD-type methods.

Conclusions:

The GNTD learning method provides a theoretically sound and empirically effective approach for Q-learning with nonlinear function approximations.
GNTD significantly enhances sample efficiency and convergence speed, particularly in deep reinforcement learning settings.
The proposed method offers a promising advancement for tackling complex reinforcement learning problems.