
Updated: Jan 21, 2026


Approximate Policy-Based Accelerated Deep Reinforcement Learning.

Xuesong Wang, Yang Gu, Yuhu Cheng

IEEE Transactions on Neural Networks and Learning Systems

August 10, 2019

Summary

This summary is machine-generated.

We introduce a novel Approximate Policy-based Accelerated (APA) algorithm to speed up deep reinforcement learning (DRL). This method enhances learning efficiency and demonstrates superior performance across various tasks.


Area of Science:

Artificial Intelligence
Machine Learning
Deep Reinforcement Learning

Background:

Deep Reinforcement Learning (DRL) algorithms show high performance but suffer from slow training due to complex networks and numerous parameters, limiting learning efficiency.
The time-consuming training process hinders the practical application of DRL agents.

Purpose of the Study:

To accelerate the learning process of DRL agents.
To improve the learning efficiency of deep reinforcement learning algorithms.

Main Methods:

Propose a novel Approximate Policy-based Accelerated (APA) algorithm based on error analysis of approximate policy iteration reinforcement learning.
Develop three new DRL algorithms: APA-DQN, APA-Double DQN, and APA-DDPG by integrating the APA algorithm with existing DRL frameworks.
Validate the algorithms on both discrete-action and continuous-action tasks.
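For context on the baselines named above, the sketch below shows the standard one-step targets used by DQN and Double DQN. It does not implement the APA algorithm, whose update rule is not given in this summary. The function names and the toy Q-values are illustrative assumptions, and plain Python lists stand in for network outputs.

```python
from typing import Sequence


def dqn_target(reward: float, gamma: float,
               next_q_target: Sequence[float], done: bool) -> float:
    """Standard DQN target: bootstrap from the target network's largest Q-value."""
    if done:
        return reward  # no bootstrapping past a terminal state
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward: float, gamma: float,
                      next_q_online: Sequence[float],
                      next_q_target: Sequence[float], done: bool) -> float:
    """Double DQN target: the online network picks the action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Hypothetical Q-values for the next state (three discrete actions).
next_q_online = [1.0, 2.0, 0.5]
next_q_target = [1.5, 1.0, 3.0]

y_dqn = dqn_target(1.0, 0.9, next_q_target, done=False)
y_ddqn = double_dqn_target(1.0, 0.9, next_q_online, next_q_target, done=False)
print(y_dqn, y_ddqn)
```

In this toy case the two targets differ because the online network prefers a different action than the one the target network scores highest, which is the decoupling that Double DQN relies on to reduce overestimation.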

Main Results:

The APA algorithm is proven to converge even at higher learning rates, which allows DRL agents to learn faster.
The proposed APA-DQN, APA-Double DQN, and APA-DDPG algorithms demonstrate adaptability and superior performance compared to baseline methods.
The accelerated algorithms show significant potential for practical applications in diverse tasks.
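The link between learning rate and training speed can be illustrated with a deliberately simple tabular update. This is only a sketch under strong assumptions (a single value, a fixed noise-free target, a hypothetical helper `steps_to_converge`); it is not the APA algorithm and says nothing about its convergence proof. With noisy targets and function approximation, large learning rates can destabilize ordinary DRL training, which is the situation the results above claim APA handles.

```python
def steps_to_converge(alpha: float, target: float = 1.0, q0: float = 0.0,
                      tol: float = 1e-3, max_steps: int = 10_000) -> int:
    """Count updates q <- q + alpha * (target - q) until |target - q| < tol.

    With a fixed target the error shrinks by a factor (1 - alpha) per step,
    so a larger alpha (below 1) needs fewer updates.
    """
    q = q0
    for step in range(1, max_steps + 1):
        q += alpha * (target - q)
        if abs(target - q) < tol:
            return step
    return max_steps


slow = steps_to_converge(alpha=0.1)
fast = steps_to_converge(alpha=0.5)
print(f"alpha=0.1 needs {slow} updates, alpha=0.5 needs {fast}")
```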

Conclusions:

The APA algorithm effectively speeds up DRL training and improves learning efficiency.
The integration of APA with DQN, Double DQN, and DDPG enhances their performance and adaptability.
The proposed methods hold great promise for advancing the practical utility of DRL.