Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Linearization and Approximation01:26

Linearization and Approximation

31
Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...
31
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

70
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
70
Accuracy, limits, and approximation01:28

Accuracy, limits, and approximation

1.1K
Accuracy, limits, and approximations are common in many fields, especially in engineering calculations. These concepts are imperative for ensuring that a given value is as close as possible to its true value.
Accuracy is defined as the closeness of the measured value to the true or actual value. In engineering mechanics, repeated measurements are taken during theoretical or experimental analyses to ensure that the result is precise and accurate.
The accuracy of any solution is based on the...
1.1K
Reinforcement01:23

Reinforcement

855
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
855
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

360
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
360
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

349
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
349

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The Discovery of HNPC-A0073: A Novel Thiazol-5-ylmethyl-Based 3-Nitropyridin-2-amine Fungicide.

Journal of agricultural and food chemistry·2026
Same author

Lineage tracing of soma-to-primordial germ cell-like conversion in human tumor cell line.

iScience·2026
Same author

Antibacterial activity and multi-target mechanism of naringenin against Alicyclobacillus acidoterrestris and its application in apple juice.

International journal of food microbiology·2026
Same author

Fingertip-scale six-axis tactile interface with high-precision force sensing and position localization for dexterous human-machine interactions.

Microsystems & nanoengineering·2026
Same author

Refine Then Fusion: Robust 3D Brain MRI Synthesis via Vision-Language Collaboration.

IEEE transactions on medical imaging·2026
Same author

Regulatory roles of rpoS and rpoE on the response profiles of Salmonella enterica serovar Typhimurium to acid and osmosis sequential cross-stress treatment.

Food research international (Ottawa, Ont.)·2026
Same journal

Granular Ball-Based Noise-Resistant Fuzzy Multineighborhood Feature Selection via Label Enhancement and Feature Graph.

IEEE transactions on neural networks and learning systems·2026
Same journal

Fighting Evolving Spam With ARTMAP Models: A Noise-Resilient Online Detection Framework.

IEEE transactions on neural networks and learning systems·2026
Same journal

HyperSAT: Unsupervised Hypergraph Neural Networks for Weighted MaxSAT Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

Negation of Basic Belief Assignment in Multisource Information Fusion on Dempster-Shafer Theory With Applications in Pattern Classification.

IEEE transactions on neural networks and learning systems·2026
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Jan 21, 2026

Deep Learning-Based Segmentation of Cryo-Electron Tomograms
10:25

Deep Learning-Based Segmentation of Cryo-Electron Tomograms

Published on: November 11, 2022

10.6K

Approximate Policy-Based Accelerated Deep Reinforcement Learning.

Xuesong Wang, Yang Gu, Yuhu Cheng

    IEEE Transactions on Neural Networks and Learning Systems
    |August 10, 2019
    PubMed
    Summary
    This summary is machine-generated.

    We introduce a novel Approximate Policy-based Accelerated (APA) algorithm to speed up deep reinforcement learning (DRL). This method enhances learning efficiency and demonstrates superior performance across various tasks.

    More Related Videos

    DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
    04:17

    DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

    Published on: May 10, 2024

    1.5K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    3.4K

    Related Experiment Videos

    Last Updated: Jan 21, 2026

    Deep Learning-Based Segmentation of Cryo-Electron Tomograms
    10:25

    Deep Learning-Based Segmentation of Cryo-Electron Tomograms

    Published on: November 11, 2022

    10.6K
    DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning
    04:17

    DNA Virus Detection System Based on RPA-CRISPR/Cas12a-SPM and Deep Learning

    Published on: May 10, 2024

    1.5K
    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
    04:48

    Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

    Published on: November 30, 2022

    3.4K

    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Deep Reinforcement Learning

    Background:

    • Deep Reinforcement Learning (DRL) algorithms show high performance but suffer from slow training due to complex networks and numerous parameters, limiting learning efficiency.
    • The time-consuming training process hinders the practical application of DRL agents.

    Purpose of the Study:

    • To accelerate the learning process of DRL agents.
    • To improve the learning efficiency of deep reinforcement learning algorithms.

    Main Methods:

    • Propose a novel Approximate Policy-based Accelerated (APA) algorithm based on error analysis of approximate policy iteration reinforcement learning.
    • Develop three new DRL algorithms: APA-DQN, APA-Double DQN, and APA-DDPG by integrating the APA algorithm with existing DRL frameworks.
    • Validate the algorithms on both discrete-action and continuous-action tasks.

    Main Results:

    • The APA algorithm is proven to be convergent, even with higher learning rates, leading to faster DRL agent learning.
    • The proposed APA-DQN, APA-Double DQN, and APA-DDPG algorithms demonstrate adaptability and superior performance compared to baseline methods.
    • The accelerated algorithms show significant potential for practical applications in diverse tasks.

    Conclusions:

    • The APA algorithm effectively speeds up DRL training and improves learning efficiency.
    • The integration of APA with DQN, Double DQN, and DDPG enhances their performance and adaptability.
    • The proposed methods hold great promise for advancing the practical utility of DRL.