Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

84
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
84
Associative Learning01:27

Associative Learning

234
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
234
Reinforcement Schedules01:24

Reinforcement Schedules

108
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
108
Improving Translational Accuracy02:07

Improving Translational Accuracy

8.5K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
8.5K
Decision Making: P-value Method01:09

Decision Making: P-value Method

5.2K
The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim  is also stated. These statements can act as null and alternative hypotheses:  a null hypothesis would be a neutral statement while the alternative hypothesis can...
5.2K
Transformers with Off-Nominal Turns Ratios01:25

Transformers with Off-Nominal Turns Ratios

122
In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...
122

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Characterization and determination of the immunomodulatory activity of intestinal microbiota microorganisms isolated from free-range Cornu aspersum snails in olive groves before and after aestivation.

Developmental and comparative immunology·2026
Same author

Photonic Kolmogorov-Arnold networks based on self-phase modulation in nonlinear waveguides.

Optics letters·2026
Same author

Comparison of ChatGPT-3.5 and GPT-4 as potential tools in artificial intelligence-assisted clinical practice in renal and liver transplantation.

World journal of transplantation·2025
Same author

Inducing Neural Collapse via Anticlasses and One-Cold Cross-Entropy Loss.

IEEE transactions on neural networks and learning systems·2025
Same author

Analog nanophotonic computing going practical: silicon photonic deep learning engines for tiled optical matrix multiplication with dynamic precision.

Nanophotonics (Berlin, Germany)·2024
Same author

Predicting the immunomodulatory activity of probiotic lactic acid bacteria using supervised machine learning in a Cornu aspersum snail model.

Fish & shellfish immunology·2024
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: May 9, 2025

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K

Sign potential-driven multiplicative optimization for robust deep reinforcement learning.

Loukia Avramelou1, Manos Kirtas1, Nikolaos Passalis2

  • 1Computational Intelligence and Deep Learning Research Group, Dept. of Informatics, Aristotle University of Thessaloniki, Greece.

Neural Networks : the Official Journal of the International Neural Network Society
|May 6, 2025
PubMed
Summary
This summary is machine-generated.

Researchers developed a novel optimization method for Deep Reinforcement Learning (DRL) that enhances training stability and speed. This new approach uses a unique sign-change mechanism, improving the robustness of DRL agents in complex tasks.

Keywords:
Deep reinforcement learningMultiplicative optimizerOptimization

More Related Videos

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

897
A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.3K

Related Experiment Videos

Last Updated: May 9, 2025

Deep Neural Networks for Image-Based Dietary Assessment
13:19

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

8.9K
Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

897
A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.3K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Robotics

Background:

  • Deep Reinforcement Learning (DRL) offers solutions for complex problems in robotics, autonomous driving, and finance.
  • DRL models often suffer from training instability and sensitivity, necessitating robust optimization methods.

Purpose of the Study:

  • To introduce a novel momentum-based optimization approach for Deep Reinforcement Learning.
  • To address limitations in existing multiplicative update methods, specifically parameter sign-flipping.

Main Methods:

  • Developed a momentum-based optimizer incorporating a sign-change mechanism inspired by spiking neural networks.
  • The proposed method allows parameters to change signs, enhancing multiplicative updates.

Main Results:

  • The novel optimizer demonstrated effectiveness in accelerating learning and improving robustness during DRL agent training.
  • Experimental evaluations across various tasks confirmed the proposed method's benefits for DRL training.

Conclusions:

  • The proposed optimization approach significantly enhances the stability and efficiency of Deep Reinforcement Learning.
  • This method provides a robust solution for training DRL agents, overcoming limitations of current techniques.