Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

Transformers with Off-Nominal Turns Ratios

Transformers with Off-Nominal Turns Ratios

In scenarios involving parallel transformers with disparate ratings, developing per-unit models requires accommodating off-nominal turns ratios. This situation arises when the selected base voltages are not proportional to the transformer’s voltage ratings. Consider a transformer where the rated voltages are related by the term a. If the chosen voltage bases satisfy a relationship involving term b, term c is defined as the ratio of these bases. This ratio is then substituted into the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Characterization and determination of the immunomodulatory activity of intestinal microbiota microorganisms isolated from free-range Cornu aspersum snails in olive groves before and after aestivation.

Developmental and comparative immunology·2026

Same author

Photonic Kolmogorov-Arnold networks based on self-phase modulation in nonlinear waveguides.

Optics letters·2026

Same author

Comparison of ChatGPT-3.5 and GPT-4 as potential tools in artificial intelligence-assisted clinical practice in renal and liver transplantation.

World journal of transplantation·2025

Same author

Inducing Neural Collapse via Anticlasses and One-Cold Cross-Entropy Loss.

IEEE transactions on neural networks and learning systems·2025

Same author

Analog nanophotonic computing going practical: silicon photonic deep learning engines for tiled optical matrix multiplication with dynamic precision.

Nanophotonics (Berlin, Germany)·2024

Same author

Predicting the immunomodulatory activity of probiotic lactic acid bacteria using supervised machine learning in a Cornu aspersum snail model.

Fish & shellfish immunology·2024

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 9, 2025

Deep Neural Networks for Image-Based Dietary Assessment

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

Sign potential-driven multiplicative optimization for robust deep reinforcement learning.

Loukia Avramelou¹, Manos Kirtas¹, Nikolaos Passalis²

¹Computational Intelligence and Deep Learning Research Group, Dept. of Informatics, Aristotle University of Thessaloniki, Greece.

Neural Networks : the Official Journal of the International Neural Network Society

|May 6, 2025

Summary

This summary is machine-generated.

Researchers developed a novel optimization method for Deep Reinforcement Learning (DRL) that enhances training stability and speed. This new approach uses a unique sign-change mechanism, improving the robustness of DRL agents in complex tasks.

Keywords:

Deep reinforcement learning Multiplicative optimizer Optimization

More Related Videos

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Related Experiment Videos

Last Updated: May 9, 2025

Deep Neural Networks for Image-Based Dietary Assessment

Deep Neural Networks for Image-Based Dietary Assessment

Published on: March 13, 2021

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Deep Reinforcement Learning (DRL) offers solutions for complex problems in robotics, autonomous driving, and finance.
DRL models often suffer from training instability and sensitivity, necessitating robust optimization methods.

Purpose of the Study:

To introduce a novel momentum-based optimization approach for Deep Reinforcement Learning.
To address limitations in existing multiplicative update methods, specifically parameter sign-flipping.

Main Methods:

Developed a momentum-based optimizer incorporating a sign-change mechanism inspired by spiking neural networks.
The proposed method allows parameters to change signs, enhancing multiplicative updates.

Main Results:

The novel optimizer demonstrated effectiveness in accelerating learning and improving robustness during DRL agent training.
Experimental evaluations across various tasks confirmed the proposed method's benefits for DRL training.

Conclusions:

The proposed optimization approach significantly enhances the stability and efficiency of Deep Reinforcement Learning.
This method provides a robust solution for training DRL agents, overcoming limitations of current techniques.