Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Linearization and Approximation01:26

Linearization and Approximation

77
Linearization is a mathematical technique used to approximate complex, nonlinear functions with simpler linear models in the vicinity of a chosen reference point. The method is based on the idea that, although a function may be difficult to evaluate exactly, its behavior near a specific input value can often be closely approximated by the tangent line at that point. This approach is particularly useful when small deviations from a known value are involved.Consider the square root function, for...
77
Application of Linearization and Approximation01:29

Application of Linearization and Approximation

105
A drone flying through complex terrain often relies on more than one sensing method to estimate small changes in altitude. Along with direct measurements, air pressure provides a useful indirect indicator of vertical movement. Atmospheric pressure decreases as altitude increases, and this relationship is commonly described using an exponential model. Although accurate, converting pressure measurements into altitude values requires calculations that are too complex to perform repeatedly during...
105
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

393
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
393
Linear Approximation in Time Domain01:21

Linear Approximation in Time Domain

379
Nonlinear systems often require sophisticated approaches for accurate modeling and analysis, with state-space representation being particularly effective. This method is especially useful for systems where variables and parameters vary with time or operating conditions, such as in a simple pendulum or a translational mechanical system with nonlinear springs.
For a simple pendulum with a mass evenly distributed along its length and the center of mass located at half the pendulum's length,...
379
Network Function of a Circuit01:25

Network Function of a Circuit

889
Frequency response analysis in electrical circuits provides vital insights into a circuit's behavior as the frequency of the input signal changes. The transfer function, a mathematical tool, is instrumental in understanding this behavior. It defines the relationship between phasor output and input and comes in four types: voltage gain, current gain, transfer impedance, and transfer admittance. The critical components of the transfer function are the poles and zeros.
889
Approximate Integration01:24

Approximate Integration

59
In many practical and theoretical contexts, the exact value of a definite integral may be inaccessible. This limitation typically arises when the antiderivative of a function is either unknown or cannot be expressed in a closed mathematical form. Alternatively, it can occur when a function is defined not by a formula but by a finite set of empirical data points, such as those collected during experiments. In these cases, approximate integration techniques provide a valuable solution.One of the...
59

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Data-driven inverse optimal control for continuous-time nonlinear systems.

ISA transactions·2025
Same author

A computational model of canonical cortical microcircuits for dynamic Bayesian inference and control as inference.

Neuroscience research·2025
Same author

Possible contribution to data-driven primate research: Comment on "Kinematic coding: Measuring information in naturalistic behaviour" by Becchio, Pullar, Scaliti, and Panzeri.

Physics of life reviews·2025
Same author

Optical Neuroimage Studio (OptiNiSt): Intuitive, scalable, extendable framework for optical neuroimage data analysis.

PLoS computational biology·2025
Same author

Information-Theoretical Analysis of Team Dynamics in Football Matches.

Entropy (Basel, Switzerland)·2025
Same author

The differential effect of optogenetic serotonergic manipulation on sustained motor actions and waiting for future rewards in mice.

Frontiers in neuroscience·2024
Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Dynamic analysis and reliable mechanical optimization application of ring HNN effected with a memristive neuron.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

DAFF-Net: A detection and search method for small-scale low surface brightness galaxies.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Quasi-synchronization for complex networks with hybrid pinning intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Related Experiment Video

Updated: Feb 15, 2026

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring
08:47

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 13, 2008

11.8K

Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.

Stefan Elfwing1, Eiji Uchibe2, Kenji Doya3

  • 1Department of Brain Robot Interface, ATR Computational Neuroscience Laboratories, 2-2-2 Hikaridai, Seikacho, Soraku-gun, Kyoto 619-0288, Japan.

Neural Networks : the Official Journal of the International Neural Network Society
|February 4, 2018
PubMed
Summary
This summary is machine-generated.

This study introduces novel activation functions (SiLU and dSiLU) for neural networks in reinforcement learning. The research demonstrates competitive performance against deep reinforcement learning algorithms like DQN using traditional methods.

Keywords:
Atari 2600Deep learningFunction approximationReinforcement learningSigmoid-weighted linear unitTetris

More Related Videos

Co-analysis of Brain Structure and Function using fMRI and Diffusion-weighted Imaging
17:06

Co-analysis of Brain Structure and Function using fMRI and Diffusion-weighted Imaging

Published on: November 8, 2012

27.1K
Divergence of Root Microbiota in Different Habitats based on Weighted Correlation Networks
09:49

Divergence of Root Microbiota in Different Habitats based on Weighted Correlation Networks

Published on: September 25, 2021

4.9K

Related Experiment Videos

Last Updated: Feb 15, 2026

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring
08:47

Functional Imaging with Reinforcement, Eyetracking, and Physiological Monitoring

Published on: November 13, 2008

11.8K
Co-analysis of Brain Structure and Function using fMRI and Diffusion-weighted Imaging
17:06

Co-analysis of Brain Structure and Function using fMRI and Diffusion-weighted Imaging

Published on: November 8, 2012

27.1K
Divergence of Root Microbiota in Different Habitats based on Weighted Correlation Networks
09:49

Divergence of Root Microbiota in Different Habitats based on Weighted Correlation Networks

Published on: September 25, 2021

4.9K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Deep Learning

Background:

  • Neural networks are increasingly used as function approximators in reinforcement learning.
  • Deep reinforcement learning algorithms like DQN have achieved human-level performance in various domains.
  • Traditional reinforcement learning methods with eligibility traces and softmax action selection are being re-evaluated.

Purpose of the Study:

  • To propose two new activation functions for neural networks: sigmoid-weighted linear unit (SiLU) and its derivative (dSiLU).
  • To demonstrate that traditional on-policy learning with eligibility traces can be competitive with deep reinforcement learning methods like DQN.
  • To validate the proposed activation functions and learning approach in challenging game environments.

Main Methods:

  • Implementation of SiLU and dSiLU activation functions for neural network approximation.
  • Utilizing on-policy learning with eligibility traces (TD(λ) and Sarsa(λ)) and softmax action selection.
  • Testing agents in stochastic SZ-Tetris, Tetris, and Atari 2600 games.

Main Results:

  • Achieved new state-of-the-art results in Tetris variants using shallow dSiLU network agents with TD(λ) learning.
  • Outperformed DQN in the Atari 2600 domain using deep Sarsa(λ) agents with SiLU and dSiLU hidden units.
  • Demonstrated the competitiveness of on-policy learning with eligibility traces against DQN without a target network.

Conclusions:

  • The proposed SiLU and dSiLU activation functions enhance neural network performance in reinforcement learning.
  • On-policy learning with eligibility traces offers a competitive alternative to experience replay-based methods like DQN.
  • This research provides effective and efficient approaches for reinforcement learning agents in complex environments.