Sigmoid-weighted linear units for neural network function approximation in reinforcement learning.

Stefan Elfwing¹, Eiji Uchibe², Kenji Doya³

¹Department of Brain Robot Interface, ATR Computational Neuroscience Laboratories, 2-2-2 Hikaridai, Seikacho, Soraku-gun, Kyoto 619-0288, Japan.

Neural Networks: the Official Journal of the International Neural Network Society

February 4, 2018

Summary

This summary is machine-generated.

This study introduces two activation functions for neural networks in reinforcement learning: the sigmoid-weighted linear unit (SiLU) and its derivative function (dSiLU). It shows that traditional on-policy learning with eligibility traces and softmax action selection can compete with deep reinforcement learning algorithms such as DQN.

Keywords:

Atari 2600
Deep learning
Function approximation
Reinforcement learning
Sigmoid-weighted linear unit
Tetris

Area of Science:

Artificial Intelligence
Machine Learning
Deep Learning

Background:

Neural networks are increasingly used as function approximators in reinforcement learning.
The deep reinforcement learning algorithm DQN has achieved human-level performance in many Atari 2600 games.
Traditional reinforcement learning methods with eligibility traces and softmax action selection are being re-evaluated.

Purpose of the Study:

To propose two new activation functions for neural networks: sigmoid-weighted linear unit (SiLU) and its derivative (dSiLU).
To demonstrate that traditional on-policy learning with eligibility traces can be competitive with deep reinforcement learning methods like DQN.
To validate the proposed activation functions and learning approach in challenging game environments.
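
The two activation functions named above can be sketched in a few lines. This is a minimal illustration, assuming the standard definitions: SiLU multiplies an input z by its sigmoid, and dSiLU is the derivative of SiLU with respect to z. The function names `sigmoid`, `silu`, and `dsilu` are chosen here for illustration and are not taken from the authors' code.

```python
import math


def sigmoid(z: float) -> float:
    """Logistic sigmoid, branched so that exp() never overflows."""
    if z >= 0.0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def silu(z: float) -> float:
    """Sigmoid-weighted linear unit: the input weighted by its sigmoid."""
    return z * sigmoid(z)


def dsilu(z: float) -> float:
    """Derivative of SiLU with respect to z, used as an activation itself."""
    s = sigmoid(z)
    return s * (1.0 + z * (1.0 - s))
```

For large positive inputs `silu(z)` approaches z, and for large negative inputs it approaches zero, so it behaves like a smooth counterpart of the rectified linear unit; `dsilu(0.0)` equals 0.5, the same value the sigmoid takes at zero.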

Main Methods:

Implemented SiLU and dSiLU activation functions for neural network function approximation.
Used on-policy learning with eligibility traces (TD(λ) and Sarsa(λ)) and softmax action selection.
Tested agents in stochastic SZ-Tetris, Tetris, and Atari 2600 games.
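
The learning scheme listed above can be illustrated with a small sketch of softmax action selection and one Sarsa(λ) update with accumulating eligibility traces. To keep it short, the sketch assumes a linear action-value function Q(s, a) = w[a] · φ(s) in place of the neural networks used in the study; all names (`softmax_probs`, `select_action`, `sarsa_lambda_step`) and the hyperparameters `tau`, `alpha`, `gamma`, and `lam` are hypothetical and do not come from the paper.

```python
import math
import random


def softmax_probs(q_values, tau):
    """Boltzmann action probabilities at temperature tau (> 0)."""
    m = max(q_values)  # subtract the max for numerical stability
    exps = [math.exp((q - m) / tau) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(q_values, tau, rng=random):
    """Sample an action index from the softmax distribution."""
    probs = softmax_probs(q_values, tau)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]


def dot(u, v):
    return sum(x * y for x, y in zip(u, v))


def sarsa_lambda_step(w, e, phi, a, r, phi_next, a_next,
                      alpha, gamma, lam, done):
    """One on-policy Sarsa(lambda) update, modifying w and e in place.

    w[b] and e[b] are the weight and trace vectors of action b
    (assumption: linear Q, so the gradient of Q(s, a) is phi on row a).
    """
    q = dot(w[a], phi)
    q_next = 0.0 if done else dot(w[a_next], phi_next)
    delta = r + gamma * q_next - q  # TD error

    # Decay every trace, then add the gradient for the action taken.
    for b in range(len(e)):
        for i in range(len(e[b])):
            e[b][i] *= gamma * lam
    for i, x in enumerate(phi):
        e[a][i] += x

    # Move all weights along their traces, scaled by the TD error.
    for b in range(len(w)):
        for i in range(len(w[b])):
            w[b][i] += alpha * delta * e[b][i]
    return delta
```

Because the update uses the action actually selected in the next state (`a_next`), it is on-policy, and the traces spread each TD error over recently visited state-action pairs without storing past transitions for replay.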

Main Results:

Achieved new state-of-the-art results in stochastic SZ-Tetris and in Tetris with a small 10×10 board, using shallow dSiLU network agents with TD(λ) learning.
Outperformed DQN in the Atari 2600 domain using deep Sarsa(λ) agents with SiLU and dSiLU hidden units.
Demonstrated that on-policy learning with eligibility traces, which needs no separate target network, is competitive with DQN.

Conclusions:

The proposed SiLU and dSiLU activation functions enhance neural network performance in reinforcement learning.
On-policy learning with eligibility traces offers a competitive alternative to experience replay-based methods like DQN.
This research provides effective and efficient approaches for reinforcement learning agents in complex environments.