Weber-Fechner law in temporal difference learning derived from control as inference.

Keiichiro Takahashi¹, Taisuke Kobayashi², Tomoya Yamanokuchi¹

¹Division of Information Science, Nara Institute of Science and Technology, Ikoma, Japan.

Frontiers in Robotics and AI

October 13, 2025

Summary

This summary is machine-generated.

This study introduces a nonlinear update rule for reinforcement learning (RL), inspired by biological learning, that follows the Weber-Fechner law (WFL). In numerical demonstrations, the WFL-based rule accelerates reward acquisition and suppresses punishment.

Keywords:

Weber–Fechner law; control as inference; reinforcement learning; reward–punishment framework; robot control; temporal difference learning


Area of Science:

Computational Neuroscience
Machine Learning
Artificial Intelligence

Background:

Standard reinforcement learning (RL) applies updates that are linearly proportional to the temporal difference (TD) error, treating all rewards equally.
Biological systems exhibit nonlinearities between TD errors and the degree of update, leading to optimistic or pessimistic learning biases.
These nonlinear biases are potentially adaptive features of biological learning.
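
For reference, the linear baseline can be sketched as a tabular TD(0) update. This is a minimal illustration, not code from the study: the function name `td0_update`, the list-based value table, and the step size and discount values are all assumptions chosen for the example.

```python
def td0_update(values, state, reward, next_state, alpha=0.1, gamma=0.99):
    """Apply one tabular TD(0) update in place and return the TD error.

    `values` is a hypothetical list of state values indexed by state id;
    `alpha` (step size) and `gamma` (discount) are illustrative defaults.
    """
    td_error = reward + gamma * values[next_state] - values[state]
    # The degree of update is linear in the TD error: doubling the error
    # doubles the change, regardless of how large values[state] already is.
    values[state] += alpha * td_error
    return td_error


values = [0.0, 0.0]
td_error = td0_update(values, state=0, reward=1.0, next_state=1)
```

Because the step is `alpha * td_error`, the same TD error always moves the estimate by the same amount; this is the property that nonlinear update rules relax.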

Purpose of the Study:

To explore a theoretical framework for leveraging nonlinearity between update degree and TD errors in RL.
To investigate the applicability of the Weber-Fechner law (WFL) within a control-as-inference framework for RL.
To demonstrate the practical utility of WFL in RL through a reward-punishment framework.

Main Methods:

Analysis of a control-as-inference framework to identify nonlinear relationships in RL.
Derivation and application of the Weber-Fechner law (WFL) to model the relationship between TD errors and update magnitudes.
Implementation of a reward-punishment framework to numerically demonstrate WFL's effects on RL policies.
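
One way to picture a reward-punishment framework is to split a scalar signal into a non-negative reward and a non-positive punishment and to learn a separate value estimate for each. The sketch below rests on that assumption; the names `split_signal` and `update_pair`, the tabular representation, and the plain TD(0) updates are hypothetical and are not taken from the study's implementation.

```python
def split_signal(signal):
    """Split a scalar signal into (reward >= 0, punishment <= 0)."""
    return max(signal, 0.0), min(signal, 0.0)


def update_pair(v_reward, v_punish, state, signal, next_state,
                alpha=0.1, gamma=0.9):
    """Update separate reward and punishment value tables in place.

    Both tables are hypothetical lists indexed by state id; each one
    receives an ordinary TD(0) update on its own part of the signal.
    """
    reward, punishment = split_signal(signal)
    for values, r in ((v_reward, reward), (v_punish, punishment)):
        td_error = r + gamma * values[next_state] - values[state]
        values[state] += alpha * td_error


v_reward, v_punish = [0.0, 0.0], [0.0, 0.0]
update_pair(v_reward, v_punish, state=0, signal=-2.0, next_state=1)
```

Keeping the two estimates apart lets a nonlinear rule treat rewards and punishments differently, which a single signed value estimate would not allow.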

Main Results:

The Weber-Fechner law (WFL) was identified in the update rule: the perceived change produced by a given TD error (the degree of update) depends on the intensity of the value function.
WFL implementation demonstrated accelerated escape from low-reward situations.
WFL implementation showed enhanced pursuit of minimal punishment.
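
The classical Weber relation says that a perceived change scales as the stimulus change divided by the stimulus intensity. Mapping the TD error to the stimulus change and the value function to the intensity gives the toy rule below. It is an illustration of that general relation only, not the update rule derived in the study; the function name `wfl_style_step` and the `offset` term (added to avoid division by zero) are assumptions.

```python
def wfl_style_step(td_error, value, alpha=0.1, offset=1.0):
    """Return an update step that is attenuated as |value| grows.

    Follows the Weber-style form dP = k * dS / S, with dS = td_error
    and S = offset + |value|. `offset` is a hypothetical constant.
    """
    return alpha * td_error / (offset + abs(value))


# Same TD error, different value intensities.
step_at_low_value = wfl_style_step(td_error=1.0, value=0.0)
step_at_high_value = wfl_style_step(td_error=1.0, value=9.0)
```

Under this toy rule the same TD error yields a larger step where the value estimate is small than where it is large, which gives an intuition for why low-reward situations could be left more quickly.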

Conclusions:

The proposed RL algorithm incorporating WFL accelerates reward maximization and effectively suppresses punishments.
Nonlinear update rules, inspired by biological learning, offer significant advantages in RL.
WFL provides a viable mechanism for introducing beneficial biases into artificial learning systems.