Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Time-Domain Interpretation of PD Control01:07

Time-Domain Interpretation of PD Control

371
Proportional-Derivative (PD) control is a widely used control method in various engineering systems to enhance stability and performance. In a system with only proportional control, common issues include high maximum overshoot and oscillation, observed in both the error signal and its rate of change. This behavior can be divided into three distinct phases: initial overshoot, subsequent undershoot, and gradual stabilization.
Consider the example of control of motor torque. Initially, a positive...
371
Feedback control systems01:26

Feedback control systems

685
Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...
685
Frequency-Domain Interpretation of PD Control01:24

Frequency-Domain Interpretation of PD Control

351
Proportional-Derivative (PD) controllers are widely used in fan control systems to improve stability and performance. A fan control system can be effectively represented using a Bode plot to illustrate the impact of a PD controller through its transfer function. The Bode plot visually conveys how PD control modifies the fan's response across various frequencies, providing a frequency domain interpretation of the controller's behavior.
The proportional control gain, combined with the...
351
Time and frequency -Domain Interpretation of Phase-lag Control01:21

Time and frequency -Domain Interpretation of Phase-lag Control

389
Phase-lag controllers are widely used in control systems to improve stability and reduce steady-state errors. A dimmer switch controlling the brightness of a light bulb serves as a practical example of phase-lag control, gradually adjusting the bulb's brightness. Mathematically, phase-lag control or low-pass filtering is represented when the factor 'a' is less than 1.
Phase-lag controllers do not place a pole at zero, but instead influence the steady-state error by amplifying any...
389
Cognitive Learning01:21

Cognitive Learning

1.0K
Cognitive learning is based on purposive behavior, incidental learning, and insight learning.
E. C. Tolman's theory of purposive behavior emphasizes that much behavior is goal-directed. He argued that to understand behavior, we must look at the entire sequence of actions leading to a goal. For instance, high school students study hard, not just due to past reinforcement but also to achieve the goal of getting into a good college.
Tolman introduced the idea that behavior is influenced by...
1.0K
Transfer Function in Control Systems01:21

Transfer Function in Control Systems

1.5K
The transfer function is a fundamental concept in the analysis and design of linear time-invariant (LTI) systems. It offers a concise way to understand how a system responds to different inputs in the frequency domain. It serves as a bridge between the time-domain differential equations that describe system dynamics and the frequency-domain representation that facilitates easier manipulation and analysis.
To derive the transfer function, consider a general nth-order linear time-invariant...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Early diagnosis of capivasertib-associated severe hyperglycemia with diabetic ketosis diagnosed in routine clinical practice: a case report with review of literature.

Endocrine journal·2026
Same author

DROP: Distributional and Regular Optimism and Pessimism for Reinforcement Learning.

Neural computation·2026
Same author

Improvements to dark experience replay and reservoir sampling for better balance between consolidation and plasticity.

Frontiers in artificial intelligence·2026
Same author

Proportion of middle ear surgeries feasible via transcanal endoscopic ear surgery: A multicenter study in Japan.

Auris, nasus, larynx·2026
Same author

Neural-enhanced motion-to-EMG: refining simulated muscle activity from musculoskeletal models using a Seq2Seq approach.

Frontiers in bioengineering and biotechnology·2025
Same author

Expression of AQP-10, -11 and -12 in the rat stria vascularis.

Acta oto-laryngologica·2024
Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026
Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026
Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026
Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026
Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026
Same journal

Editorial: Synergizing large language models and computational intelligence for advanced robotic systems.

Frontiers in robotics and AI·2026
See all related articles

Related Experiment Video

Updated: Jan 15, 2026

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

5.1K

Weber-Fechner law in temporal difference learning derived from control as inference.

Keiichiro Takahashi1, Taisuke Kobayashi2, Tomoya Yamanokuchi1

  • 1Division of Information Science, Nara Institute of Science and Technology, Ikoma, Japan.

Frontiers in Robotics and AI
|October 13, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a nonlinear update rule for reinforcement learning (RL) inspired by biological learning. The Weber-Fechner law (WFL) enhances RL by accelerating reward acquisition and minimizing punishment.

Keywords:
Weber–Fechner lawcontrol as inferencereinforcement learningreward–punishment frameworkrobot controltemporal difference learning

More Related Videos

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.4K
A Two-interval Forced-choice Task for Multisensory Comparisons
07:13

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

11.4K

Related Experiment Videos

Last Updated: Jan 15, 2026

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface
11:54

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

5.1K
WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.4K
A Two-interval Forced-choice Task for Multisensory Comparisons
07:13

A Two-interval Forced-choice Task for Multisensory Comparisons

Published on: November 9, 2018

11.4K

Area of Science:

  • Computational Neuroscience
  • Machine Learning
  • Artificial Intelligence

Background:

  • Standard reinforcement learning (RL) uses linear temporal difference (TD) error updates, treating all rewards equally.
  • Biological systems exhibit nonlinearities in TD errors, leading to optimistic or pessimistic learning biases.
  • These nonlinear biases are potentially adaptive features of biological learning.

Purpose of the Study:

  • To explore a theoretical framework for leveraging nonlinearity between update degree and TD errors in RL.
  • To investigate the applicability of the Weber-Fechner law (WFL) within a control-as-inference framework for RL.
  • To demonstrate the practical utilities of WFL in RL through a reward-punishment system.

Main Methods:

  • Analysis of a control-as-inference framework to identify nonlinear relationships in RL.
  • Derivation and application of the Weber-Fechner law (WFL) to model the relationship between TD errors and update magnitudes.
  • Implementation of a reward-punishment framework to numerically demonstrate WFL's effects on RL policies.

Main Results:

  • The Weber-Fechner law (WFL) was identified, describing how perception of TD error changes with value function intensity.
  • WFL implementation demonstrated accelerated escape from low-reward situations.
  • WFL implementation showed enhanced pursuit of minimal punishment.

Conclusions:

  • The proposed RL algorithm incorporating WFL accelerates reward maximization and effectively suppresses punishments.
  • Nonlinear update rules, inspired by biological learning, offer significant advantages in RL.
  • WFL provides a viable mechanism for introducing beneficial biases into artificial learning systems.