Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

PI Controller: Design01:24

PI Controller: Design

465
Proportional Integral (PI) controllers are a fundamental component in modern control systems, widely used to enhance performance and mitigate steady-state errors. They are particularly effective in applications such as automatic brightness adjustment on smartphones, where they excel at mitigating steady-state errors for step-function inputs. Unlike PD controllers, which require time-varying errors to function optimally, PI controllers leverage their integral component to address residual...
465
Parameters Affecting Nonlinear Elimination: Zero-Order Input, First-Order Absorption and Two-Compartment Model01:13

Parameters Affecting Nonlinear Elimination: Zero-Order Input, First-Order Absorption and Two-Compartment Model

119
Drugs administered through various routes can lead to nonlinear elimination, resulting in complex pharmacokinetic behaviors crucial to understanding efficacious drug dosing.
When a drug is administered through a constant intravenous infusion and eliminated via nonlinear pharmacokinetics, it follows zero-order input. For example, oral drugs undergo first-order absorption upon administration and are eliminated through nonlinear pharmacokinetics.
In the case of subcutaneously administered drugs,...
119
Time and frequency -Domain Interpretation of PI Control01:27

Time and frequency -Domain Interpretation of PI Control

196
Proportional-Integral (PI) controllers are essential in many control systems to improve stability and performance. They are commonly used in everyday devices like thermostats to enhance system damping and reduce steady-state error. When the zero in the controller's transfer function is optimally placed, the system benefits significantly in terms of stability and accuracy.
Acting as a low-pass filter, the PI controller slows the system's response and extends settling times. This requires...
196
Reinforcement01:23

Reinforcement

319
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
319
Reinforcement Schedules01:24

Reinforcement Schedules

232
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
232
Operant Conditioning Intervention01:24

Operant Conditioning Intervention

110
Operant conditioning serves as a foundational principle in therapeutic interventions aimed at modifying maladaptive behaviors. Central to this approach is the notion that behaviors, both adaptive and maladaptive, are learned through reinforcement. By analyzing the environmental factors that reinforce problematic behaviors, clinicians can design interventions to weaken these reinforcements and replace maladaptive behaviors with healthier alternatives.
In operant conditioning, behaviors that are...
110

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adaptive Learning Control of Uncertain Systems via Weight and Intrinsic Plasticity-Based Neural Networks.

IEEE transactions on neural networks and learning systems·2026
Same author

Prescribed-rate target tracking for time-delayed systems using output measurements.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Funny-Valen-Tine: Planning Solution Distribution Enhances Machine Abstract Reasoning Ability.

IEEE transactions on neural networks and learning systems·2026
Same author

Inverse Reinforcement Learning for Disturbed Networked Nonlinear Systems With Data Dropouts.

IEEE transactions on neural networks and learning systems·2025
Same author

Nash Equilibrium in Multiplayer Graphical Games via Reinforcement Learning and Distributed Observers.

IEEE transactions on neural networks and learning systems·2025
Same author

Neuroadaptive Control With Enhanced Stability and Reliability.

IEEE transactions on neural networks and learning systems·2025
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026
Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Sep 3, 2025

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
08:18

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

5.0K

Nearly Optimal Control for Mixed Zero-Sum Game Based on Off-Policy Integral Reinforcement Learning.

Ruizhuo Song, Gaofu Yang, Frank L Lewis

    IEEE Transactions on Neural Networks and Learning Systems
    |July 25, 2022
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces an integral reinforcement learning (IRL) algorithm for mixed zero-sum games with unknown nonlinear system dynamics. The method finds optimal control strategies for competitors and collaborators without needing system information.

    More Related Videos

    Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
    07:05

    Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

    Published on: September 10, 2018

    6.1K
    The HoneyComb Paradigm for Research on Collective Human Behavior
    06:48

    The HoneyComb Paradigm for Research on Collective Human Behavior

    Published on: January 19, 2019

    9.5K

    Related Experiment Videos

    Last Updated: Sep 3, 2025

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control
    08:18

    WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

    Published on: August 15, 2020

    5.0K
    Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents
    07:05

    Operant Protocols for Assessing the Cost-benefit Analysis During Reinforced Decision Making by Rodents

    Published on: September 10, 2018

    6.1K
    The HoneyComb Paradigm for Research on Collective Human Behavior
    06:48

    The HoneyComb Paradigm for Research on Collective Human Behavior

    Published on: January 19, 2019

    9.5K

    Area of Science:

    • Control Theory
    • Game Theory
    • Machine Learning

    Background:

    • Solving mixed zero-sum games with unknown nonlinear system dynamics presents significant challenges.
    • Traditional methods often require complete system information, limiting their applicability.

    Purpose of the Study:

    • To develop a novel policy iterative algorithm using integral reinforcement learning (IRL) for mixed zero-sum games.
    • To achieve optimal control for competing and collaborating players in systems with unknown dynamics.

    Main Methods:

    • A policy iterative algorithm employing integral reinforcement learning (IRL) is proposed, which bypasses the need for system information.
    • An adaptive update law integrating a critic-actor structure with experience replay is introduced.
    • Actor functions are designed to approximate optimal control and estimate auxiliary control simultaneously.

    Main Results:

    • The proposed algorithm successfully obtains optimal control strategies for all players.
    • Parameters of the actor-critic structure are updated simultaneously, ensuring efficient learning.
    • Uniform ultimate boundedness of parameter errors in polynomial approximation is mathematically proven.

    Conclusions:

    • The developed IRL-based algorithm effectively solves mixed zero-sum games with unknown nonlinear dynamics.
    • The adaptive critic-actor structure with experience replay offers a robust approach to control optimization.
    • Simulation results validate the algorithm's effectiveness and practical applicability.