Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Observational Learning01:12

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning because...
State Space Representation01:27

State Space Representation

The frequency-domain technique, commonly used in analyzing and designing feedback control systems, is effective for linear, time-invariant systems. However, it falls short when dealing with nonlinear, time-varying, and multiple-input multiple-output systems. The time-domain or state-space approach addresses these limitations by utilizing state variables to construct simultaneous, first-order differential equations, known as state equations, for an nth-order system.
Consider an RLC circuit, a...
Reinforcement01:23

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
State Space to Transfer Function01:21

State Space to Transfer Function

The conversion of state-space representation to a transfer function is a fundamental process in system analysis. It provides a method for transitioning from a time-domain description to a frequency-domain representation, which is crucial for simplifying the analysis and design of control systems.
The transformation process begins with the state-space representation, characterized by the state equation and the output equation. These equations are typically represented as:
Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
Transfer Function to State Space01:23

Transfer Function to State Space

State-space representation is a powerful tool for simulating physical systems on digital computers, necessitating the conversion of the transfer function into state-space form. Consider an nth-order linear differential equation with constant coefficients, like those encountered in an RLC circuit. The state variables are selected as the output and its n−1 derivatives. Differentiating these variables and substituting them back into the original equation produces the state equations.
In an RLC...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Safe Physics-Informed Machine Learning for Optimal Predefined-Time Stabilization: A Lyapunov-Based Approach.

IEEE transactions on neural networks and learning systems·2025
Same author

Remaining useful life prediction of Lithium-ion batteries using spatio-temporal multimodal attention networks.

Heliyon·2024
Same author

Online and Robust Intermittent Motion Planning in Dynamic and Changing Environments.

IEEE transactions on neural networks and learning systems·2023
Same author

Cooperative Finitely Excited Learning for Dynamical Games.

IEEE transactions on cybernetics·2023
Same author

Safety-Aware Pursuit-Evasion Games in Unknown Environments Using Gaussian Processes and Finite-Time Convergent Reinforcement Learning.

IEEE transactions on neural networks and learning systems·2022
Same author

Adaptive Neural Network Stochastic-Filter-Based Controller for Attitude Tracking With Disturbance Rejection.

IEEE transactions on neural networks and learning systems·2022
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: May 9, 2026

A Fully Automated Rodent Conditioning Protocol for Sensorimotor Integration and Cognitive Control Experiments
09:43

A Fully Automated Rodent Conditioning Protocol for Sensorimotor Integration and Cognitive Control Experiments

Published on: April 15, 2014

Input-to-State Safety for Reinforcement Learning.

Mayank Shekhar Jha, Satya Marthi, Kyriakos G Vamvoudakis

    IEEE Transactions on Neural Networks and Learning Systems
    |May 7, 2026
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces a safe reinforcement learning (RL) method for nonlinear systems with input saturation. It ensures safe exploration and learning, even with noisy inputs, by adaptively adjusting safety constraints.

    More Related Videos

    Rodent Brain Microinjection to Study Molecular Substrates of Motivated Behavior
    10:05

    Rodent Brain Microinjection to Study Molecular Substrates of Motivated Behavior

    Published on: September 16, 2015

    Related Experiment Videos

    Last Updated: May 9, 2026

    A Fully Automated Rodent Conditioning Protocol for Sensorimotor Integration and Cognitive Control Experiments
    09:43

    A Fully Automated Rodent Conditioning Protocol for Sensorimotor Integration and Cognitive Control Experiments

    Published on: April 15, 2014

    Rodent Brain Microinjection to Study Molecular Substrates of Motivated Behavior
    10:05

    Rodent Brain Microinjection to Study Molecular Substrates of Motivated Behavior

    Published on: September 16, 2015

    Area of Science:

    • Control Theory
    • Machine Learning
    • Robotics

    Background:

    • Reinforcement learning (RL) often struggles with safety guarantees in real-world systems, especially under input constraints.
    • Ensuring safety during exploration and learning is critical for deploying RL in safety-critical applications like robotics and autonomous systems.
    • Input saturation in dynamical systems poses significant challenges to traditional control and learning methods.

    Purpose of the Study:

    • To develop a novel off-policy, safe reinforcement learning (RL) approach for nonlinear dynamical systems operating under input saturation.
    • To guarantee safe initialization, exploration, and learning of optimal control laws within system constraints.
    • To rigorously establish the safety, optimality, and stability properties of the proposed RL framework.

    Main Methods:

    • Formulating safe exploration as a robust control problem using input-to-state safe control barrier functions (ISSf-CBFs) to define an enlarged safe set.
    • Proposing a novel epsilon-tuning law for adaptive safety constraint enforcement, encouraging exploration near boundaries while maintaining set invariance.
    • Incorporating a safety-aware cost function and developing a novel off-policy equation under input saturation for learning optimal control laws using neural networks.

    Main Results:

    • The proposed $\epsilon $-tuning law effectively manages exploration noise, enabling efficient state-space exploration without compromising system safety.
    • The framework guarantees safe learning of optimal control laws even under input saturation limits.
    • Mathematical rigor is applied to establish novel safety, optimality, and stability properties of the off-policy safe RL approach.

    Conclusions:

    • The developed off-policy safe reinforcement learning framework effectively addresses safety challenges in nonlinear dynamical systems with input saturation.
    • The approach enables safe initialization, exploration, and learning of control policies, demonstrating high efficacy through simulations.
    • This work provides a robust method for applying RL to safety-critical systems where input constraints are present.