Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Behavior Modification

Behavior Modification

Behavioral approaches have often been criticized for ignoring mental processes and focusing solely on observable behavior. However, these approaches provide an optimistic perspective for individuals seeking to change their behaviors. Rather than concentrating on intrinsic personality traits, behavioral approaches suggest that even longstanding habits can be modified by changing the reward contingencies that maintain them.
A real-world application of operant conditioning principles is applied...

Law of Effect

Law of Effect

B.F. Skinner, a prominent figure in behavioral psychology, introduced operant conditioning by emphasizing the role of consequences in shaping behavior. This theory builds upon the law of effect proposed by Edward Thorndike, which posits that behaviors followed by satisfying outcomes are likely to be repeated. In contrast, those followed by unsatisfying outcomes are less likely to recur.
Edward Thorndike's foundational work involved studying learning in animals, particularly using puzzle...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Classification of Systems-I

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:

Feedback control systems

Feedback control systems

Feedback control systems are categorized in various ways based on their design, analysis, and signal types.
Linear feedback systems are theoretical models that simplify analysis and design. These systems operate under the principle that their output is directly proportional to their input within certain ranges. For instance, an amplifier in a control system behaves linearly as long as the input signal remains within a specific range. However, most physical systems exhibit inherent nonlinearity...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

The role of ferroptosis regulators in the prognosis, immune activity and gemcitabine resistance of pancreatic cancer.

Annals of translational medicine·2020

Same author

The vascular endothelial growth factor trap aflibercept induces vascular dysfunction and hypertension via attenuation of eNOS/NO signaling in mice.

Acta pharmacologica Sinica·2020

Same author

MBNL1 regulates isoproterenol-induced myocardial remodelling in vitro and in vivo.

Journal of cellular and molecular medicine·2020

Same author

Platelets Stimulate Liver Regeneration in a Rat Model of Partial Liver Transplantation.

Liver transplantation : official publication of the American Association for the Study of Liver Diseases and the International Liver Transplantation Society·2020

Same author

Mutant p53 in Cancer Progression and Targeted Therapies.

Frontiers in oncology·2020

Same author

Downregulation of astrocyte elevated gene-1 expression inhibits the development of vasculogenic mimicry in gliomas.

Experimental and therapeutic medicine·2020

Same journal

An Evolutionary Algorithm Assisted by an Ensemble of Pareto-Optimal Surrogate Models.

IEEE transactions on cybernetics·2026

Same journal

A Quantum Self-Attention Neural Network Model on Quantum Circuits.

IEEE transactions on cybernetics·2026

Same journal

Semi-Explicit Solution of Some Discrete-Time Higher-Order-Cost Mean-Field-Type Control.

IEEE transactions on cybernetics·2026

Same journal

A Novel One-Step Small Object Detector for Autonomous Aerial Vehicles.

IEEE transactions on cybernetics·2026

Same journal

Online Data-Driven-Based Optimal Output Tracking Control Without Initial Stabilizing Policy.

IEEE transactions on cybernetics·2026

Same journal

Digital Redesign-Based Interval State Estimation for Continuous Systems With Aperiodic Discrete Measurements.

IEEE transactions on cybernetics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 11, 2026

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

Human Behavior Identification for Linear Systems in Adversarial Environments by Adaptive Inverse Reinforcement

Mi Wang, Huai-Ning Wu, Jingbo Fu

IEEE Transactions on Cybernetics

|November 19, 2025

Summary

This summary is machine-generated.

This study introduces a new method for identifying human behavior in human-in-the-loop (HiTL) systems facing adversarial conditions. The approach uses adaptive inverse reinforcement learning (IRL) to understand human decision-making without needing control input data.

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Decoding Natural Behavior from Neuroethological Embedding

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

Related Experiment Videos

Last Updated: Jan 11, 2026

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

A Real-Time Interactive System for Studying Confrontational Pursuit Behavior in Rodents

Published on: May 16, 2025

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Decoding Natural Behavior from Neuroethological Embedding

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

Area of Science:

Control Systems Engineering
Artificial Intelligence
Human-Computer Interaction

Background:

Human-in-the-loop (HiTL) systems are increasingly complex, especially in adversarial environments.
Identifying human behavior is crucial for predicting system performance and ensuring safety.
Existing methods for human behavior identification often have limitations, such as requiring persistent excitation or direct measurement of control inputs.

Purpose of the Study:

To develop a novel method for human behavior identification in linear HiTL systems operating in adversarial settings.
To overcome limitations of existing approaches by removing the need for persistent excitation and control input measurement.
To model the human and adversarial environment as players in a zero-sum differential game.

Main Methods:

Formulated the HiTL system as a linear-quadratic zero-sum differential game.
Transformed human behavior identification into an inverse reinforcement learning (IRL) problem.
Proposed an integral concurrent learning (ICL) law to estimate the human's feedback matrix.
Retrieved human cost function weighting matrices by minimizing a residual based on the estimated feedback matrix.

Main Results:

Successfully estimated the human feedback matrix using the proposed ICL law.
Accurately retrieved human cost function weighting matrices.
Demonstrated the method's validity through simulations and experiments in a vehicle lane-keeping scenario.
Validated the adaptive-IRL-based human behavior identification strategy.

Conclusions:

The proposed adaptive-IRL-based strategy effectively identifies human behavior in adversarial HiTL systems.
The method removes the need for persistent excitation and control input measurement, offering a significant advantage over existing techniques.
This research contributes to more robust and predictable human-AI interaction in safety-critical applications.