Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Operant Conditioning Intervention

Operant Conditioning Intervention

Operant conditioning serves as a foundational principle in therapeutic interventions aimed at modifying maladaptive behaviors. Central to this approach is the notion that behaviors, both adaptive and maladaptive, are learned through reinforcement. By analyzing the environmental factors that reinforce problematic behaviors, clinicians can design interventions to weaken these reinforcements and replace maladaptive behaviors with healthier alternatives.
In operant conditioning, behaviors that are...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample.

PLoS computational biology·2026

Same author

The earlier you know, the smoother you act: anticipatory control in solo and dyadic juggling.

Experimental brain research·2026

Same author

Exploration Strategies and Feature Prioritisation in Contour-based Haptic Perception of 2D Shape.

IEEE transactions on haptics·2026

Same author

Open science practices in behavioral addictions: An exploratory survey.

Journal of behavioral addictions·2026

Same author

[Use of continuous passive motion in inpatient rehabilitation after shoulder replacement-a retrospective study].

Orthopadie (Heidelberg, Germany)·2026

Same author

Hoffa-Kastert Syndrome: A Rare Cause of Acute Knee Blockade.

Indian journal of orthopaedics·2025

Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026

Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026

Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026

Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026

Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026

Same journal

MPM-based simulation and bounded-error compression of material points for magnetic tactile sensors.

Frontiers in robotics and AI·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 19, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.

Dorothea Koert^1,2, Maximilian Kircher¹, Vildan Salikutluk^2,3

¹Intelligent Autonomous Systems Group, Department of Computer Science, Technische Universität Darmstadt, Darmstadt, Germany.

Frontiers in Robotics and AI

|January 27, 2021

Summary

This summary is machine-generated.

This study introduces a new reinforcement learning system that uses multiple human inputs to help robots learn tasks faster. The system can even handle incorrect human guidance by developing self-confidence, improving robot learning efficiency.

Keywords:

human-centered AI human-robot interaction interactive reinforcement learning robotic tasks user studies

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

Related Experiment Videos

Last Updated: Nov 19, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

Area of Science:

Robotics
Artificial Intelligence
Human-Computer Interaction

Background:

Robots need to learn new tasks by combining existing skills for complex applications.
Reinforcement learning (RL) is effective but limited by high sample costs and exploration in real-world robotics.
Human input can accelerate RL, but systems must handle potentially incorrect guidance from inexperienced users.

Purpose of the Study:

To develop and evaluate a unified framework for multi-channel interactive reinforcement learning (IRL) on robotic tasks.
To investigate the effectiveness of a novel self-confidence mechanism for handling potentially incorrect human input.
To analyze human reactions to robot-initiated corrections and suggestions in an IRL setting.

Main Methods:

Implemented a unified framework integrating multiple human input channels for IRL.
Introduced a self-confidence concept enabling robots to question human input post-learning.
Conducted experiments on two robotic tasks with 20 inexperienced human subjects.

Main Results:

The IRL approach successfully accelerated robot learning using human input, even when partially incorrect.
The self-confidence mechanism allowed for learning progress despite erroneous human guidance in a targeted task.
Human acceptance of robot suggestions varied, influenced by understanding of the learning process.

Conclusions:

Multi-channel IRL with self-confidence is a viable method to enhance robot learning efficiency and robustness.
Handling incorrect human input requires careful system design and user interface considerations.
Future IRL systems should prioritize user understanding and trust to maximize human-robot collaboration effectiveness.