Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement Schedules01:24

Reinforcement Schedules

327
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
327
Reinforcement01:23

Reinforcement

603
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
603
Associative Learning01:27

Associative Learning

865
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
865
Observational Learning01:12

Observational Learning

625
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
625
Operant Conditioning Intervention01:24

Operant Conditioning Intervention

303
Operant conditioning serves as a foundational principle in therapeutic interventions aimed at modifying maladaptive behaviors. Central to this approach is the notion that behaviors, both adaptive and maladaptive, are learned through reinforcement. By analyzing the environmental factors that reinforce problematic behaviors, clinicians can design interventions to weaken these reinforcements and replace maladaptive behaviors with healthier alternatives.
In operant conditioning, behaviors that are...
303
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

258
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
258

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample.

PLoS computational biology·2026
Same author

The earlier you know, the smoother you act: anticipatory control in solo and dyadic juggling.

Experimental brain research·2026
Same author

Exploration Strategies and Feature Prioritisation in Contour-based Haptic Perception of 2D Shape.

IEEE transactions on haptics·2026
Same author

Open science practices in behavioral addictions: An exploratory survey.

Journal of behavioral addictions·2026
Same author

[Use of continuous passive motion in inpatient rehabilitation after shoulder replacement-a retrospective study].

Orthopadie (Heidelberg, Germany)·2026
Same author

Hoffa-Kastert Syndrome: A Rare Cause of Acute Knee Blockade.

Indian journal of orthopaedics·2025
Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026
Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026
Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026
Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026
Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026
Same journal

MPM-based simulation and bounded-error compression of material points for magnetic tactile sensors.

Frontiers in robotics and AI·2026
See all related articles

Related Experiment Video

Updated: Nov 19, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

8.8K

Multi-Channel Interactive Reinforcement Learning for Sequential Tasks.

Dorothea Koert1,2, Maximilian Kircher1, Vildan Salikutluk2,3

  • 1Intelligent Autonomous Systems Group, Department of Computer Science, Technische Universität Darmstadt, Darmstadt, Germany.

Frontiers in Robotics and AI
|January 27, 2021
PubMed
Summary
This summary is machine-generated.

This study introduces a new reinforcement learning system that uses multiple human inputs to help robots learn tasks faster. The system can even handle incorrect human guidance by developing self-confidence, improving robot learning efficiency.

Keywords:
human-centered AIhuman-robot interactioninteractive reinforcement learningrobotic tasksuser studies

More Related Videos

Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.0K
Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning
11:20

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

12.2K

Related Experiment Videos

Last Updated: Nov 19, 2025

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task
10:39

The "Motor" in Implicit Motor Sequence Learning: A Foot-stepping Serial Reaction Time Task

Published on: May 3, 2018

8.8K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.0K
Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning
11:20

Recording Single Neurons' Action Potentials from Freely Moving Pigeons Across Three Stages of Learning

Published on: June 2, 2014

12.2K

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Human-Computer Interaction

Background:

  • Robots need to learn new tasks by combining existing skills for complex applications.
  • Reinforcement learning (RL) is effective but limited by high sample costs and exploration in real-world robotics.
  • Human input can accelerate RL, but systems must handle potentially incorrect guidance from inexperienced users.

Purpose of the Study:

  • To develop and evaluate a unified framework for multi-channel interactive reinforcement learning (IRL) on robotic tasks.
  • To investigate the effectiveness of a novel self-confidence mechanism for handling potentially incorrect human input.
  • To analyze human reactions to robot-initiated corrections and suggestions in an IRL setting.

Main Methods:

  • Implemented a unified framework integrating multiple human input channels for IRL.
  • Introduced a self-confidence concept enabling robots to question human input post-learning.
  • Conducted experiments on two robotic tasks with 20 inexperienced human subjects.

Main Results:

  • The IRL approach successfully accelerated robot learning using human input, even when partially incorrect.
  • The self-confidence mechanism allowed for learning progress despite erroneous human guidance in a targeted task.
  • Human acceptance of robot suggestions varied, influenced by understanding of the learning process.

Conclusions:

  • Multi-channel IRL with self-confidence is a viable method to enhance robot learning efficiency and robustness.
  • Handling incorrect human input requires careful system design and user interface considerations.
  • Future IRL systems should prioritize user understanding and trust to maximize human-robot collaboration effectiveness.