Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Generalization, Discrimination, and Extinction

Generalization, Discrimination, and Extinction

Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Modeling and Similitude

Modeling and Similitude

Scaled modeling is a fundamental technique in engineering, enabling the study of large and complex systems by creating smaller, manageable replicas that recreate critical characteristics of the original. In hydrology and civil infrastructure, for example, scaled models of dams help analyze water flow, turbulence, and pressure. This method allows for accurate predictions of real-world behavior within a controlled environment, significantly reducing the cost and time involved in full-scale...

Primary and Secondary Reinforcers

Primary and Secondary Reinforcers

In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...

Modeling in Therapy

Modeling in Therapy

Modeling, a key technique in therapy, uses observational learning to help clients acquire and practice new skills by watching therapists demonstrate desired behaviors. This approach, rooted in Albert Bandura's concept of vicarious learning, plays a significant role in therapeutic interventions for various psychological conditions, including social anxiety, ADHD, and depression.
Participant Modeling
Participant modeling involves therapists demonstrating calm and effective behaviors in...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample.

PLoS computational biology·2026

Same author

The earlier you know, the smoother you act: anticipatory control in solo and dyadic juggling.

Experimental brain research·2026

Same author

Exploration Strategies and Feature Prioritisation in Contour-based Haptic Perception of 2D Shape.

IEEE transactions on haptics·2026

Same author

Open science practices in behavioral addictions: An exploratory survey.

Journal of behavioral addictions·2026

Same author

[Use of continuous passive motion in inpatient rehabilitation after shoulder replacement-a retrospective study].

Orthopadie (Heidelberg, Germany)·2026

Same author

Hoffa-Kastert Syndrome: A Rare Cause of Acute Knee Blockade.

Indian journal of orthopaedics·2025

Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 3, 2026

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Published on: December 18, 2020

Assessing Transferability From Simulation to Reality for Reinforcement Learning.

Fabio Muratore, Michael Gienger, Jan Peters

IEEE Transactions on Pattern Analysis and Machine Intelligence

|November 15, 2019

Summary

This summary is machine-generated.

This study introduces Simulation-based Policy Optimization with Transferability Assessment (SPOTA) to improve robot control learning. SPOTA reduces simulation optimization bias, enabling policies trained in simulation to transfer directly to real-world robots.

More Related Videos

Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

Published on: September 18, 2017

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Related Experiment Videos

Last Updated: Jan 3, 2026

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Published on: December 18, 2020

Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

Published on: September 18, 2017

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

Area of Science:

Robotics
Machine Learning
Control Theory

Background:

Learning robot control policies in simulation offers efficiency and safety benefits over real-world experiments.
Direct transfer of policies from simulation to reality is hindered by 'Simulation Optimization Bias' (SOB), where policies exploit simulator inaccuracies and risk damaging robots.

Purpose of the Study:

To develop a method for training robot control policies in simulation that are directly transferable to real-world systems.
To address the challenge of Simulation Optimization Bias (SOB) in simulation-based reinforcement learning for robotics.

Main Methods:

Domain randomization was applied by varying physics simulation parameters during policy learning.
A novel algorithm, Simulation-based Policy Optimization with Transferability Assessment (SPOTA), was proposed.
SPOTA incorporates an estimator of SOB to define a training stopping criterion, quantifying overfitting to simulated domains.

Main Results:

The SPOTA algorithm successfully learned control policies exclusively within a randomized simulation environment.
Experimental validation on two nonlinear systems demonstrated the direct applicability of learned policies to real robots without further training.
The SOB estimator effectively quantified over-fitting, guiding the training process.

Conclusions:

SPOTA enables robust robot control policy learning from simulation, overcoming the simulation-to-reality gap.
The method mitigates risks associated with Simulation Optimization Bias, enhancing the reliability of simulated training.
This approach facilitates faster, cheaper, and safer robot development by reducing reliance on physical prototypes.