Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Generalization, Discrimination, and Extinction01:24

Generalization, Discrimination, and Extinction

1.3K
Generalization, discrimination, and extinction are key concepts in operant conditioning that influence how behaviors are learned and maintained.
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
1.3K
Reinforcement01:23

Reinforcement

758
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
758
Reinforcement Schedules01:24

Reinforcement Schedules

409
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
409
Modeling and Similitude01:12

Modeling and Similitude

562
Scaled modeling is a fundamental technique in engineering, enabling the study of large and complex systems by creating smaller, manageable replicas that recreate critical characteristics of the original. In hydrology and civil infrastructure, for example, scaled models of dams help analyze water flow, turbulence, and pressure. This method allows for accurate predictions of real-world behavior within a controlled environment, significantly reducing the cost and time involved in full-scale...
562
Primary and Secondary Reinforcers01:23

Primary and Secondary Reinforcers

784
In psychology, reinforcement is a key concept in behavior modification. B.F. Skinner demonstrated this with his experiments involving rats in what is known as a Skinner box. The rats learned to press a lever to receive food, a primary reinforcer that fulfilled their innate need for nourishment.
Effective reinforcers for humans vary depending on the individual and the context. Primary reinforcers, such as food, water, sleep, shelter, and pleasure, have inherent value and satisfy basic biological...
784
Modeling in Therapy01:26

Modeling in Therapy

342
Modeling, a key technique in therapy, uses observational learning to help clients acquire and practice new skills by watching therapists demonstrate desired behaviors. This approach, rooted in Albert Bandura's concept of vicarious learning, plays a significant role in therapeutic interventions for various psychological conditions, including social anxiety, ADHD, and depression.
Participant Modeling
Participant modeling involves therapists demonstrating calm and effective behaviors in...
342

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Catecholamine precursor modulation of human exploration: Evidence from a large gender-balanced sample.

PLoS computational biology·2026
Same author

The earlier you know, the smoother you act: anticipatory control in solo and dyadic juggling.

Experimental brain research·2026
Same author

Exploration Strategies and Feature Prioritisation in Contour-based Haptic Perception of 2D Shape.

IEEE transactions on haptics·2026
Same author

Open science practices in behavioral addictions: An exploratory survey.

Journal of behavioral addictions·2026
Same author

[Use of continuous passive motion in inpatient rehabilitation after shoulder replacement-a retrospective study].

Orthopadie (Heidelberg, Germany)·2026
Same author

Hoffa-Kastert Syndrome: A Rare Cause of Acute Knee Blockade.

Indian journal of orthopaedics·2025
Same journal

Relation DETR+: Exploring Explicit Position Relation Prior for Dense Prediction.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought Reasoning.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

CAFE: Cross-View Adaptive Fusion and Cluster Center Enhancement for Robust Multi-View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Ethics-Aware Safe Reinforcement Learning for Rare-Event Risk Control in Interactive Urban Driving.

IEEE transactions on pattern analysis and machine intelligence·2026
Same journal

Learning Shape Anchors for Holistic Indoor Scene Understanding.

IEEE transactions on pattern analysis and machine intelligence·2026
See all related articles

Related Experiment Video

Updated: Jan 3, 2026

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research
07:15

Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

Published on: December 18, 2020

4.9K

Assessing Transferability From Simulation to Reality for Reinforcement Learning.

Fabio Muratore, Michael Gienger, Jan Peters

    IEEE Transactions on Pattern Analysis and Machine Intelligence
    |November 15, 2019
    PubMed
    Summary
    This summary is machine-generated.

    This study introduces Simulation-based Policy Optimization with Transferability Assessment (SPOTA) to improve robot control learning. SPOTA reduces simulation optimization bias, enabling policies trained in simulation to transfer directly to real-world robots.

    More Related Videos

    Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another
    05:12

    Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

    Published on: September 18, 2017

    548.3K
    Investigating Motor Skill Learning Processes with a Robotic Manipulandum
    07:52

    Investigating Motor Skill Learning Processes with a Robotic Manipulandum

    Published on: February 12, 2017

    9.1K

    Related Experiment Videos

    Last Updated: Jan 3, 2026

    Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research
    07:15

    Tactile Vibrating Toolkit and Driving Simulation Platform for Driving-Related Research

    Published on: December 18, 2020

    4.9K
    Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another
    05:12

    Using Virtual Reality to Transfer Motor Skill Knowledge from One Hand to Another

    Published on: September 18, 2017

    548.3K
    Investigating Motor Skill Learning Processes with a Robotic Manipulandum
    07:52

    Investigating Motor Skill Learning Processes with a Robotic Manipulandum

    Published on: February 12, 2017

    9.1K

    Area of Science:

    • Robotics
    • Machine Learning
    • Control Theory

    Background:

    • Learning robot control policies in simulation offers efficiency and safety benefits over real-world experiments.
    • Direct transfer of policies from simulation to reality is hindered by 'Simulation Optimization Bias' (SOB), where policies exploit simulator inaccuracies and risk damaging robots.

    Purpose of the Study:

    • To develop a method for training robot control policies in simulation that are directly transferable to real-world systems.
    • To address the challenge of Simulation Optimization Bias (SOB) in simulation-based reinforcement learning for robotics.

    Main Methods:

    • Domain randomization was applied by varying physics simulation parameters during policy learning.
    • A novel algorithm, Simulation-based Policy Optimization with Transferability Assessment (SPOTA), was proposed.
    • SPOTA incorporates an estimator of SOB to define a training stopping criterion, quantifying overfitting to simulated domains.

    Main Results:

    • The SPOTA algorithm successfully learned control policies exclusively within a randomized simulation environment.
    • Experimental validation on two nonlinear systems demonstrated the direct applicability of learned policies to real robots without further training.
    • The SOB estimator effectively quantified over-fitting, guiding the training process.

    Conclusions:

    • SPOTA enables robust robot control policy learning from simulation, overcoming the simulation-to-reality gap.
    • The method mitigates risks associated with Simulation Optimization Bias, enhancing the reliability of simulated training.
    • This approach facilitates faster, cheaper, and safer robot development by reducing reliance on physical prototypes.