Mobile robots exploration through cnn-based reinforcement learning.

Lei Tai¹, Ming Liu²

¹Department of Mechanical and Biomedical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon Tong, 999077 Hong Kong.

Robotics and Biomimetics

January 10, 2017

Summary

This summary is machine-generated.

This study introduces a novel reinforcement learning method for mobile robot exploration using only depth images. The approach enables robots to navigate unknown corridors and avoid obstacles effectively.

Keywords:

Deep learning, Q-learning, Robot exploration

Area of Science:

Robotics
Artificial Intelligence
Computer Vision

Background:

Autonomous exploration is essential for mobile robots operating in unknown environments.
Traditional methods often require complex environment mapping or explicit programming.
Deep learning advancements offer new possibilities for sensor-based navigation.

Purpose of the Study:

To develop a reinforcement learning (RL) based exploration strategy for mobile robots.
To enable robots to explore unknown corridor environments using only raw sensor data.
To achieve autonomous obstacle avoidance and efficient exploration.

Main Methods:

Utilized a deep Q-network (DQN) architecture for the RL agent.
Employed a pre-trained convolutional neural network (CNN) to extract features from RGB-D sensor depth images.
Trained the RL model in various simulated corridor environments.

Main Results:

The robot controller demonstrated successful exploration capabilities in diverse simulated environments.
The system achieved effective obstacle avoidance using only depth image input.
The RL agent learned an exploration strategy directly from raw sensor information.
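
How a learned Q-function turns into motion commands can be sketched with epsilon-greedy action selection, a common choice when training Q-learning agents. The summary does not state which exploration rule or command set the authors used, so both the rule and the three action names below are assumptions made for illustration.

```python
import random

# Hypothetical discrete command set; the summary does not list the actions.
ACTIONS = ["forward", "turn_left", "turn_right"]


def select_action(q_vals, epsilon, rng=random):
    """Return an action index: random with probability epsilon, else greedy."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_vals))
    return max(range(len(q_vals)), key=lambda a: q_vals[a])


# With epsilon = 0 the choice is purely greedy on the estimated Q-values.
greedy_index = select_action([0.1, 0.9, 0.3], epsilon=0.0)
greedy_command = ACTIONS[greedy_index]
```

A larger epsilon early in training favors trying unfamiliar commands, while a smaller one later favors the commands the agent currently estimates to be best.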

Conclusions:

Reinforcement learning, combined with deep learning feature extraction, provides an effective method for mobile robot exploration.
This approach eliminates the need for explicit environment models or pre-programmed navigation.
It advances autonomous robot navigation from raw sensory input, as demonstrated in simulated corridor environments.