Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reinforcement01:23

Reinforcement

1.1K
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
1.1K
Observational Learning01:12

Observational Learning

1.1K
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
1.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Diagnostic and clinical significance of KIT(CD117) expression in thymic epithelial tumors in China.

Asian Pacific journal of cancer prevention : APJCP·2012
Same author

Characterization of the high cytochalasin E and rosellichalasin producing-Aspergillus sp. nov. F1 isolated from marine solar saltern in China.

World journal of microbiology & biotechnology·2012
Same author

[Endovascular treatment of middle cerebral artery bifurcation aneurysms].

Zhongguo xiu fu chong jian wai ke za zhi = Zhongguo xiufu chongjian waike zazhi = Chinese journal of reparative and reconstructive surgery·2012
Same author

Fluoxetine attenuates chronic methamphetamine-induced pulmonary arterial remodelling: possible involvement of serotonin transporter and serotonin 1B receptor.

Basic & clinical pharmacology & toxicology·2012
Same author

Selective effects of hydroxyapatite nanoparticles on osteosarcoma cells and osteoblasts.

Journal of materials science. Materials in medicine·2012
Same author

A novel polymerization of ultrathin sensitive imprinted film on surface plasmon resonance sensor.

The Analyst·2012
Same journal

Implementation of Q learning and deep Q network for controlling a self balancing robot model.

Robotics and biomimetics·2019
Same journal

Cognition-based variable admittance control for active compliance in flexible manipulation of heavy objects with a power-assist robotic system.

Robotics and biomimetics·2018
Same journal

PID, BFO-optimized PID, and PD-FLC control of a two-wheeled machine with two-direction handling mechanism: a comparative study.

Robotics and biomimetics·2018
Same journal

Systematic engineering design helps creating new soft machines.

Robotics and biomimetics·2018
Same journal

Hybrid control combined with a voluntary biosignal to control a prosthetic hand.

Robotics and biomimetics·2018
Same journal

A multi-jointed underactuated robot hand with fluid-driven stretchable tubes.

Robotics and biomimetics·2018
See all related articles

Related Experiment Video

Updated: Mar 9, 2026

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.9K

Mobile robots exploration through cnn-based reinforcement learning.

Lei Tai1, Ming Liu2

  • 1Department of Mechanical and Biomedical Engineering, City University of Hong Kong, Tat Chee Avenue, Kowloon Tong, 999077 Hong Kong.

Robotics and Biomimetics
|January 10, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel reinforcement learning method for mobile robot exploration using only depth images. The approach enables robots to navigate unknown corridors and avoid obstacles effectively.

Keywords:
Deep learningQ-learningRobot exploration

More Related Videos

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

12.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.3K

Related Experiment Videos

Last Updated: Mar 9, 2026

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.9K
The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

12.3K
Investigating Motor Skill Learning Processes with a Robotic Manipulandum
07:52

Investigating Motor Skill Learning Processes with a Robotic Manipulandum

Published on: February 12, 2017

9.3K

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Computer Vision

Background:

  • Mobile robot exploration is crucial for unknown environments.
  • Traditional methods often require complex environment mapping or explicit programming.
  • Deep learning advancements offer new possibilities for sensor-based navigation.

Purpose of the Study:

  • To develop a reinforcement learning (RL) based exploration strategy for mobile robots.
  • To enable robots to explore unknown corridor environments using only raw sensor data.
  • To achieve autonomous obstacle avoidance and efficient exploration.

Main Methods:

  • Utilized a deep Q-network (DQN) architecture for the RL agent.
  • Employed a pre-trained convolutional neural network (CNN) to extract features from RGB-D sensor depth images.
  • Trained the RL model in various simulated corridor environments.

Main Results:

  • The robot controller demonstrated successful exploration capabilities in diverse simulated environments.
  • The system achieved effective obstacle avoidance using only depth image input.
  • The RL agent learned an exploration strategy directly from raw sensor information.

Conclusions:

  • Reinforcement learning, combined with deep learning feature extraction, provides an effective method for mobile robot exploration.
  • This approach eliminates the need for explicit environment models or pre-programmed navigation.
  • It represents a significant advancement in autonomous robot navigation using raw sensory input.