Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Multiregulatory hydrogel supramolecular nanomedicine for reprogramming cartilage homeostasis in osteoarthritis.

Materials today. Bio·2026

Same author

A simple method for quantifying microplastics in water utilizing hydrophobic dyes <i>via</i> ultraviolet-visible spectrophotometry.

Analytical methods : advancing methods and applications·2026

Same author

Quantifying ecohydrological boundaries of groundwater-dependent vegetation restoration in a hyper-arid endorheic basin of northwestern China.

Journal of environmental management·2026

Same author

Predicting the onset of myopia in children and adolescents using artificial intelligence: A systematic review and meta-analysis.

Photodiagnosis and photodynamic therapy·2026

Same author

GmTGA9-GmDYT1 Regulates Anther Wall Development to Affect Male Fertility in Soybean.

Plants (Basel, Switzerland)·2026

Same author

A preoperative inflammation-oxidative stress imbalance strongly predicts post-traumatic arthritis after acetabular fracture fixation.

American journal of translational research·2026

Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026

Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026

Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026

Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026

Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 25, 2025

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

Path planning of mobile robot based on improved double deep Q-network algorithm.

Zhenggang Wang¹, Shuhong Song¹, Shenghui Cheng¹

¹College of Electrical Engineering, Anhui Polytechnic University, Wuhu, China.

Frontiers in Neurorobotics

|February 28, 2025

Summary

This summary is machine-generated.

This study introduces the BiLSTM-D3QN algorithm for improved path planning. It enhances network convergence, stability, and efficiency compared to traditional deep reinforcement learning methods.

Keywords:

BiLSTM Dueling Network deep reinforcement learning mobile robot path planning

More Related Videos

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Published on: December 19, 2016

Related Experiment Videos

Last Updated: May 25, 2025

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Published on: December 19, 2016

Area of Science:

Robotics and Artificial Intelligence
Machine Learning
Path Planning Algorithms

Background:

Traditional deep reinforcement learning algorithms face challenges with slow network convergence, unstable reward convergence, and inefficient path planning.
Existing methods often struggle in complex environments, leading to suboptimal path lengths and increased planning times.

Purpose of the Study:

To propose an advanced path planning algorithm, BiLSTM-D3QN (Bidirectional Long and Short-Term Memory Dueling Double Deep Q-Network), that addresses the limitations of traditional deep reinforcement learning.
To enhance network convergence speed, reward convergence stability, and overall path planning efficiency.

Main Methods:

Integration of Bidirectional Long Short-Term Memory (BiLSTM) networks for improved memory and decision-making stability.
Incorporation of Dueling Network architecture to mitigate Q-value overestimation and accelerate network updates.
Implementation of Adaptive Experience Replay with a frequency penalty function for efficient data extraction.
Introduction of an adaptive action selection mechanism to optimize exploration.

Main Results:

BiLSTM-D3QN demonstrates superior network convergence speed, planning efficiency, reward convergence stability, and success rates in simple environments compared to traditional Deep Reinforcement Learning.
In complex environments, BiLSTM-D3QN achieved a 20m shorter path length, 7 fewer turning points, 0.54s faster planning time, and a 10.4% higher success rate than the improved ERDDQN algorithm.

Conclusions:

The proposed BiLSTM-D3QN algorithm significantly outperforms existing methods in both convergence speed and path planning performance.
BiLSTM-D3QN offers a more stable and efficient solution for complex path planning tasks in reinforcement learning.