Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multiregulatory hydrogel supramolecular nanomedicine for reprogramming cartilage homeostasis in osteoarthritis.

Materials today. Bio·2026
Same author

A simple method for quantifying microplastics in water utilizing hydrophobic dyes <i>via</i> ultraviolet-visible spectrophotometry.

Analytical methods : advancing methods and applications·2026
Same author

Quantifying ecohydrological boundaries of groundwater-dependent vegetation restoration in a hyper-arid endorheic basin of northwestern China.

Journal of environmental management·2026
Same author

Predicting the onset of myopia in children and adolescents using artificial intelligence: A systematic review and meta-analysis.

Photodiagnosis and photodynamic therapy·2026
Same author

GmTGA9-GmDYT1 Regulates Anther Wall Development to Affect Male Fertility in Soybean.

Plants (Basel, Switzerland)·2026
Same author

A preoperative inflammation-oxidative stress imbalance strongly predicts post-traumatic arthritis after acetabular fracture fixation.

American journal of translational research·2026
Same journal

DSPE-ViT: a lightweight vision transformer with dynamic sparse positional encoding for dense small object detection in UAV imagery.

Frontiers in neurorobotics·2026
Same journal

ST-HONet: Spatio-Temporal Hierarchical Network for long-horizon bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

ST-HADP: Spatio-Temporal hierarchical attention diffusion policy for long-horizon generalizable bimanual visuomotor imitation.

Frontiers in neurorobotics·2026
Same journal

EQISP: efficient quantized image signal processing with multi-scale pyramid fusion for resource constrained embodied perception.

Frontiers in neurorobotics·2026
Same journal

Research on embodied agent multimodal perception and real-time path planning algorithms for complex unstructured environments.

Frontiers in neurorobotics·2026
Same journal

NL-YOLOv5: a model with a larger receptive field and the ability to globally acquire features.

Frontiers in neurorobotics·2026
See all related articles

Related Experiment Video

Updated: May 25, 2025

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

11.5K

Path planning of mobile robot based on improved double deep Q-network algorithm.

Zhenggang Wang1, Shuhong Song1, Shenghui Cheng1

  • 1College of Electrical Engineering, Anhui Polytechnic University, Wuhu, China.

Frontiers in Neurorobotics
|February 28, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces the BiLSTM-D3QN algorithm for improved path planning. It enhances network convergence, stability, and efficiency compared to traditional deep reinforcement learning methods.

Keywords:
BiLSTMDueling Networkdeep reinforcement learningmobile robotpath planning

More Related Videos

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.2K
Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect
09:00

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Published on: December 19, 2016

14.6K

Related Experiment Videos

Last Updated: May 25, 2025

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

11.5K
Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.2K
Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect
09:00

Insect-controlled Robot: A Mobile Robot Platform to Evaluate the Odor-tracking Capability of an Insect

Published on: December 19, 2016

14.6K

Area of Science:

  • Robotics and Artificial Intelligence
  • Machine Learning
  • Path Planning Algorithms

Background:

  • Traditional deep reinforcement learning algorithms face challenges with slow network convergence, unstable reward convergence, and inefficient path planning.
  • Existing methods often struggle in complex environments, leading to suboptimal path lengths and increased planning times.

Purpose of the Study:

  • To propose an advanced path planning algorithm, BiLSTM-D3QN (Bidirectional Long and Short-Term Memory Dueling Double Deep Q-Network), that addresses the limitations of traditional deep reinforcement learning.
  • To enhance network convergence speed, reward convergence stability, and overall path planning efficiency.

Main Methods:

  • Integration of Bidirectional Long Short-Term Memory (BiLSTM) networks for improved memory and decision-making stability.
  • Incorporation of Dueling Network architecture to mitigate Q-value overestimation and accelerate network updates.
  • Implementation of Adaptive Experience Replay with a frequency penalty function for efficient data extraction.
  • Introduction of an adaptive action selection mechanism to optimize exploration.

Main Results:

  • BiLSTM-D3QN demonstrates superior network convergence speed, planning efficiency, reward convergence stability, and success rates in simple environments compared to traditional Deep Reinforcement Learning.
  • In complex environments, BiLSTM-D3QN achieved a 20m shorter path length, 7 fewer turning points, 0.54s faster planning time, and a 10.4% higher success rate than the improved ERDDQN algorithm.

Conclusions:

  • The proposed BiLSTM-D3QN algorithm significantly outperforms existing methods in both convergence speed and path planning performance.
  • BiLSTM-D3QN offers a more stable and efficient solution for complex path planning tasks in reinforcement learning.