Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Optimal Foraging00:48

Optimal Foraging

12.7K
How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.
12.7K
Rolling Resistance: Problem Solving01:17

Rolling Resistance: Problem Solving

559
Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...
559

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

WS-SSA: workflow scheduling in cloud computing using salp swarm algorithm.

Scientific reports·2026
Same author

A hybrid gazelle optimization and reptile search algorithm for optimal clustering in wireless sensor networks.

Scientific reports·2025
Same author

Medical image segmentation approach based on hybrid adaptive differential evolution and crayfish optimizer.

Computers in biology and medicine·2024
Same author

Fine tuning deep learning models for breast tumor classification.

Scientific reports·2024
Same author

MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.

BMC bioinformatics·2024
Same author

Deep Learning-Based Approaches for Enhanced Diagnosis and Comprehensive Understanding of Carpal Tunnel Syndrome.

Diagnostics (Basel, Switzerland)·2023
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Nov 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K

Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization

Nesma M Ashraf1, Reham R Mostafa2, Rasha H Sakr1

  • 1Computer Science Department, Faculty of Computers and Information Sciences, Mansoura University, Mansoura, Egypt.

Plos One
|June 10, 2021
PubMed
Summary
This summary is machine-generated.

This study optimizes Deep Deterministic Policy Gradient (DDPG) hyperparameters using the Whale Optimization Algorithm (WOA) for autonomous driving. Optimized DDPG significantly improves rewards and driving stability in simulations.

More Related Videos

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.2K
Automated Interactive Video Playback for Studies of Animal Communication
07:21

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

13.8K

Related Experiment Videos

Last Updated: Nov 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches
09:47

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

1.4K
Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.2K
Automated Interactive Video Playback for Studies of Animal Communication
07:21

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

13.8K

Area of Science:

  • Artificial Intelligence
  • Robotics
  • Control Systems

Background:

  • Deep Reinforcement Learning (DRL) agents learn optimal policies through reward functions without prior environmental knowledge.
  • Hyperparameter tuning is critical for DRL efficiency and performance, presenting a significant challenge.
  • Autonomous driving systems require robust control strategies capable of handling complex, continuous action spaces.

Purpose of the Study:

  • To optimize hyperparameters for the Deep Deterministic Policy Gradient (DDPG) algorithm using a swarm-based approach.
  • To enhance the control strategy of DRL agents in autonomous driving scenarios.
  • To address the challenge of accurate hyperparameter estimation in DRL training.

Main Methods:

  • Employed the Whale Optimization Algorithm (WOA), a swarm-based metaheuristic, for hyperparameter optimization.
  • Applied the optimized DDPG algorithm to an autonomous driving control problem within the TORCS simulation environment.
  • Compared the performance of the DDPG agent with optimized hyperparameters against one with reference hyperparameters.

Main Results:

  • Hyperparameter optimization using WOA led to maximized total rewards for the DDPG agent.
  • The optimized DDPG agent demonstrated improved performance across testing episodes.
  • A more stable driving policy was achieved with the optimized DDPG hyperparameters.

Conclusions:

  • Swarm-based optimization, specifically WOA, is effective for tuning DRL hyperparameters in autonomous driving.
  • Optimized DDPG hyperparameters significantly enhance learning efficiency and policy stability.
  • The proposed method offers a viable solution for improving DRL performance in complex control tasks.