Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Optimal Foraging

Optimal Foraging

How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.

Rolling Resistance: Problem Solving

Rolling Resistance: Problem Solving

Rolling resistance, also known as rolling friction, is the force that resists the motion of a rolling object, such as a wheel, tire, or ball, when it moves over a surface. It is caused by the deformation of the object and the surface in contact with each other, as well as other factors like internal friction, hysteresis, and energy losses within the materials. Rolling resistance opposes the object's motion, requiring additional energy to overcome it and maintain movement. In practical...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

WS-SSA: workflow scheduling in cloud computing using salp swarm algorithm.

Scientific reports·2026

Same author

A hybrid gazelle optimization and reptile search algorithm for optimal clustering in wireless sensor networks.

Scientific reports·2025

Same author

Medical image segmentation approach based on hybrid adaptive differential evolution and crayfish optimizer.

Computers in biology and medicine·2024

Same author

Fine tuning deep learning models for breast tumor classification.

Scientific reports·2024

Same author

MAC-ErrorReads: machine learning-assisted classifier for filtering erroneous NGS reads.

BMC bioinformatics·2024

Same author

Deep Learning-Based Approaches for Enhanced Diagnosis and Comprehensive Understanding of Carpal Tunnel Syndrome.

Diagnostics (Basel, Switzerland)·2023

Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026

Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026

Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026

Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026

Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026

Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Nov 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

Optimizing hyperparameters of deep reinforcement learning for autonomous driving based on whale optimization

Nesma M Ashraf¹, Reham R Mostafa², Rasha H Sakr¹

¹Computer Science Department, Faculty of Computers and Information Sciences, Mansoura University, Mansoura, Egypt.

|June 10, 2021

Summary

This summary is machine-generated.

This study optimizes Deep Deterministic Policy Gradient (DDPG) hyperparameters using the Whale Optimization Algorithm (WOA) for autonomous driving. Optimized DDPG significantly improves rewards and driving stability in simulations.

More Related Videos

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

Related Experiment Videos

Last Updated: Nov 2, 2025

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Author Spotlight: Advancing Alzheimer's Research – Exploring Early Detection and Multi-Omics Approaches

Published on: December 15, 2023

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

Area of Science:

Artificial Intelligence
Robotics
Control Systems

Background:

Deep Reinforcement Learning (DRL) agents learn optimal policies through reward functions without prior environmental knowledge.
Hyperparameter tuning is critical for DRL efficiency and performance, presenting a significant challenge.
Autonomous driving systems require robust control strategies capable of handling complex, continuous action spaces.

Purpose of the Study:

To optimize hyperparameters for the Deep Deterministic Policy Gradient (DDPG) algorithm using a swarm-based approach.
To enhance the control strategy of DRL agents in autonomous driving scenarios.
To address the challenge of accurate hyperparameter estimation in DRL training.

Main Methods:

Employed the Whale Optimization Algorithm (WOA), a swarm-based metaheuristic, for hyperparameter optimization.
Applied the optimized DDPG algorithm to an autonomous driving control problem within the TORCS simulation environment.
Compared the performance of the DDPG agent with optimized hyperparameters against one with reference hyperparameters.

Main Results:

Hyperparameter optimization using WOA led to maximized total rewards for the DDPG agent.
The optimized DDPG agent demonstrated improved performance across testing episodes.
A more stable driving policy was achieved with the optimized DDPG hyperparameters.

Conclusions:

Swarm-based optimization, specifically WOA, is effective for tuning DRL hyperparameters in autonomous driving.
Optimized DDPG hyperparameters significantly enhance learning efficiency and policy stability.
The proposed method offers a viable solution for improving DRL performance in complex control tasks.