Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Buoyancy and Stability for Submerged and Floating Bodies

Buoyancy and Stability for Submerged and Floating Bodies

In fluid mechanics, buoyancy and stability are key concepts for understanding the behavior of submerged and floating bodies. When a stationary body is fully or partially submerged in a fluid, the fluid exerts a force on the body known as the buoyant force. This force acts vertically upward through a point called the center of buoyancy, which is the center of the displaced fluid volume. According to Archimedes' principle, the magnitude of the buoyant force is equal to the weight of the fluid...

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Uniform Depth Channel Flow: Problem Solving

Uniform Depth Channel Flow: Problem Solving

To calculate the flow rate for a trapezoidal channel, first, identify the bottom width, side slope, and flow depth of the channel. The cross-sectional area (A) corresponding to the depth of flow (y), channel bottom width (B), and side slope (θ) is determined by:Next, calculate the wetted perimeter, which includes the bottom width and the sloped side lengths in contact with the water. Using the values of the cross-sectional area and the wetted perimeter, determine the hydraulic radius by...

Avoidance Learning and Learned Helplessness

Avoidance Learning and Learned Helplessness

Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...

Hydraulic Jump: Problem Solving

Hydraulic Jump: Problem Solving

To analyze a hydraulic jump in a rectangular channel with a flow speed of 6 meters per second, follow these steps:Calculate Effective Upstream Velocity:When the downstream gate closes, a hydraulic jump forms, traveling upstream at 2 meters per second. This wave speed combines with the initial channel flow velocity, creating an effective upstream velocity.Identify Flow Velocities Before and After the Hydraulic Jump:Upstream of the hydraulic jump, the effective flow velocity includes both the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Agentic and LLM-Based Multimodal Anomaly Detection: Architectures, Challenges, and Prospects.

Sensors (Basel, Switzerland)·2026

Same author

Macrophages in human atherosclerotic plaques in the era of single-cell and spatial transcriptomics.

ImmunoHorizons·2026

Same author

Editorial: Metabolism in the tumour microenvironment: implications for pathogenesis and therapeutics.

Frontiers in immunology·2026

Same author

The evolution of digital twins from reactive to agentic systems.

Nature computational science·2026

Same author

Digital twin syncing for autonomous surface vessels using reinforcement learning and nonlinear model predictive control.

Scientific reports·2025

Same author

Data-Driven Techniques for Estimating Energy Expenditure in Wheelchair Users.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2025

Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026

Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026

Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026

Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026

Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026

Same journal

Editorial: Synergizing large language models and computational intelligence for advanced robotic systems.

Frontiers in robotics and AI·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Oct 18, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

Comparing Deep Reinforcement Learning Algorithms' Ability to Safely Navigate Challenging Waters.

Thomas Nakken Larsen¹, Halvor Ødegård Teigen¹, Torkel Laache¹

¹Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway.

Frontiers in Robotics and AI

|September 30, 2021

Summary

This summary is machine-generated.

Proximal Policy Optimization (PPO) excels in reinforcement learning for autonomous vehicles, demonstrating superior robustness in path following and collision avoidance across complex environments. Other algorithms struggled with generalization due to sensor limitations and domain gaps.

Keywords:

autonomous surface vehicle collision avoidance deep reinforcement learning machine learning controller path following

More Related Videos

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

Shallow Water Paddling Variants of Water Maze Tests in Mice

Shallow Water Paddling Variants of Water Maze Tests in Mice

Published on: June 3, 2013

Related Experiment Videos

Last Updated: Oct 18, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

Shallow Water Paddling Variants of Water Maze Tests in Mice

Shallow Water Paddling Variants of Water Maze Tests in Mice

Published on: June 3, 2013

Area of Science:

Robotics
Artificial Intelligence
Control Systems

Background:

Reinforcement Learning (RL) controllers are effective for path following and collision avoidance.
Optimizing RL algorithm setups for these dual objectives remains challenging.
Underactuated surface vehicles require sophisticated control strategies.

Purpose of the Study:

To develop a methodology for analyzing RL algorithm performance in path following and collision avoidance.
To compare various RL algorithms in increasingly complex environments.
To identify optimal RL configurations for underactuated surface vehicles.

Main Methods:

Applied a range of RL algorithms to path-following and collision-avoidance tasks.
Analyzed algorithm performance and task-specific behaviors.
Utilized environments of increasing complexity and varied reward functions.
Investigated generalization capabilities with domain gaps and sensor suite dimensionality reduction.

Main Results:

Proximal Policy Optimization (PPO) demonstrated superior robustness to environmental complexity and reward function changes.
PPO generalized effectively to environments with significant domain gaps.
The proposed reward function improved competing algorithms' performance in the training environment.
Sensor dimensionality reduction combined with domain gap impaired generalization for non-PPO algorithms.

Conclusions:

PPO offers a robust solution for path following and collision avoidance in underactuated surface vehicles.
Sensor suite design and domain gap management are critical for RL generalization.
Further research can refine RL strategies for enhanced autonomous navigation.