Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Buoyancy and Stability for Submerged and Floating Bodies01:11

Buoyancy and Stability for Submerged and Floating Bodies

2.2K
In fluid mechanics, buoyancy and stability are key concepts for understanding the behavior of submerged and floating bodies. When a stationary body is fully or partially submerged in a fluid, the fluid exerts a force on the body known as the buoyant force. This force acts vertically upward through a point called the center of buoyancy, which is the center of the displaced fluid volume. According to Archimedes' principle, the magnitude of the buoyant force is equal to the weight of the fluid...
2.2K
Observational Learning01:12

Observational Learning

390
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
390
Uniform Depth Channel Flow: Problem Solving01:18

Uniform Depth Channel Flow: Problem Solving

158
To calculate the flow rate for a trapezoidal channel, first, identify the bottom width, side slope, and flow depth of the channel. The cross-sectional area (A) corresponding to the depth of flow (y), channel bottom width (B), and side slope (θ) is determined by:Next, calculate the wetted perimeter, which includes the bottom width and the sloped side lengths in contact with the water. Using the values of the cross-sectional area and the wetted perimeter, determine the hydraulic radius by...
158
Avoidance Learning and Learned Helplessness01:14

Avoidance Learning and Learned Helplessness

2.0K
Avoidance learning and learned helplessness are critical concepts in understanding behavioral responses to negative stimuli.
Avoidance learning occurs when an organism learns that a specific behavior can prevent an unpleasant outcome. For example, a student who receives a bad grade may start studying harder to avoid future poor grades. This behavior persists even when the negative outcome is no longer present. Avoidance learning is powerful because it maintains behavior in the absence of the...
2.0K
Hydraulic Jump: Problem Solving01:16

Hydraulic Jump: Problem Solving

211
To analyze a hydraulic jump in a rectangular channel with a flow speed of 6 meters per second, follow these steps:Calculate Effective Upstream Velocity:When the downstream gate closes, a hydraulic jump forms, traveling upstream at 2 meters per second. This wave speed combines with the initial channel flow velocity, creating an effective upstream velocity.Identify Flow Velocities Before and After the Hydraulic Jump:Upstream of the hydraulic jump, the effective flow velocity includes both the...
211

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Agentic and LLM-Based Multimodal Anomaly Detection: Architectures, Challenges, and Prospects.

Sensors (Basel, Switzerland)·2026
Same author

Macrophages in human atherosclerotic plaques in the era of single-cell and spatial transcriptomics.

ImmunoHorizons·2026
Same author

Editorial: Metabolism in the tumour microenvironment: implications for pathogenesis and therapeutics.

Frontiers in immunology·2026
Same author

The evolution of digital twins from reactive to agentic systems.

Nature computational science·2026
Same author

Digital twin syncing for autonomous surface vessels using reinforcement learning and nonlinear model predictive control.

Scientific reports·2025
Same author

Data-Driven Techniques for Estimating Energy Expenditure in Wheelchair Users.

IEEE transactions on neural systems and rehabilitation engineering : a publication of the IEEE Engineering in Medicine and Biology Society·2025
Same journal

Passive wheels on legged robots: a survey.

Frontiers in robotics and AI·2026
Same journal

Politeness cannot make up for robots' errors.

Frontiers in robotics and AI·2026
Same journal

Workers expect basic social skills but limited autonomy from future robots - a qualitative interview study and taxonomy for robot social skills.

Frontiers in robotics and AI·2026
Same journal

Human-robot interaction in sustainable hospitality: how robot type shapes customer emotions, green perceptions, and service loyalty.

Frontiers in robotics and AI·2026
Same journal

Dynamic variance-aware federated tuning for efficient autonomous vehicle perception under non-IID settings.

Frontiers in robotics and AI·2026
Same journal

Editorial: Synergizing large language models and computational intelligence for advanced robotic systems.

Frontiers in robotics and AI·2026
See all related articles

Related Experiment Video

Updated: Oct 18, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K

Comparing Deep Reinforcement Learning Algorithms' Ability to Safely Navigate Challenging Waters.

Thomas Nakken Larsen1, Halvor Ødegård Teigen1, Torkel Laache1

  • 1Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway.

Frontiers in Robotics and AI
|September 30, 2021
PubMed
Summary
This summary is machine-generated.

Proximal Policy Optimization (PPO) excels in reinforcement learning for autonomous vehicles, demonstrating superior robustness in path following and collision avoidance across complex environments. Other algorithms struggled with generalization due to sensor limitations and domain gaps.

Keywords:
autonomous surface vehiclecollision avoidancedeep reinforcement learningmachine learning controllerpath following

More Related Videos

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants
06:28

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

6.1K
Shallow Water Paddling Variants of Water Maze Tests in Mice
07:47

Shallow Water Paddling Variants of Water Maze Tests in Mice

Published on: June 3, 2013

23.8K

Related Experiment Videos

Last Updated: Oct 18, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K
A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants
06:28

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

6.1K
Shallow Water Paddling Variants of Water Maze Tests in Mice
07:47

Shallow Water Paddling Variants of Water Maze Tests in Mice

Published on: June 3, 2013

23.8K

Area of Science:

  • Robotics
  • Artificial Intelligence
  • Control Systems

Background:

  • Reinforcement Learning (RL) controllers are effective for path following and collision avoidance.
  • Optimizing RL algorithm setups for these dual objectives remains challenging.
  • Underactuated surface vehicles require sophisticated control strategies.

Purpose of the Study:

  • To develop a methodology for analyzing RL algorithm performance in path following and collision avoidance.
  • To compare various RL algorithms in increasingly complex environments.
  • To identify optimal RL configurations for underactuated surface vehicles.

Main Methods:

  • Applied a range of RL algorithms to path-following and collision-avoidance tasks.
  • Analyzed algorithm performance and task-specific behaviors.
  • Utilized environments of increasing complexity and varied reward functions.
  • Investigated generalization capabilities with domain gaps and sensor suite dimensionality reduction.

Main Results:

  • Proximal Policy Optimization (PPO) demonstrated superior robustness to environmental complexity and reward function changes.
  • PPO generalized effectively to environments with significant domain gaps.
  • The proposed reward function improved competing algorithms' performance in the training environment.
  • Sensor dimensionality reduction combined with domain gap impaired generalization for non-PPO algorithms.

Conclusions:

  • PPO offers a robust solution for path following and collision avoidance in underactuated surface vehicles.
  • Sensor suite design and domain gap management are critical for RL generalization.
  • Further research can refine RL strategies for enhanced autonomous navigation.