Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Terlipressin for septic shock patients: a meta-analysis of randomized controlled study.

Journal of intensive care·2019

Same author

One-step synthesis of hexylresorcinol calix[4]arene-capped ZnO-Ag nanocomposites for enhanced degradation of organic pollutants.

Journal of colloid and interface science·2019

Same author

Facile fabrication of visible light photoelectrochemical immunosensor for SCCA detection based on BiOBr/Bi<sub>2</sub>S<sub>3</sub> heterostructures via self-sacrificial synthesis method.

Talanta·2019

Same author

Rice seed priming with sodium selenate: Effects on germination, seedling growth, and biochemical attributes.

Scientific reports·2019

Same author

Coupling S-adenosylmethionine-dependent methylation to growth: Design and uses.

PLoS biology·2019

Same author

A prostate-specific antigen electrochemical immunosensor based on Pd NPs functionalized electroactive Co-MOF signal amplification strategy.

Biosensors & bioelectronics·2019

Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

Same journal

Hierarchical Semantic Concept Modeling for Generalizable Myocardial Pathology Segmentation on Multisequence CMR Images.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 11, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

Safe Adaptive Policy Transfer Reinforcement Learning for Distributed Multiagent Control.

Bin Du, Wei Xie, Yang Li

IEEE Transactions on Neural Networks and Learning Systems

|November 2, 2023

Summary

This summary is machine-generated.

This study introduces a safe adaptive policy transfer reinforcement learning (RL) method. It enables follower agents to learn from a pioneer agent, improving cooperative control and safety.

More Related Videos

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

Related Experiment Videos

Last Updated: Jul 11, 2025

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Real-Time Proxy-Control of Re-Parameterized Peripheral Signals using a Close-Loop Interface

Published on: May 8, 2021

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

WheelCon: A Wheel Control-Based Gaming Platform for Studying Human Sensorimotor Control

Published on: August 15, 2020

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

A Networked Desktop Virtual Reality Setup for Decision Science and Navigation Experiments with Multiple Participants

Published on: August 26, 2018

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Multiagent reinforcement learning (RL) training is challenging due to agent interference and safety constraints.
Existing methods struggle with efficient knowledge transfer and adaptive learning in cooperative multiagent systems.

Purpose of the Study:

To propose a safe adaptive policy transfer RL approach for multiagent cooperative control.
To enhance learning efficiency and safety in multiagent systems through knowledge transfer.

Main Methods:

Introduced a pioneer and follower off-policy policy transfer learning (PFOPT) method.
Enabled transfer of policy representation and sample experience from a pioneer to follower agents.
Utilized Wasserstein distance for adaptive adjustment of learning weights between prior experience and exploration.

Main Results:

Trained distributed agents successfully completed collaborative tasks, maximizing rewards while minimizing constraint violations.
Demonstrated satisfactory performance in learning speed and success rate compared to baseline methods.
The PFOPT method effectively transferred knowledge and adapted learning based on policy distribution differences.

Conclusions:

The proposed safe adaptive policy transfer RL approach significantly improves multiagent cooperative control.
PFOPT offers an efficient and safe solution for complex multiagent training scenarios.
This method provides a robust framework for knowledge sharing and adaptive learning in decentralized systems.