


Safe Adaptive Policy Transfer Reinforcement Learning for Distributed Multiagent Control.

Bin Du, Wei Xie, Yang Li

    IEEE Transactions on Neural Networks and Learning Systems | November 2, 2023
    Summary
    This summary is machine-generated.

    This study introduces a safe adaptive policy transfer reinforcement learning (RL) method. It enables follower agents to learn from a pioneer agent, improving cooperative control and safety.


    Area of Science:

    • Artificial Intelligence
    • Machine Learning
    • Robotics

    Background:

    • Multiagent reinforcement learning (RL) training is challenging due to agent interference and safety constraints.
    • Existing methods struggle with efficient knowledge transfer and adaptive learning in cooperative multiagent systems.

    Purpose of the Study:

    • To propose a safe adaptive policy transfer RL approach for multiagent cooperative control.
    • To enhance learning efficiency and safety in multiagent systems through knowledge transfer.

    Main Methods:

    • Introduced a pioneer and follower off-policy policy transfer learning (PFOPT) method.
    • Enabled transfer of policy representation and sample experience from a pioneer to follower agents.
    • Utilized Wasserstein distance for adaptive adjustment of learning weights between prior experience and exploration.
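
The summary does not give the formula that links the Wasserstein distance to the learning weights, so the following is only a minimal sketch of the general idea, not the paper's PFOPT implementation. It assumes one-dimensional action samples of equal size from the pioneer and follower policies, and the names `wasserstein_1d`, `prior_weight`, `mixed_batch`, and the exponential mapping with a `temperature` parameter are all hypothetical choices made for illustration.

```python
import math
import random


def wasserstein_1d(xs, ys):
    """Empirical 1-D Wasserstein-1 distance between two equal-size samples.

    For equal-size samples this is the mean absolute difference between
    the sorted values of the two samples.
    """
    if len(xs) != len(ys) or not xs:
        raise ValueError("samples must be non-empty and of equal size")
    return sum(abs(a - b) for a, b in zip(sorted(xs), sorted(ys))) / len(xs)


def prior_weight(distance, temperature=1.0):
    """Assumed mapping (not from the paper): the weight on pioneer
    experience decays exponentially as the policies drift apart."""
    return math.exp(-distance / temperature)


def mixed_batch(pioneer_buffer, follower_buffer, weight, batch_size, rng):
    """Draw a batch in which roughly a `weight` share of the samples comes
    from the pioneer's experience and the rest from the follower's own."""
    n_prior = round(weight * batch_size)
    batch = [rng.choice(pioneer_buffer) for _ in range(n_prior)]
    batch += [rng.choice(follower_buffer) for _ in range(batch_size - n_prior)]
    return batch


if __name__ == "__main__":
    rng = random.Random(0)
    pioneer_actions = [0.0, 0.1, 0.2, 0.3]   # illustrative values only
    follower_actions = [0.5, 0.6, 0.7, 0.8]  # illustrative values only
    d = wasserstein_1d(pioneer_actions, follower_actions)
    w = prior_weight(d)
    batch = mixed_batch(["pioneer"] * 10, ["follower"] * 10, w, 8, rng)
    print(d, w, batch.count("pioneer"))
```

In this sketch, a follower whose action distribution is close to the pioneer's leans more on transferred experience, while a follower that has diverged relies more on its own exploration. Any monotonically decreasing mapping would serve the same illustrative purpose.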

    Main Results:

    • Trained distributed agents successfully completed collaborative tasks, maximizing rewards while minimizing constraint violations.
    • Demonstrated satisfactory performance in learning speed and success rate compared to baseline methods.
    • The PFOPT method effectively transferred knowledge and adapted learning based on policy distribution differences.

    Conclusions:

    • The proposed safe adaptive policy transfer RL approach significantly improves multiagent cooperative control.
    • PFOPT offers an efficient and safe solution for complex multiagent training scenarios.
    • This method provides a robust framework for knowledge sharing and adaptive learning in decentralized systems.