Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

Collisions in Multiple Dimensions: Problem Solving

Collisions in Multiple Dimensions: Problem Solving

In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...

Vector Algebra: Graphical Method

Vector Algebra: Graphical Method

Vectors can be multiplied by scalars, added to other vectors, or subtracted from other vectors. The vector sum of two (or more) vectors is called the resultant vector or, for short, the resultant.
We use the laws of geometry to construct resultant vectors, followed by trigonometry to find vector magnitudes and directions. For a geometric construction of the sum of two vectors in a plane, we follow the parallelogram rule. Suppose two vectors are at arbitrary positions. Translate either one of...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Ogive Graph

Ogive Graph

An ogive graph is sometimes called a cumulative frequency polygon. It is one type of frequency polygon that shows cumulative frequency. In other words, the cumulative percentages are added to the graph from left to right. An ogive graph plots cumulative frequency on the vertical y-axis and class boundaries along the horizontal x-axis. It’s very similar to a histogram; only instead of rectangles, an ogive displays a single point where the top right of the rectangle would be. Creating this...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Optimized predefined-time control for high-order nonlinear MASs via ICA and reinforcement learning.

ISA transactions·2026

Same author

NHAPL enables homogeneous detection of RNA-associated glycan signals.

Biosensors & bioelectronics·2026

Same author

Data-Driven Optimized Output Regulation for Markov Jump Linear Systems and Its Application.

IEEE transactions on cybernetics·2026

Same author

Turing instability and pattern formation in a diffusive predator-prey system with opportunistic predators and weak Allee effect.

Physical review. E·2026

Same author

Stochastic-Sampling-Based Event-Triggered Control for Switching Reaction-Diffusion Neural Networks.

IEEE transactions on cybernetics·2026

Same author

CO<sub>2</sub>-to-CO Electrolysis in Pure Water at Ampere-Level Current Density and 1000 h Stability via a Rapid-Transport Fixed-Charge Interface.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026

Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026

Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jun 23, 2025

The HoneyComb Paradigm for Research on Collective Human Behavior

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

超通讯 (HyperComm):在多代理强化学习中的基于超图的通信.

Tianyu Zhu¹, Xinli Shi², Xiangping Xu³

¹School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China.

Neural networks : the official journal of the International Neural Network Society

|June 20, 2024

概括

此摘要是机器生成的。

通过使用超图来进行更具体的通信,HyperComm增强了多代理强化学习 (MARL). 这种新的方法改善了复杂的多代理系统中的合作和性能.

关键词:

集中式培训与分散式执行 (CTDE)图形神经网络 (GNN) 是一个神经网络.多代理通信多代理通信多个代理强化学习 (MARL)多代理系统多代理系统

更多相关视频

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

相关实验视频

Last Updated: Jun 23, 2025

The HoneyComb Paradigm for Research on Collective Human Behavior

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

Automated Interactive Video Playback for Studies of Animal Communication

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

科学领域:

人工智能的人工智能
机器学习机器学习
机器人技术机器人技术机器人技术

背景情况:

有效的沟通对于在多代理强化学习 (MARL) 中的合作至关重要.
在MARL中现有的通信方法可能会导致无关紧要的信息交换,阻碍复杂场景中的性能.
之前的研究还没有充分解决代理联盟对沟通策略的影响.

研究的目的:

介绍HyperComm,这是MARL的一个新的框架,它利用超图来建模代理互动.
为了提高合作MARL设置中代理人之间的沟通的准确性和特异性.
解决当前通信方案在处理复杂的多代理环境和联盟动态方面的局限性.

主要方法:

使用超图结构建模多代理系统.
允许代理人在共享的超边缘中进行有效的沟通.
将基于超图的通信集成到现有的MARL框架中.

主要成果:

在多代理系统中,HyperComm表现出更好的合作和性能.
该框架在具有大量代理的场景中表现出了显著的表现.
与最先进的方法相比,HyperComm减轻了与无关紧要信息交换有关的问题.

结论:

超图提供了一个强大的工具,用于增强MARL的沟通.
超通提供了一个更准确,更具体的沟通策略,合作的代理人.
这种新的方法显著推进了MARL的通信领域,特别是在大型系统中.