Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Reinforcement Schedules01:24

Reinforcement Schedules

142
Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...
142
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

3.8K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
3.8K
Vector Algebra: Graphical Method01:10

Vector Algebra: Graphical Method

12.0K
Vectors can be multiplied by scalars, added to other vectors, or subtracted from other vectors. The vector sum of two (or more) vectors is called the resultant vector or, for short, the resultant.
We use the laws of geometry to construct resultant vectors, followed by trigonometry to find vector magnitudes and directions. For a geometric construction of the sum of two vectors in a plane, we follow the parallelogram rule. Suppose two vectors are at arbitrary positions. Translate either one of...
12.0K
Associative Learning01:27

Associative Learning

335
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
335
Ogive Graph01:07

Ogive Graph

5.6K
An ogive graph is sometimes called a cumulative frequency polygon. It is one type of frequency polygon that shows cumulative frequency. In other words, the cumulative percentages are added to the graph from left to right. An ogive graph plots cumulative frequency on the vertical y-axis and class boundaries along the horizontal x-axis. It’s very similar to a histogram; only instead of rectangles, an ogive displays a single point where the top right of the rectangle would be. Creating this...
5.6K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

106
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...
106

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Optimized predefined-time control for high-order nonlinear MASs via ICA and reinforcement learning.

ISA transactions·2026
Same author

NHAPL enables homogeneous detection of RNA-associated glycan signals.

Biosensors & bioelectronics·2026
Same author

Data-Driven Optimized Output Regulation for Markov Jump Linear Systems and Its Application.

IEEE transactions on cybernetics·2026
Same author

Turing instability and pattern formation in a diffusive predator-prey system with opportunistic predators and weak Allee effect.

Physical review. E·2026
Same author

Stochastic-Sampling-Based Event-Triggered Control for Switching Reaction-Diffusion Neural Networks.

IEEE transactions on cybernetics·2026
Same author

CO<sub>2</sub>-to-CO Electrolysis in Pure Water at Ampere-Level Current Density and 1000 h Stability via a Rapid-Transport Fixed-Charge Interface.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

IGFD-Net: Illumination-guided frequency decoupling for polarization image fusion.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Multiple-Strategies dung beetle optimizer and its applications in engineering optimization and bankruptcy prediction.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Aggregating global-scale pixel-wise forgery cues within a graph.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Finite-Time intermittent control for secure synchronization of Neutral-Type stochastic delayed neural networks under aperiodic DoS attacks.

Neural networks : the official journal of the International Neural Network Society·2026
查看所有相关文章

相关实验视频

Updated: Jun 23, 2025

The HoneyComb Paradigm for Research on Collective Human Behavior
06:48

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

9.3K

超通讯 (HyperComm):在多代理强化学习中的基于超图的通信.

Tianyu Zhu1, Xinli Shi2, Xiangping Xu3

  • 1School of Cyber Science and Engineering, Southeast University, Nanjing 210096, China.

Neural networks : the official journal of the International Neural Network Society
|June 20, 2024
PubMed
概括
此摘要是机器生成的。

通过使用超图来进行更具体的通信,HyperComm增强了多代理强化学习 (MARL). 这种新的方法改善了复杂的多代理系统中的合作和性能.

关键词:
集中式培训与分散式执行 (CTDE)图形神经网络 (GNN) 是一个神经网络.多代理通信多代理通信多个代理强化学习 (MARL)多代理系统多代理系统

更多相关视频

Automated Interactive Video Playback for Studies of Animal Communication
07:21

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

13.4K
The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm
06:18

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

2.0K

相关实验视频

Last Updated: Jun 23, 2025

The HoneyComb Paradigm for Research on Collective Human Behavior
06:48

The HoneyComb Paradigm for Research on Collective Human Behavior

Published on: January 19, 2019

9.3K
Automated Interactive Video Playback for Studies of Animal Communication
07:21

Automated Interactive Video Playback for Studies of Animal Communication

Published on: February 9, 2011

13.4K
The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm
06:18

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

2.0K

科学领域:

  • 人工智能的人工智能
  • 机器学习 机器学习
  • 机器人技术 机器人技术 机器人技术

背景情况:

  • 有效的沟通对于在多代理强化学习 (MARL) 中的合作至关重要.
  • 在MARL中现有的通信方法可能会导致无关紧要的信息交换,阻碍复杂场景中的性能.
  • 之前的研究还没有充分解决代理联盟对沟通策略的影响.

研究的目的:

  • 介绍HyperComm,这是MARL的一个新的框架,它利用超图来建模代理互动.
  • 为了提高合作MARL设置中代理人之间的沟通的准确性和特异性.
  • 解决当前通信方案在处理复杂的多代理环境和联盟动态方面的局限性.

主要方法:

  • 使用超图结构建模多代理系统.
  • 允许代理人在共享的超边缘中进行有效的沟通.
  • 将基于超图的通信集成到现有的MARL框架中.

主要成果:

  • 在多代理系统中,HyperComm表现出更好的合作和性能.
  • 该框架在具有大量代理的场景中表现出了显著的表现.
  • 与最先进的方法相比,HyperComm减轻了与无关紧要信息交换有关的问题.

结论:

  • 超图提供了一个强大的工具,用于增强MARL的沟通.
  • 超通提供了一个更准确,更具体的沟通策略,合作的代理人.
  • 这种新的方法显著推进了MARL的通信领域,特别是在大型系统中.