Jove
Visualize
お問い合わせ
JoVE
x logofacebook logolinkedin logoyoutube logo
JoVEについて
概要リーダーシップブログJoVEヘルプセンター
著者向け
出版プロセス編集委員会範囲と方針査読よくある質問投稿
図書館員向け
推薦の声購読アクセスリソース図書館諮問委員会よくある質問
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experimentsアーカイブ
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教員リソースセンター教員サイト
利用規約
プライバシーポリシー
ポリシー

関連する概念動画

Reinforcement01:23

Reinforcement

786
Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:
786
Optimal Foraging00:48

Optimal Foraging

13.4K
How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.
13.4K
Observational Learning01:12

Observational Learning

791
Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...
791
Collisions in Multiple Dimensions: Problem Solving01:06

Collisions in Multiple Dimensions: Problem Solving

5.2K
In multiple dimensions, the conservation of momentum applies in each direction independently. Hence, to solve collisions in multiple dimensions, we should write down the momentum conservation in each direction separately. To help understand collisions in multiple dimensions, consider an example.
A small car of mass 1,200 kg traveling east at 60 km/h collides at an intersection with a truck of mass 3,000 kg traveling due north at 40 km/h. The two vehicles are locked together. What is the...
5.2K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.8K
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

373
Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
373

こちらも読む

関連記事

共著者、ジャーナル、引用グラフによってこの研究に関連する記事。

並び替え
Same author

STE-DC2I Uncovers Driver Genes in Colorectal Cancer Subtypes Using Symbolic Trajectory-Embedded Dark Causal Inference.

Journal of chemical information and modeling·2026
Same author

M-Net: Multiscale hierarchical fusion with dual natural patch attention for spatial-Temporal time series forecasting.

Neural networks : the official journal of the International Neural Network Society·2026
Same author

Application of an l-Cysteine-Enhanced Ag NCs@C NF Amplification-Free ECL Biosensor for miRNA.

Analytical chemistry·2026
Same author

Photoelectrochemical detection of epigenetic 5-hydroxymethylcytosine based on Cu<sub>2</sub>O@CuO@Ag and self-triggered isothermal amplification.

Analytica chimica acta·2026
Same author

Signaling-Driven Incentive Communication for Enhanced Multiagent Reinforcement Learning in Dynamic Environments.

IEEE transactions on cybernetics·2025
Same author

An Efficient Electrochemiluminescence Biosensor Based on Ru(bpy)<sub>3</sub><sup>2+</sup>@AuNPs@SWCNTs for miRNAs Detection Using a Dual Engine-Triggered DNA Walker.

Analytical chemistry·2025
Same journal

Dynamic analysis and reliable mechanical optimization application of ring HNN effected with a memristive neuron.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

DAFF-Net: A detection and search method for small-scale low surface brightness galaxies.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Quasi-synchronization for complex networks with hybrid pinning intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Physics-encoded convolutional neural operators for parametric PDEs: A convergence-guaranteed framework via pre-computed kernel fields.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Exploiting audio-visual modalities in videos: Object detection via multi-stage bilateral coupling network.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Reliability-aware modality completion with cross-modal distillation for federated learning with missing modalities.

Neural networks : the official journal of the International Neural Network Society·2026
関連記事をすべて見る

関連する実験動画

Updated: Jan 8, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K

協力のための進化集団を持つグラフベースマルチエージェント強化学習

Kexing Peng1, Hanwen Qi1, Tinghuai Ma2

  • 1School of Computer Science, Nanjing University of Information Science & Technology, Nanjing, 210044, China.

Neural networks : the official journal of the International Neural Network Society
|December 12, 2025
PubMed
まとめ
この要約は機械生成です。

この研究は、複雑なタスクにおける協調を強化する新しいマルチエージェント強化学習(MARL)フレームワークであるGDEを紹介します。GDEは、グラフベースの価値分解と段階的な進化的ポリシー最適化を組み合わせて、エージェントのパフォーマンスを向上させます。

キーワード:
進化アルゴリズムグラフニューラルネットワークマルチエージェント強化学習

さらに関連する動画

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

12.1K
Predicting the Effectiveness of Population Replacement Strategy Using Mathematical Modeling
20:36

Predicting the Effectiveness of Population Replacement Strategy Using Mathematical Modeling

Published on: July 4, 2007

9.1K

関連する実験動画

Last Updated: Jan 8, 2026

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm
11:53

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

13.4K
The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy
11:53

The Modular Design and Production of an Intelligent Robot Based on a Closed-Loop Control Strategy

Published on: October 14, 2017

12.1K
Predicting the Effectiveness of Population Replacement Strategy Using Mathematical Modeling
20:36

Predicting the Effectiveness of Population Replacement Strategy Using Mathematical Modeling

Published on: July 4, 2007

9.1K

科学分野:

  • 人工知能
  • ロボット工学
  • コンピュータサイエンス

背景:

  • 既存のマルチエージェント強化学習(MARL)法は、エージェントの観測が限られており、相互作用が動的であるため、複雑な協調タスクへのスケーリングに課題があります。
  • タスクの複雑さとポリシー空間が増加すると、最適なポリシーへの収束が困難になり、安定したポリシー評価に影響を与えます。

研究 の 目的:

  • スケーラビリティと収束の問題を解決するために設計されたMARLフレームワークであるGDEを提案すること。
  • 状態の合意なしに、動的な環境におけるエージェントの協調と情報伝播を強化すること。

主な方法:

  • GDEは、グラフベースの価値分解と段階的な進化的ポリシー最適化を統合します。
  • 進化アルゴリズム(EA)は、ポリシー探索と収束を改善するために、勾配フリーのランダムサーチに利用されます。
  • グラフニューラルネットワーク(GNN)は、エージェントの受容野を拡張し、情報伝播を容易にするために採用され、動的なデータでの安定した収束のために順列不変性を利用します。

主要な成果:

  • GDEは、StarCraft IIのマイクロマネジメント、MAMuJoCoロボット協調、SUMO自律走行を含む複雑な協調タスクで優れたパフォーマンスを発揮します。
  • このフレームワークは、マルチエージェントチーム形成とGNNを通じて複雑な協調ダイナミクスを効果的に捉えます。
  • 実験結果は、GDEフレームワーク内の各モジュールの有効性と必要性を検証します。

結論:

  • GDEは、MARLにおける協調とポリシー収束を強化するための堅牢なソリューションを提供します。
  • グラフベースの分解と進化的最適化の提案された組み合わせは、複雑なマルチエージェントシステムに効果的です。
  • フレームワークのモジュール設計と適応性により、多様な実世界のアプリケーションに適しています。