Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Masking and Demasking Agents

Masking and Demasking Agents

EDTA titrations may necessitate masking and demasking agents to temporarily protect a particular metal ion in a mixture from the EDTA reaction. These agents facilitate the sequential analysis of the metal ions by forming stable complexes with some—but not all—metal ions during certain steps.
There are many masking agents, such as cyanide, fluoride, triethanolamine, thiourea, and 2,3-bis(sulfanyl)propan-1-ol (formerly 2,3-dimercapto-1-propanol), with the masking agent chosen based on...

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

Decision Making: Traditional Method

Decision Making: Traditional Method

The process of hypothesis testing based on the traditional method includes calculating the critical value, testing the value of the test statistic using the sample data, and interpreting these values.
First, a specific claim about the population parameter is decided based on the research question and is stated in a simple form. Further, an opposing statement to this claim is also stated. These statements can act as null and alternative hypotheses, out of which a null hypothesis would be a...

Distribution Reliability and Automation

Distribution Reliability and Automation

Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...

Decision Making

Decision Making

Decision-making is a fundamental cognitive process that involves evaluating alternatives and selecting among them. This process can range from simple choices, such as deciding what to wear, to complex decisions, like choosing a major in college or a career path. The complexity of the decision often dictates the approach we use, which can be broadly categorized into two types: automatic and controlled decision-making.
Automatic decision-making is fast, intuitive, and relies on gut feelings...

Stability of Equilibrium Configuration: Problem Solving

Stability of Equilibrium Configuration: Problem Solving

The stability of equilibrium configurations is an important concept in physics, engineering, and other related fields. In simple terms, it refers to the tendency of an object or system to return to its equilibrium position after being disturbed. The stability of an equilibrium configuration can be analyzed by considering the potential energy function of the system and examining its behavior near the equilibrium point.
Problem-solving in the context of the stability of equilibrium configuration...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

WormSORT: A detection-based multiple object tracking model for individual silkworms in breeding environments.

PLoS computational biology·2026

Same author

Chromosome-level genome and population genomics reveal demographic history, incomplete divergence and coastal adaptation of <i>Rhododendron simsii</i> var. <i>putuoense</i> in East China.

Plant diversity·2026

Same author

Characterization of selenium-enriched Lactiplantibacillus plantarum and its effects on egg selenium deposition and quality in laying hens.

Poultry science·2026

Same author

Research progress on chemical metabolites, processing technologies, and pharmacological activities of asperosaponin VI: a systematic review and critical evaluation.

Frontiers in pharmacology·2026

Same author

Artificial intelligence-assisted detection of epileptic spasms using electroencephalographic-video analysis.

Epilepsia·2026

Same author

Epidemiological characteristics and incidence prediction analysis of brucellosis in Bayingolin mongol autonomous prefecture, Xinjiang.

BMC infectious diseases·2026

Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026

Same journal

CAFF-CIL: Causality-Aware Freedom Forgetting Approach for Class-Incremental Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Harmonic Autoencoding Framework for Multiple Tasks in Magnetic Particle Imaging Reconstruction.

IEEE transactions on neural networks and learning systems·2026

Same journal

A Survey on Human-Centric Voice-Face Multimodal Learning.

IEEE transactions on neural networks and learning systems·2026

Same journal

Vision-Assisted Foundation Model for Solving Multitask Vehicle Routing Problems.

IEEE transactions on neural networks and learning systems·2026

Same journal

FP3O: Enabling Proximal Policy Optimization in Multiagent Cooperation With Parameter-Sharing Versatility.

IEEE transactions on neural networks and learning systems·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Aug 2, 2025

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

Multiagent Trust Region Policy Optimization.

Hepeng Li, Haibo He

IEEE Transactions on Neural Networks and Learning Systems

|April 13, 2023

Summary

This summary is machine-generated.

This study introduces a decentralized multiagent reinforcement learning (MARL) algorithm for partially observable Markov games (POMGs). The method enables agents to learn without central coordination, enhancing cooperative task performance.

More Related Videos

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Related Experiment Videos

Last Updated: Aug 2, 2025

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

The Collective Trust Game: An Online Group Adaptation of the Trust Game Based on the HoneyComb Paradigm

Published on: October 20, 2022

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Spatial Multiobjective Optimization of Agricultural Conservation Practices using a SWAT Model and an Evolutionary Algorithm

Published on: December 9, 2012

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

Area of Science:

Artificial Intelligence
Machine Learning
Robotics

Background:

Cooperative multiagent reinforcement learning (MARL) in partially observable Markov games (POMGs) presents challenges due to decentralized information.
Existing centralized training methods require global state and reward information, limiting scalability and applicability.

Purpose of the Study:

To develop a fully decentralized MARL algorithm for POMGs.
To enable cooperative learning among agents without a central controller.
To adapt Trust Region Policy Optimization (TRPO) for decentralized MARL.

Main Methods:

Extended TRPO to cooperative MARL for POMGs.
Transformed the TRPO policy update into a distributed consensus optimization for networked agents.
Proposed a decentralized MARL algorithm using a distributed alternating direction method of multipliers (ADMM) with local convexification and trust-region.
Agents communicate local policy ratios via a peer-to-peer network.

Main Results:

The proposed algorithm effectively trains agents in a decentralized manner.
Demonstrated effectiveness in two cooperative environments.
Eliminated the need for a central control center to gather global information.

Conclusions:

The decentralized ADMM-based MARL algorithm is effective for cooperative POMGs.
This approach offers a scalable alternative to centralized training methods in MARL.
Enables efficient learning in networked multiagent systems with partial observability.