GCM: Interpretable Multiagent Reinforcement Learning via Graph Cooperation Modeling.

Xuefei Wu, Yuanyang Zhu, Caihua Chen

IEEE Transactions on Neural Networks and Learning Systems

November 11, 2025

Summary

This summary is machine-generated.

Graph cooperation modeling (GCM) enhances multiagent reinforcement learning (MARL) by using graph structures for transparent decision-making. This approach improves performance and provides insights into agent cooperation patterns.

Area of Science:

Artificial Intelligence
Machine Learning
Multiagent Systems

Background:

Multiagent reinforcement learning (MARL) faces challenges with opaque neural network decision-making.
Lack of transparency hinders human understanding and trust in MARL models.
The inherent topological structure of the data offers a potential route to transparency in MARL.

Purpose of the Study:

To introduce Graph Cooperation Modeling (GCM) for transparent MARL.
To capture and interpret complex collaborative dynamics among agents using graph structures.
To enhance agent credit assignment and focus on task-relevant information.

Main Methods:

Developed GCM utilizing graph structures to model agent interactions.
Integrated a learned metric function to identify beneficial agent collaborations.
Employed graph neural networks (GNNs) for arbitrary-order interaction modeling.
Utilized identity semantics, global state, and individual value functions for agent credit estimation.
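
The four method steps above can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the summary does not specify the metric function, the GNN architecture, or the credit mixer, so the bilinear sigmoid metric, the edge threshold, the mean-aggregation layers, the state-conditioned softmax credit weights, and every name used here (`build_graph`, `gnn_layer`, `mix_q_values`, `W_metric`, `W_credit`) are hypothetical. Weights are random rather than learned.

```python
import numpy as np

def build_graph(h, W_metric, threshold=0.5):
    # Assumed metric: sigmoid of a bilinear score between agent features.
    scores = 1.0 / (1.0 + np.exp(-(h @ W_metric @ h.T)))
    scores = 0.5 * (scores + scores.T)      # make the graph undirected
    adj = (scores > threshold).astype(float)
    np.fill_diagonal(adj, 1.0)              # every agent keeps a self-loop
    return adj

def gnn_layer(adj, h, W):
    # Mean-aggregate neighbor features, apply a linear map, then ReLU.
    deg = adj.sum(axis=1, keepdims=True)
    return np.maximum((adj / deg) @ h @ W, 0.0)

def mix_q_values(q_individual, h, state, W_credit):
    # Assumed mixer: credit logits are a bilinear function of each agent's
    # graph embedding and the global state, normalized with a softmax.
    logits = h @ W_credit @ state
    credits = np.exp(logits - logits.max())
    credits /= credits.sum()                # nonnegative, sums to 1
    return float(credits @ q_individual), credits

rng = np.random.default_rng(0)
n_agents, obs_dim, hid_dim, state_dim = 4, 8, 8, 6

# Identity semantics (assumed form): append a one-hot agent ID to each embedding.
obs_emb = rng.normal(size=(n_agents, obs_dim))
h0 = np.concatenate([obs_emb, np.eye(n_agents)], axis=1)
feat_dim = obs_dim + n_agents

adj = build_graph(h0, 0.1 * rng.normal(size=(feat_dim, feat_dim)))
h1 = gnn_layer(adj, h0, rng.normal(size=(feat_dim, hid_dim)))
h2 = gnn_layer(adj, h1, rng.normal(size=(hid_dim, hid_dim)))  # two layers reach 2-hop neighbors

q_individual = rng.normal(size=n_agents)    # stand-in for individual value functions
state = rng.normal(size=state_dim)          # stand-in for the global state
q_tot, credits = mix_q_values(
    q_individual, h2, state, rng.normal(size=(hid_dim, state_dim))
)
print("adjacency:\n", adj)
print("credits:", credits, "joint value:", q_tot)
```

In this sketch the adjacency matrix and the credit vector are the two quantities a reader could inspect for interpretability: the first shows which agents are treated as collaborators, the second how much each agent's value contributes to the joint estimate. Stacking more `gnn_layer` calls is one simple way to capture higher-order interactions.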

Main Results:

GCM achieved up to 28.75% relative performance gains on challenging MARL benchmarks.
Demonstrated significant improvements on super-hard maps.
Provided clear interpretability of underlying cooperative patterns among agents.
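
A relative performance gain is measured as a fraction of the baseline score, which differs from an absolute difference in percentage points. The snippet below shows only the arithmetic. The summary does not state which metric or baseline the 28.75% figure refers to, so the function name `relative_gain` and both win rates are hypothetical values chosen for illustration and are not taken from the paper.

```python
def relative_gain(new_score, baseline_score):
    """Return the improvement over the baseline as a fraction of the baseline."""
    if baseline_score == 0:
        raise ValueError("relative gain is undefined for a zero baseline")
    return (new_score - baseline_score) / baseline_score

# Hypothetical win rates, used only to illustrate the calculation.
baseline_win_rate = 0.40
improved_win_rate = 0.515

gain = relative_gain(improved_win_rate, baseline_win_rate)
absolute_diff = improved_win_rate - baseline_win_rate
print(f"relative gain: {gain:.2%}, absolute difference: {absolute_diff:.3f}")
```

With these assumed inputs, an absolute improvement of 11.5 percentage points corresponds to a relative gain of about 28.75%, which is why the two figures should not be compared directly.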

Conclusions:

GCM offers a transparent and effective approach to MARL.
The graph-based method enhances both performance and interpretability in multiagent systems.
GCM facilitates a deeper understanding of cooperative dynamics in complex MARL tasks.