Projects per year
Abstract
In multi-agent reinforcement learning, centralized training with decentralized execution (CTDE) methods typically assume that agents make decisions based on their local observations independently, which may not lead to a correlated joint policy with coordination. Coordination can be explicitly encouraged during training and individual policies can be trained to imitate the correlated joint policy. However, this may lead to an asymmetric learning failure due to the observation mismatch between the joint and individual policies. Inspired by the concept of correlated equilibrium, we introduce a strategy modification called AgentMixer that allows agents to correlate their policies. AgentMixer combines individual partially observable policies into a joint fully observable policy non-linearly. To enable decentralized execution, we introduce Individual-Global-Consistency to guarantee mode consistency during joint training of the centralized and decentralized policies and prove that AgentMixer converges to an ϵ-approximate Correlated Equilibrium. In the Multi-Agent MuJoCo, SMAC-v2, Matrix Game, and Predator-Prey benchmarks, AgentMixer outperforms or matches state-of-the-art methods.
| Original language | English |
|---|---|
| Pages (from-to) | 18611-18619 |
| Number of pages | 9 |
| Journal | Proceedings of the AAAI Conference on Artificial Intelligence |
| Volume | 39 |
| Issue number | 17 |
| DOIs | |
| Publication status | Published - 11 Apr 2025 |
| MoE publication type | A4 Conference publication |
| Event | AAAI Conference on Artificial Intelligence - Philadelphia, United States Duration: 25 Feb 2025 → 4 Mar 2025 Conference number: 39 |
Fingerprint
Dive into the research topics of 'AgentMixer: Multi-Agent Correlated Policy Factorization'. Together they form a unique fingerprint.-
MARL: Efficient and Principled Multi-Agent Reinforcement Learning
Pajarinen, J. (Principal investigator), Tuomisto, J. (Project Member), Yang, W. (Project Member), Zhao, Y. (Project Member), Zhao, W. (Project Member), Xuan, C. (Project Member) & Wang, H. (Project Member)
01/09/2023 → 31/08/2027
Project: RCF Academy Project
-
-: Finnish Center for Artificial Intelligence
Kaski, S. (Principal investigator)
01/01/2019 → 31/12/2022
Project: Academy of Finland: Other research funding