| A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning | Jun 4, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement Learning | May 30, 2023 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning | May 12, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning | Mar 16, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization | Feb 21, 2023 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| AIIR-MIX: Multi-Agent Reinforcement Learning Meets Attention Individual Intrinsic Reward Mixing Network | Feb 19, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning | Feb 11, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Equivariant MuZero | Feb 9, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| PushWorld: A benchmark for manipulation planning with tools and movable obstacles | Jan 24, 2023 | OpenAI GymStarcraft | CodeCode Available | 1 |
| TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems | Jan 13, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Self-Motivated Multi-Agent Exploration | Jan 5, 2023 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 0 |
| Strangeness-driven Exploration in Multi-Agent Reinforcement Learning | Dec 27, 2022 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Graph AttentionMulti-agent Reinforcement Learning | —Unverified | 0 |
| System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games | Dec 8, 2022 | Continual LearningLifelong learning | —Unverified | 0 |
| CURO: Curriculum Learning for Relative Overgeneralization | Dec 6, 2022 | Efficient ExplorationMulti-agent Reinforcement Learning | —Unverified | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Decision-making with Speculative Opponent Models | Nov 22, 2022 | Decision MakingSMAC | CodeCode Available | 0 |
| Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition | Nov 21, 2022 | Starcraft | CodeCode Available | 0 |