| Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Aug 8, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| N-Agent Ad Hoc Teamwork | Apr 16, 2024 | Autonomous DrivingMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2Language Modelling | CodeCode Available | 1 | 5 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | ManagementMuJoCo | CodeCode Available | 1 | 5 |
| QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 30, 2018 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| QGNN: Value Function Factorisation with Graph Neural Networks | May 25, 2022 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Counterfactual Multi-Agent Policy Gradients | May 24, 2017 | Autonomous Vehiclescounterfactual | CodeCode Available | 1 | 5 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 | 5 |
| CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 | 5 |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Jul 30, 2021 | Optical Flow EstimationStarcraft | CodeCode Available | 1 | 5 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 | 5 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 | 5 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 | 5 |
| A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Jun 12, 2025 | Large Language ModelStarcraft | CodeCode Available | 1 | 5 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Apr 17, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| DefogGAN: Predicting Hidden Information in the StarCraft Fog of War with Generative Adversarial Nets | Mar 4, 2020 | Starcraft | CodeCode Available | 1 | 5 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 | 5 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 | 5 |
| C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS | Apr 5, 2021 | Continual LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| PushWorld: A benchmark for manipulation planning with tools and movable obstacles | Jan 24, 2023 | OpenAI GymStarcraft | CodeCode Available | 1 | 5 |
| Real World Games Look Like Spinning Tops | Apr 20, 2020 | ClusteringStarcraft | CodeCode Available | 1 | 5 |
| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Jun 4, 2021 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| An Introduction of mini-AlphaStar | Apr 14, 2021 | StarcraftStarcraft II | CodeCode Available | 1 | 5 |
| TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game | Nov 27, 2020 | AI AgentImitation Learning | CodeCode Available | 1 | 5 |
| PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning | Jun 22, 2022 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Action Semantics Network: Considering the Effects of Actions in Multiagent Systems | Jul 26, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 | 5 |
| A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning | Jun 4, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| On the Limitations of Elo: Real-World Games, are Transitive, not Additive | Jun 21, 2022 | StarcraftStarcraft II | CodeCode Available | 0 | 5 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 | 5 |
| Off-Policy Correction For Multi-Agent Reinforcement Learning | Nov 22, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning | Mar 5, 2024 | Federated LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| A Structured Prediction Approach for Generalization in Cooperative Multi-Agent Reinforcement Learning | Oct 19, 2019 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning | Dec 20, 2024 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Explainable Reinforcement Learning Through a Causal Lens | May 27, 2019 | counterfactualreinforcement-learning | CodeCode Available | 0 | 5 |
| Multi-Agent Common Knowledge Reinforcement Learning | Oct 27, 2018 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 | 5 |
| MSC: A Dataset for Macro-Management in StarCraft II | Oct 9, 2017 | ManagementReal-Time Strategy Games | CodeCode Available | 0 | 5 |
| Arena: a toolkit for Multi-Agent Reinforcement Learning | Jul 20, 2019 | Multi-agent Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| MazeBase: A Sandbox for Learning from Games | Nov 23, 2015 | NegationReinforcement Learning | CodeCode Available | 0 | 5 |
| Multiagent Bidirectionally-Coordinated Nets: Emergence of Human-level Coordination in Learning to Play StarCraft Combat Games | Mar 29, 2017 | Starcraft | CodeCode Available | 0 | 5 |