| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Jun 4, 2021 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment | May 21, 2021 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition | May 18, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| An Introduction of mini-AlphaStar | Apr 14, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS | Apr 5, 2021 | Continual LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games | Mar 2, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game | Nov 27, 2020 | AI AgentImitation Learning | CodeCode Available | 1 |
| TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning | Nov 25, 2020 | Dota 2Multi-agent Reinforcement Learning | CodeCode Available | 1 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Multi-Agent Collaboration via Reward Attribution Decomposition | Oct 16, 2020 | Dota 2Multi-agent Reinforcement Learning | CodeCode Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| RODE: Learning Roles to Decompose Multi-Agent Tasks | Oct 4, 2020 | ClusteringStarcraft | CodeCode Available | 1 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Off-Policy Multi-Agent Decomposed Policy Gradients | Jul 24, 2020 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Value-Decomposition Multi-Agent Actor-Critics | Jul 24, 2020 | Multi-agent Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Jun 18, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning | Jun 7, 2020 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Real World Games Look Like Spinning Tops | Apr 20, 2020 | ClusteringStarcraft | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| DefogGAN: Predicting Hidden Information in the StarCraft Fog of War with Generative Adversarial Nets | Mar 4, 2020 | Starcraft | CodeCode Available | 1 |
| Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies | Jan 1, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 |
| LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning | Dec 1, 2019 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | BenchmarkingMuJoCo | CodeCode Available | 1 |
| Towards Accurate Generative Models of Video: A New Metric & Challenges | Dec 3, 2018 | DiversityRepresentation Learning | CodeCode Available | 1 |
| QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 30, 2018 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Counterfactual Multi-Agent Policy Gradients | May 24, 2017 | Autonomous Vehiclescounterfactual | CodeCode Available | 1 |
| Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning | Feb 28, 2017 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning | Jun 23, 2025 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 0 |
| NeuroPAL: Punctuated Anytime Learning with Neuroevolution for Macromanagement in Starcraft: Brood War | Jun 12, 2025 | Computational EfficiencyStarcraft | —Unverified | 0 |
| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning | May 19, 2025 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning | Mar 3, 2025 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Nucleolus Credit Assignment for Effective Coalitions in Multi-agent Reinforcement Learning | Mar 1, 2025 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cooperative Multi-Agent Planning with Adaptive Skill Synthesis | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning | Feb 8, 2025 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Innovative activities of Activision Blizzard: A patent network analysis | Feb 4, 2025 | Patent classificationStarcraft | —Unverified | 0 |
| Superhuman Game AI Disclosure: Expertise and Context Moderate Effects on Trust and Fairness | Jan 31, 2025 | EthicsFairness | —Unverified | 0 |
| Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics | Jan 21, 2025 | Distributional Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors | Dec 30, 2024 | CPUImitation Learning | —Unverified | 0 |
| Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning | Dec 20, 2024 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 0 |