| Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Aug 8, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented GenerationSMAC | CodeCode Available | 1 |
| Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment | May 21, 2021 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| DefogGAN: Predicting Hidden Information in the StarCraft Fog of War with Generative Adversarial Nets | Mar 4, 2020 | Starcraft | CodeCode Available | 1 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Counterfactual Multi-Agent Policy Gradients | May 24, 2017 | Autonomous Vehiclescounterfactual | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning | Jan 12, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Rethinking of AlphaStar | Aug 7, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Multi-Agent Collaboration via Reward Attribution Decomposition | Oct 16, 2020 | Dota 2Multi-agent Reinforcement Learning | CodeCode Available | 1 |
| Off-Policy Multi-Agent Decomposed Policy Gradients | Jul 24, 2020 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| A Benchmark for Generalizing Across Diverse Team Strategies in Competitive Pokémon | Jun 12, 2025 | Large Language ModelStarcraft | CodeCode Available | 1 |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Oct 11, 2024 | DiversityMuJoCo | CodeCode Available | 1 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies | Jan 1, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 |
| Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | May 28, 2024 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 1 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS | Apr 5, 2021 | Continual LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| No-Press Diplomacy from Scratch | Oct 6, 2021 | Starcraft | CodeCode Available | 1 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning | Dec 1, 2019 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Jun 4, 2021 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Jul 30, 2021 | Optical Flow EstimationStarcraft | CodeCode Available | 1 |
| Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning | Feb 28, 2017 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| An Introduction of mini-AlphaStar | Apr 14, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning | Dec 10, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Data-Driven Distributed Common Operational Picture from Heterogeneous Platforms using Multi-Agent Reinforcement Learning | Nov 8, 2024 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Automatic Observer Script for StarCraft: Brood War Bot Games (technical report) | May 1, 2015 | Starcraft | —Unverified | 0 |
| CURO: Curriculum Learning for Relative Overgeneralization | Dec 6, 2022 | Efficient ExplorationMulti-agent Reinforcement Learning | —Unverified | 0 |
| Aligning Individual and Collective Objectives in Multi-Agent Cooperation | Feb 19, 2024 | SMACSMAC+ | —Unverified | 0 |
| SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning | Mar 3, 2025 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning | Feb 24, 2021 | Meta-LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Coordinated Multi-Agent Exploration Using Shared Goals | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Asynchronous Advantage Actor-Critic Agent for Starcraft II | Jul 22, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment | Jun 1, 2021 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| AIIR-MIX: Multi-Agent Reinforcement Learning Meets Attention Individual Intrinsic Reward Mixing Network | Feb 19, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| Cooperative Multi-Agent Planning with Adaptive Skill Synthesis | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Cooperative Exploration for Multi-Agent Deep Reinforcement Learning | Jul 23, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Containerized Distributed Value-Based Multi-Agent Reinforcement Learning | Oct 15, 2021 | BlockingManagement | —Unverified | 0 |