| DOP: Off-Policy Multi-Agent Decomposed Policy Gradients | Jan 1, 2021 | Multi-agent Reinforcement Learning, Starcraft | Unverified | 0 |
| Coordinated Multi-Agent Exploration Using Shared Goals | Jan 1, 2021 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms | Jan 1, 2021 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| FSV: Learning to Factorize Soft Value Function for Cooperative Multi-Agent Reinforcement Learning | Jan 1, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Factored Action Spaces in Deep Reinforcement Learning | Jan 1, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II | Dec 24, 2020 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement Learning | Dec 22, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Exact Reduction of Huge Action Spaces in General Reinforcement Learning | Dec 18, 2020 | Binarization, General Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning for the Beginning of Starcraft II Game | Dec 14, 2020 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation | Dec 7, 2020 | Multi-agent Reinforcement Learning, Starcraft | Unverified | 0 |
| TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game | Nov 27, 2020 | AI Agent, Imitation Learning | Code Available | 1 |
| TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning | Nov 25, 2020 | Dota 2, Multi-agent Reinforcement Learning | Code Available | 1 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | All, Multi-agent Reinforcement Learning | Code Available | 1 |
| Multi-Agent Collaboration via Reward Attribution Decomposition | Oct 16, 2020 | Dota 2, Multi-agent Reinforcement Learning | Code Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning | Oct 6, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| RODE: Learning Roles to Decompose Multi-Agent Tasks | Oct 4, 2020 | Clustering, Starcraft | Code Available | 1 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| AI and Wargaming | Sep 18, 2020 | Starcraft | Unverified | 0 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning | Sep 9, 2020 | Multi-agent Reinforcement Learning, quantile regression | Unverified | 0 |
| BGC: Multi-Agent Group Belief with Graph Clustering | Aug 20, 2020 | Clustering, Graph Clustering | Unverified | 0 |
| Hierarchical Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection | Aug 8, 2020 | Decision Making, Hierarchical Reinforcement Learning | Unverified | 0 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 1 |
| Improving Multi-Agent Cooperation using Theory of Mind | Jul 30, 2020 | Starcraft | Unverified | 0 |
| Off-Policy Multi-Agent Decomposed Policy Gradients | Jul 24, 2020 | Multi-agent Reinforcement Learning, Starcraft | Code Available | 1 |
| Value-Decomposition Multi-Agent Actor-Critics | Jul 24, 2020 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Artificial Intelligence is stupid and causal reasoning won't fix it | Jul 20, 2020 | Chatbot, Starcraft | Unverified | 0 |
| QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning | Jun 22, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 1 |
| Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Jun 18, 2020 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Lifelong Learning using Eigentasks: Task Separation, Skill Acquisition and Selective Transfer | Jun 12, 2020 | Continual Learning, Lifelong learning | Unverified | 0 |
| StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search | Jun 12, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Incorporating Pragmatic Reasoning Communication into Emergent Language | Jun 7, 2020 | Multi-agent Reinforcement Learning, Reinforcement Learning (RL) | Unverified | 0 |
| Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning | Jun 7, 2020 | counterfactual, Multi-agent Reinforcement Learning | Code Available | 1 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems | May 27, 2020 | Deep Reinforcement Learning, Starcraft | Unverified | 0 |
| Optimal Any-Angle Pathfinding on a Sphere | Apr 24, 2020 | Starcraft | Unverified | 0 |
| Real World Games Look Like Spinning Tops | Apr 20, 2020 | Clustering, Starcraft | Code Available | 1 |
| F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning | Apr 17, 2020 | Multi-agent Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 19, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 |
| DefogGAN: Predicting Hidden Information in the StarCraft Fog of War with Generative Adversarial Nets | Mar 4, 2020 | Starcraft | Code Available | 1 |
| From Chess and Atari to StarCraft and Beyond: How Game AI is Driving the World of AI | Feb 24, 2020 | Articles, Starcraft | Unverified | 0 |
| A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets | Feb 14, 2020 | Starcraft, Starcraft II | Unverified | 0 |
| Heterogeneous Learning from Demonstration | Jan 27, 2020 | Bayesian Inference, Starcraft | Unverified | 0 |
| Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey | Jan 27, 2020 | Cloud Computing, Decision Making | Unverified | 0 |
| Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies | Jan 1, 2020 | Efficient Exploration, Meta Reinforcement Learning | Code Available | 1 |
| LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning | Dec 1, 2019 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |