| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning | Oct 6, 2020 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| RODE: Learning Roles to Decompose Multi-Agent Tasks | Oct 4, 2020 | ClusteringStarcraft | CodeCode Available | 1 |
| Spatially Structured Recurrent Modules | Sep 28, 2020 | Starcraft IIVideo Prediction | —Unverified | 0 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| BGC: Multi-Agent Group Belief with Graph Clustering | Aug 20, 2020 | ClusteringGraph Clustering | —Unverified | 0 |
| Hierarchical Reinforcement Learning in StarCraft II with Human Expertise in Subgoals Selection | Aug 8, 2020 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Off-Policy Multi-Agent Decomposed Policy Gradients | Jul 24, 2020 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Value-Decomposition Multi-Agent Actor-Critics | Jul 24, 2020 | Multi-agent Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| S2RMs: Spatially Structured Recurrent Modules | Jul 13, 2020 | Starcraft IIVideo Prediction | —Unverified | 0 |
| Learning Implicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning | Jul 6, 2020 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search | Jun 12, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Incorporating Pragmatic Reasoning Communication into Emergent Language | Jun 7, 2020 | Multi-agent Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Real World Games Look Like Spinning Tops | Apr 20, 2020 | ClusteringStarcraft | CodeCode Available | 1 |
| F2A2: Flexible Fully-decentralized Approximate Actor-critic for Cooperative Multi-agent Reinforcement Learning | Apr 17, 2020 | Multi-agent Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets | Feb 14, 2020 | StarcraftStarcraft II | —Unverified | 0 |
| Heterogeneous Learning from Demonstration | Jan 27, 2020 | Bayesian InferenceStarcraft | —Unverified | 0 |
| Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies | Jan 1, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 |
| LIIR: Learning Individual Intrinsic Reward in Multi-Agent Reinforcement Learning | Dec 1, 2019 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Narration-based Reward Shaping Approach using Grounded Natural Language Commands | Oct 31, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |