| Containerized Distributed Value-Based Multi-Agent Reinforcement Learning | Oct 15, 2021 | BlockingManagement | —Unverified | 0 |
| HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism | Oct 14, 2021 | Hierarchical Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Leveraging Transformers for StarCraft Macromanagement Prediction | Oct 11, 2021 | PredictionStarcraft | —Unverified | 0 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 |
| No-Press Diplomacy from Scratch | Oct 6, 2021 | Starcraft | CodeCode Available | 1 |
| Divergence-Regularized Multi-Agent Actor-Critic | Oct 1, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Revisiting the Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning | Sep 29, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Disentangling Sources of Risk for Distributional Multi-Agent Reinforcement Learning | Sep 29, 2021 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 |
| Role Diversity Matters: A Study of Cooperative Training Strategies for Multi-Agent RL | Sep 29, 2021 | DiversityMulti-agent Reinforcement Learning | —Unverified | 0 |
| MARNET: Backdoor Attacks against Value-Decomposition Multi-Agent Reinforcement Learning | Sep 29, 2021 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II | Sep 26, 2021 | GPUStarcraft | CodeCode Available | 1 |
| Influence-Based Reinforcement Learning for Intrinsically-Motivated Agents | Aug 28, 2021 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Adversary agent reinforcement learning for pursuit-evasion | Aug 25, 2021 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| An Approach to Partial Observability in Games: Learning to Both Act and Observe | Aug 11, 2021 | Atari GamesReinforcement Learning (RL) | —Unverified | 0 |
| Rethinking of AlphaStar | Aug 7, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Jul 30, 2021 | Optical Flow EstimationStarcraft | CodeCode Available | 1 |
| Cooperative Exploration for Multi-Agent Deep Reinforcement Learning | Jul 23, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi | Jul 15, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning | Jun 22, 2021 | Distributional Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| Truthful Self-Play | Jun 6, 2021 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| Context-Aware Sparse Deep Coordination Graphs | Jun 5, 2021 | graph constructionGraph Learning | CodeCode Available | 1 |
| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Jun 4, 2021 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment | Jun 1, 2021 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning | Jun 1, 2021 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment | May 21, 2021 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games | May 21, 2021 | Deep Reinforcement LearningGraph Attention | —Unverified | 0 |
| Coach-Player Multi-Agent Reinforcement Learning for Dynamic Team Composition | May 18, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning | May 13, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Convolution for Irregularly Sampled Temporal Point Clouds | May 1, 2021 | StarcraftStarcraft II | —Unverified | 0 |
| Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients | Apr 27, 2021 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Optimize Neural Fictitious Self-Play in Regret Minimization Thinking | Apr 22, 2021 | Starcraft | —Unverified | 0 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| An Introduction of mini-AlphaStar | Apr 14, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS | Apr 5, 2021 | Continual LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning | Apr 5, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| "Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us? | Mar 29, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Mar 22, 2021 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 |
| Meta-Learning for Planning: Automatic Synthesis of Sample Based Planners | Mar 13, 2021 | Decision MakingMeta-Learning | —Unverified | 0 |
| The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games | Mar 2, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Reinforcement Learning of Implicit and Explicit Control Flow in Instructions | Feb 25, 2021 | Minecraftreinforcement-learning | —Unverified | 0 |
| Credit Assignment with Meta-Policy Gradient for Multi-Agent Reinforcement Learning | Feb 24, 2021 | Meta-LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents | Feb 16, 2021 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Disturbing Reinforcement Learning Agents with Corrupted Rewards | Feb 12, 2021 | Autonomous Drivingreinforcement-learning | —Unverified | 0 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| RMIX: Risk-Sensitive Multi-Agent Reinforcement Learning | Jan 1, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |