| Dungeons and Data: A Large-Scale NetHack Dataset | Nov 1, 2022 | Decision MakingNetHack | CodeCode Available | 2 |
| Exploring Adaptive MCTS with TD Learning in miniXCOM | Oct 10, 2022 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Learning Explicit Credit Assignment for Cooperative Multi-Agent Reinforcement Learning via Polarization Policy Gradient | Oct 10, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| On Efficient Reinforcement Learning for Full-length Game of StarCraft II | Sep 23, 2022 | CPUreinforcement-learning | CodeCode Available | 2 |
| MIXRTs: Toward Interpretable Multi-Agent Reinforcement Learning via Mixing Recurrent Soft Decision Trees | Sep 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction | Sep 2, 2022 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Forecasting Evolution of Clusters in Game Agents with Hebbian Learning | Aug 19, 2022 | ClusteringMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Framework for Understanding and Visualizing Strategies of RL Agents | Aug 17, 2022 | EthicsStarcraft | CodeCode Available | 0 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Model-Free Generative Replay for Lifelong Reinforcement Learning: Application to Starcraft-2 | Aug 9, 2022 | Lifelong learningManagement | —Unverified | 0 |
| Maximum Correntropy Value Decomposition for Multi-agent Deep Reinforcemen Learning | Aug 7, 2022 | Deep Reinforcement LearningSMAC | —Unverified | 0 |
| Unsupervised Hebbian Learning on Point Sets in StarCraft II | Jul 13, 2022 | DecoderSelf-Supervised Learning | —Unverified | 0 |
| SC2EGSet: StarCraft II Esport Replay and Game-state Dataset | Jul 7, 2022 | StarcraftStarcraft II | CodeCode Available | 1 |
| PAC: Assisted Value Factorisation with Counterfactual Predictions in Multi-Agent Reinforcement Learning | Jun 22, 2022 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Evolutionary Game-Theoretical Analysis for General Multiplayer Asymmetric Games | Jun 22, 2022 | StarcraftStarcraft II | —Unverified | 0 |
| On the Limitations of Elo: Real-World Games, are Transitive, not Additive | Jun 21, 2022 | StarcraftStarcraft II | CodeCode Available | 0 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning? | Jun 20, 2022 | AllMulti-agent Reinforcement Learning | —Unverified | 0 |
| Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | Jun 17, 2022 | MuJoCoStarcraft | —Unverified | 0 |
| Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning | Jun 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL | Jun 1, 2022 | DiversityMulti-agent Reinforcement Learning | —Unverified | 0 |
| DM^2: Decentralized Multi-Agent Reinforcement Learning for Distribution Matching | Jun 1, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Off-Beat Multi-Agent Reinforcement Learning | May 27, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| QGNN: Value Function Factorisation with Graph Neural Networks | May 25, 2022 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Terrain Analysis in StarCraft 1 and 2 as Combinatorial Optimization | May 18, 2022 | Combinatorial OptimizationReal-Time Strategy Games | CodeCode Available | 0 |
| Learning to Guide Multiple Heterogeneous Actors from a Single Human Demonstration via Automatic Curriculum Learning in StarCraft II | May 11, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning | May 5, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Transfer Role Assignment Across Team Sizes | Apr 17, 2022 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents | Mar 16, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Depthwise Convolution for Multi-Agent Communication with Enhanced Mean-Field Approximation | Mar 6, 2022 | Reinforcement Learning (RL)SMAC | —Unverified | 0 |
| Intrinsically-Motivated Reinforcement Learning: A Brief Introduction | Mar 3, 2022 | Autonomous Drivingreinforcement-learning | —Unverified | 0 |
| MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning | Feb 21, 2022 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 0 |
| NeuPL: Neural Population Learning | Feb 15, 2022 | StarcraftTransfer Learning | —Unverified | 0 |
| FCMNet: Full Communication Memory Net for Team-Level Cooperation in Multi-Agent Systems | Jan 28, 2022 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Exploiting Semantic Epsilon Greedy Exploration Strategy in Multi-Agent Reinforcement Learning | Jan 26, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning | Jan 12, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients | Jan 4, 2022 | StarcraftStarcraft II | CodeCode Available | 0 |
| Deep Reinforcement Learning, a textbook | Jan 4, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Local Advantage Networks for Cooperative Multi-Agent Reinforcement Learning | Dec 23, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CGIBNet: Bandwidth-constrained Communication with Graph Information Bottleneck in Multi-Agent Reinforcement Learning | Dec 20, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cooperative Multi-Agent Reinforcement Learning with Hypergraph Convolution | Dec 9, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents | Dec 1, 2021 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 |
| Off-Policy Correction For Multi-Agent Reinforcement Learning | Nov 22, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration | Nov 22, 2021 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Coordinated Proximal Policy Optimization | Nov 7, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| On games and simulators as a platform for development of artificial intelligence for command and control | Oct 21, 2021 | Real-Time Strategy GamesStarcraft | —Unverified | 0 |
| State-based Episodic Memory for Multi-Agent Reinforcement Learning | Oct 19, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |