| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2, Language Modelling | Code Available | 1 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement Learning, Starcraft | Code Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous Control, MuJoCo | Code Available | 1 |
| PushWorld: A benchmark for manipulation planning with tools and movable obstacles | Jan 24, 2023 | OpenAI Gym, Starcraft | Code Available | 1 |
| TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems | Jan 13, 2023 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| SC2EGSet: StarCraft II Esport Replay and Game-state Dataset | Jul 7, 2022 | Starcraft, Starcraft II | Code Available | 1 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| QGNN: Value Function Factorisation with Graph Neural Networks | May 25, 2022 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 1 |
| CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning | Jan 12, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | All, Multi-agent Reinforcement Learning | Code Available | 1 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 |
| Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration | Nov 22, 2021 | Efficient Exploration, Multi-agent Reinforcement Learning | Code Available | 1 |
| Coordinated Proximal Policy Optimization | Nov 7, 2021 | Starcraft, Starcraft II | Code Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement Learning, Starcraft | Code Available | 1 |
| No-Press Diplomacy from Scratch | Oct 6, 2021 | Starcraft | Code Available | 1 |
| Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II | Sep 26, 2021 | GPU, Starcraft | Code Available | 1 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCo, Reinforcement Learning (RL) | Code Available | 1 |
| Rethinking of AlphaStar | Aug 7, 2021 | Starcraft, Starcraft II | Code Available | 1 |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Jul 30, 2021 | Optical Flow Estimation, Starcraft | Code Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement Learning, Offline RL | Code Available | 1 |
| Context-Aware Sparse Deep Coordination Graphs | Jun 5, 2021 | graph construction, Graph Learning | Code Available | 1 |