| MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning | Feb 12, 2024 | Decision Makingreinforcement-learning | —Unverified | 0 |
| COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations | Feb 1, 2024 | In-Context LearningStarcraft | —Unverified | 0 |
| SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models | Jan 31, 2024 | StarcraftStarcraft II | CodeCode Available | 1 |
| BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Jan 14, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling | Jan 10, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | ImputationReinforcement Learning (RL) | —Unverified | 0 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning | Dec 10, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CODEX: A Cluster-Based Method for Explainable Reinforcement Learning | Dec 7, 2023 | Clusteringcounterfactual | CodeCode Available | 0 |
| Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play | Nov 28, 2023 | Atari GamesDiversity | —Unverified | 0 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPUGPU | CodeCode Available | 2 |
| QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning | Nov 1, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization | Oct 15, 2023 | Multi-agent Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning | Sep 13, 2023 | Multi-agent Reinforcement LearningPrivacy Preserving | —Unverified | 0 |
| Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning | Sep 12, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning | Sep 8, 2023 | DisentanglementManagement | —Unverified | 0 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Never Explore Repeatedly in Multi-Agent Reinforcement Learning | Aug 19, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | ManagementMuJoCo | CodeCode Available | 1 |
| Transferable Curricula through Difficulty Conditioned Generators | Jun 22, 2023 | Reinforcement Learning (RL)Starcraft | —Unverified | 0 |
| Anticipatory Thinking Challenges in Open Worlds: Risk Management | Jun 22, 2023 | Adversarial RobustnessAutonomous Vehicles | —Unverified | 0 |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Jun 19, 2023 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2Language Modelling | CodeCode Available | 1 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning | Jun 4, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement Learning | May 30, 2023 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning | May 12, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning | Mar 16, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| MAC-PO: Multi-Agent Experience Replay via Collective Priority Optimization | Feb 21, 2023 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| AIIR-MIX: Multi-Agent Reinforcement Learning Meets Attention Individual Intrinsic Reward Mixing Network | Feb 19, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning | Feb 11, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Equivariant MuZero | Feb 9, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| PushWorld: A benchmark for manipulation planning with tools and movable obstacles | Jan 24, 2023 | OpenAI GymStarcraft | CodeCode Available | 1 |
| TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems | Jan 13, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Self-Motivated Multi-Agent Exploration | Jan 5, 2023 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 0 |
| Strangeness-driven Exploration in Multi-Agent Reinforcement Learning | Dec 27, 2022 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Hierarchical Strategies for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Graph AttentionMulti-agent Reinforcement Learning | —Unverified | 0 |
| System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games | Dec 8, 2022 | Continual LearningLifelong learning | —Unverified | 0 |
| CURO: Curriculum Learning for Relative Overgeneralization | Dec 6, 2022 | Efficient ExplorationMulti-agent Reinforcement Learning | —Unverified | 0 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Decision-making with Speculative Opponent Models | Nov 22, 2022 | Decision MakingSMAC | CodeCode Available | 0 |
| Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition | Nov 21, 2022 | Starcraft | CodeCode Available | 0 |