| MAIDCRL: Semi-centralized Multi-Agent Influence Dense-CNN Reinforcement Learning | Feb 12, 2024 | Decision Making, reinforcement-learning | Unverified | 0 |
| COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations | Feb 1, 2024 | In-Context Learning, Starcraft | Unverified | 0 |
| SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models | Jan 31, 2024 | Starcraft, Starcraft II | Code Available | 1 |
| BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Jan 14, 2024 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling | Jan 10, 2024 | reinforcement-learning, Reinforcement Learning | Unverified | 0 |
| StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | Imputation, Reinforcement Learning (RL) | Unverified | 0 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language Modelling, Large Language Model | Code Available | 2 |
| DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning | Dec 10, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| CODEX: A Cluster-Based Method for Explainable Reinforcement Learning | Dec 7, 2023 | Clustering, counterfactual | Code Available | 0 |
| Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play | Nov 28, 2023 | Atari Games, Diversity | Unverified | 0 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPU, GPU | Code Available | 2 |
| QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning | Nov 1, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization | Oct 15, 2023 | Multi-agent Reinforcement Learning, Off-policy evaluation | Unverified | 0 |
| Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning | Sep 13, 2023 | Multi-agent Reinforcement Learning, Privacy Preserving | Unverified | 0 |
| Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning | Sep 12, 2023 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning | Sep 8, 2023 | Disentanglement, Management | Unverified | 0 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Never Explore Repeatedly in Multi-Agent Reinforcement Learning | Aug 19, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RL, reinforcement-learning | Code Available | 2 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | Management, MuJoCo | Code Available | 1 |
| Anticipatory Thinking Challenges in Open Worlds: Risk Management | Jun 22, 2023 | Adversarial Robustness, Autonomous Vehicles | Unverified | 0 |
| Transferable Curricula through Difficulty Conditioned Generators | Jun 22, 2023 | Reinforcement Learning (RL), Starcraft | Unverified | 0 |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Jun 19, 2023 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 2 |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2, Language Modelling | Code Available | 1 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | Management, Multi-agent Reinforcement Learning | Unverified | 0 |