| Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation | Jun 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented GenerationSMAC | CodeCode Available | 1 |
| Trajectory-Class-Aware Multi-Agent Reinforcement Learning | Mar 3, 2025 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action GenerationDecision Making | CodeCode Available | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| Superhuman Game AI Disclosure: Expertise and Context Moderate Effects on Trust and Fairness | Jan 31, 2025 | EthicsFairness | —Unverified | 0 |
| Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination Dynamics | Jan 21, 2025 | Distributional Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Human-like Bots for Tactical Shooters Using Compute-Efficient Sensors | Dec 30, 2024 | CPUImitation Learning | —Unverified | 0 |
| Novelty-Guided Data Reuse for Efficient and Diversified Multi-Agent Reinforcement Learning | Dec 20, 2024 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| Efficiently Scanning and Resampling Spatio-Temporal Tasks with Irregular Observations | Oct 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Carefully Structured Compression: Efficiently Managing StarCraft II Data | Oct 11, 2024 | StarcraftStarcraft II | CodeCode Available | 0 |
| ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | Oct 2, 2024 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| On Stateful Value Factorization in Multi-Agent Reinforcement Learning | Aug 27, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Aug 8, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | May 13, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Apr 17, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| N-Agent Ad Hoc Teamwork | Apr 16, 2024 | Autonomous DrivingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Inferring Latent Temporal Sparse Coordination Graph for Multi-Agent Reinforcement Learning | Mar 28, 2024 | Graph LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Collaborative AI Teaming in Unknown Environments via Active Goal Deduction | Mar 22, 2024 | StarcraftStarcraft II | —Unverified | 0 |
| SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | Mar 4, 2024 | Hierarchical Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Mar 2, 2024 | DecoderMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Aligning Individual and Collective Objectives in Multi-Agent Cooperation | Feb 19, 2024 | SMACSMAC+ | —Unverified | 0 |
| COA-GPT: Generative Pre-trained Transformers for Accelerated Course of Action Development in Military Operations | Feb 1, 2024 | In-Context LearningStarcraft | —Unverified | 0 |
| SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models | Jan 31, 2024 | StarcraftStarcraft II | CodeCode Available | 1 |
| BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions | Jan 14, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | ImputationReinforcement Learning (RL) | —Unverified | 0 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning | Dec 10, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| CODEX: A Cluster-Based Method for Explainable Reinforcement Learning | Dec 7, 2023 | Clusteringcounterfactual | CodeCode Available | 0 |
| Minimax Exploiter: A Data Efficient Approach for Competitive Self-Play | Nov 28, 2023 | Atari GamesDiversity | —Unverified | 0 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPUGPU | CodeCode Available | 2 |
| Robust Multi-Agent Reinforcement Learning by Mutual Information Regularization | Oct 15, 2023 | Multi-agent Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| Fidelity-Induced Interpretable Policy Extraction for Reinforcement Learning | Sep 12, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Leveraging World Model Disentanglement in Value-Based Multi-Agent Reinforcement Learning | Sep 8, 2023 | DisentanglementManagement | —Unverified | 0 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Never Explore Repeatedly in Multi-Agent Reinforcement Learning | Aug 19, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | ManagementMuJoCo | CodeCode Available | 1 |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2Language Modelling | CodeCode Available | 1 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement Learning | May 30, 2023 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Boosting Value Decomposition via Unit-Wise Attentive State Representation for Cooperative Multi-Agent Reinforcement Learning | May 12, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning | Mar 16, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| AIIR-MIX: Multi-Agent Reinforcement Learning Meets Attention Individual Intrinsic Reward Mixing Network | Feb 19, 2023 | Multi-agent Reinforcement LearningStarcraft | —Unverified | 0 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |