| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | DiversityLanguage Modeling | CodeCode Available | 4 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Oct 21, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 19, 2020 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Jun 9, 2025 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | BenchmarkingGPU | CodeCode Available | 1 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented GenerationSMAC | CodeCode Available | 1 |
| An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks | Feb 7, 2025 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Dec 23, 2024 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization | Jun 20, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | May 28, 2024 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 1 |
| Efficient Multi-agent Reinforcement Learning by Planning | May 20, 2024 | Computational EfficiencyModel-based Reinforcement Learning | CodeCode Available | 1 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| HomOpt: A Homotopy-Based Hyperparameter Optimization Method | Aug 7, 2023 | Bayesian OptimizationHyperparameter Optimization | CodeCode Available | 1 |
| Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers | May 10, 2023 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition | Nov 23, 2022 | Contrastive LearningDiversity | CodeCode Available | 1 |
| Latent State Marginalization as a Low-cost Approach for Improving Exploration | Oct 3, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Scalable Multi-Agent Model-Based Reinforcement Learning | May 25, 2022 | Mambamodel | CodeCode Available | 1 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers | Jan 20, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | BenchmarkingMuJoCo | CodeCode Available | 1 |
| Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation | Jun 20, 2025 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 0 |
| Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods | Jun 3, 2025 | Ensemble LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Dynamic Sight Range Selection in Multi-Agent Reinforcement Learning | May 19, 2025 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| POCAII: Parameter Optimization with Conscious Allocation using Iterative Intelligence | May 16, 2025 | Hyperparameter OptimizationSMAC | —Unverified | 0 |
| Rainbow Delay Compensation: A Multi-Agent Reinforcement Learning Framework for Mitigating Delayed Observation | May 6, 2025 | Multi-agent Reinforcement LearningSMAC | —Unverified | 0 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Mar 27, 2025 | MuJoCoSMAC | CodeCode Available | 0 |
| Low-Rank Agent-Specific Adaptation (LoRASA) for Multi-Agent Policy Learning | Feb 8, 2025 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Dual Ensembled Multiagent Q-Learning with Hypernet Regularizer | Feb 4, 2025 | Q-LearningSMAC | CodeCode Available | 0 |
| O-MAPL: Offline Multi-agent Preference Learning | Jan 31, 2025 | Reinforcement Learning (RL)SMAC | —Unverified | 0 |
| BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems | Jan 3, 2025 | Deep Reinforcement LearningSMAC | —Unverified | 0 |
| Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration | Oct 25, 2024 | Efficient ExplorationMulti-agent Reinforcement Learning | —Unverified | 0 |
| A Safety Modulator Actor-Critic Method in Model-Free Safe Reinforcement Learning and Application in UAV Hovering | Oct 9, 2024 | Reinforcement Learning (RL)Safe Reinforcement Learning | —Unverified | 0 |
| A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning | Sep 12, 2024 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Diffusion-based Episodes Augmentation for Offline Multi-Agent Reinforcement Learning | Aug 23, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space | Aug 14, 2024 | Multi-agent Reinforcement LearningSMAC | —Unverified | 0 |
| QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition | Aug 12, 2024 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 0 |
| CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems | Jun 25, 2024 | Adversarial AttackMulti-agent Reinforcement Learning | —Unverified | 0 |