| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | DiversityLanguage Modeling | CodeCode Available | 4 |
| Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 19, 2020 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 2 |
| A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Oct 21, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Efficient Multi-agent Reinforcement Learning by Planning | May 20, 2024 | Computational EfficiencyModel-based Reinforcement Learning | CodeCode Available | 1 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | FairnessMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | BenchmarkingGPU | CodeCode Available | 1 |
| SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Dec 23, 2024 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | May 28, 2024 | Multi-agent Reinforcement LearningSMAC | CodeCode Available | 1 |
| Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers | May 10, 2023 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented GenerationSMAC | CodeCode Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks | Feb 7, 2025 | BenchmarkingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | BenchmarkingMuJoCo | CodeCode Available | 1 |
| UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers | Jan 20, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition | Nov 23, 2022 | Contrastive LearningDiversity | CodeCode Available | 1 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Scalable Multi-Agent Model-Based Reinforcement Learning | May 25, 2022 | Mambamodel | CodeCode Available | 1 |
| Latent State Marginalization as a Low-cost Approach for Improving Exploration | Oct 3, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Jun 9, 2025 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| HomOpt: A Homotopy-Based Hyperparameter Optimization Method | Aug 7, 2023 | Bayesian OptimizationHyperparameter Optimization | CodeCode Available | 1 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization | Jun 20, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Automated classification of pre-defined movement patterns: A comparison between GNSS and UWB technology | Mar 10, 2023 | SMACSMAC+ | —Unverified | 0 |
| Innate-Values-driven Reinforcement Learning based Cooperative Multi-Agent Cognitive Modeling | Jan 10, 2024 | reinforcement-learningReinforcement Learning | —Unverified | 0 |
| CuDA2: An approach for Incorporating Traitor Agents into Cooperative Multi-Agent Systems | Jun 25, 2024 | Adversarial AttackMulti-agent Reinforcement Learning | —Unverified | 0 |
| Aligning Individual and Collective Objectives in Multi-Agent Cooperation | Feb 19, 2024 | SMACSMAC+ | —Unverified | 0 |
| Coordinated Multi-Agent Exploration Using Shared Goals | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Cooperative Exploration for Multi-Agent Deep Reinforcement Learning | Jul 23, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Comparative study of Hyper-Parameter Optimization Tools | Jan 17, 2022 | Bayesian OptimizationBenchmarking | —Unverified | 0 |
| Improving Global Parameter-sharing in Physically Heterogeneous Multi-agent Reinforcement Learning with Unified Action Space | Aug 14, 2024 | Multi-agent Reinforcement LearningSMAC | —Unverified | 0 |
| A Spatiotemporal Stealthy Backdoor Attack against Cooperative Multi-Agent Deep Reinforcement Learning | Sep 12, 2024 | Backdoor AttackDeep Reinforcement Learning | —Unverified | 0 |
| Fast Optimization of Wildfire Suppression Policies with SMAC | Mar 28, 2017 | ManagementSMAC | —Unverified | 0 |
| Exploiting Semantic Epsilon Greedy Exploration Strategy in Multi-Agent Reinforcement Learning | Jan 26, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning | Dec 13, 2023 | Multi-agent Reinforcement LearningSMAC | —Unverified | 0 |
| Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods | Jun 3, 2025 | Ensemble LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation | Feb 13, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Heterogeneous Multi-Agent Reinforcement Learning for Zero-Shot Scalable Collaboration | Apr 5, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Characterization of neighborhood behaviours in a multi-neighborhood local search algorithm | Mar 12, 2016 | SMACSMAC+ | —Unverified | 0 |