| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | Diversity, Language Modeling | Code Available | 4 | 5 |
| ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency | Nov 29, 2022 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 2 | 5 |
| A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Oct 21, 2024 | Decision Making, Multi-agent Reinforcement Learning | Code Available | 2 | 5 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision Making, Language Modelling | Code Available | 2 | 5 |
| SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning | Dec 14, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 2 | 5 |
| Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Mar 19, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 2 | 5 |
| Latent State Marginalization as a Low-cost Approach for Improving Exploration | Oct 3, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Soft-QMIX: Integrating Maximum Entropy For Monotonic Value Function Factorization | Jun 20, 2024 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Individual Contributions as Intrinsic Exploration Scaffolds for Multi-agent Reinforcement Learning | May 28, 2024 | Multi-agent Reinforcement Learning, SMAC | Code Available | 1 | 5 |
| Robust multi-agent coordination via evolutionary generation of auxiliary adversarial attackers | May 10, 2023 | Diversity, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC | Dec 23, 2024 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Efficient Multi-agent Reinforcement Learning by Planning | May 20, 2024 | Computational Efficiency, Model-based Reinforcement Learning | Code Available | 1 | 5 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes | May 10, 2025 | Benchmarking, GPU | Code Available | 1 | 5 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous Control, MuJoCo | Code Available | 1 | 5 |
| UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers | Jan 20, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks | Feb 7, 2025 | Benchmarking, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Scalable Multi-Agent Model-Based Reinforcement Learning | May 25, 2022 | Mamba, model | Code Available | 1 | 5 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | All, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition | Nov 23, 2022 | Contrastive Learning, Diversity | Code Available | 1 | 5 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented Generation, SMAC | Code Available | 1 | 5 |
| SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning | May 31, 2021 | Fairness, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Curriculum Learning With Counterfactual Group Relative Policy Advantage For Multi-Agent Reinforcement Learning | Jun 9, 2025 | counterfactual, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge? | Nov 18, 2020 | All, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning | Feb 16, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 1 | 5 |
| Deep Implicit Coordination Graphs for Multi-agent Reinforcement Learning | Jun 19, 2020 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| HomOpt: A Homotopy-Based Hyperparameter Optimization Method | Aug 7, 2023 | Bayesian Optimization, Hyperparameter Optimization | Code Available | 1 | 5 |
| The StarCraft Multi-Agent Challenge | Feb 11, 2019 | Benchmarking, MuJoCo | Code Available | 1 | 5 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| AutoWeka4MCPS-AVATAR: Accelerating Automated Machine Learning Pipeline Composition and Optimisation | Nov 21, 2020 | BIG-bench Machine Learning, CPU | Code Available | 0 | 5 |
| Decision-making with Speculative Opponent Models | Nov 22, 2022 | Decision Making, SMAC | Code Available | 0 | 5 |
| Decentralized Transformers with Centralized Aggregation are Sample-Efficient Multi-Agent World Models | Jun 22, 2024 | Reinforcement Learning (RL), SMAC | Code Available | 0 | 5 |
| SMIX(λ): Enhancing Centralized Value Functions for Cooperative Multi-Agent Reinforcement Learning | Nov 11, 2019 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| A Unified Framework for Factorizing Distributional Value Functions for Multi-Agent Reinforcement Learning | Jun 4, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Additive Tree-Structured Conditional Parameter Spaces in Bayesian Optimization: A Novel Covariance Function and a Fast Implementation | Oct 6, 2020 | Bayesian Optimization, global-optimization | Code Available | 0 | 5 |
| GHQ: Grouped Hybrid Q Learning for Heterogeneous Cooperative Multi-agent Reinforcement Learning | Mar 2, 2023 | Multi-agent Reinforcement Learning, Q-Learning | Code Available | 0 | 5 |
| Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation | Jun 20, 2025 | Multi-agent Reinforcement Learning, SMAC | Code Available | 0 | 5 |
| QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition | Aug 12, 2024 | Multi-agent Reinforcement Learning, SMAC | Code Available | 0 | 5 |
| PPS-QMIX: Periodically Parameter Sharing for Accelerating Convergence of Multi-Agent Reinforcement Learning | Mar 5, 2024 | Federated Learning, Multi-agent Reinforcement Learning | Code Available | 0 | 5 |
| QVMix and QVMix-Max: Extending the Deep Quality-Value Family of Algorithms to Cooperative Multi-Agent Reinforcement Learning | Dec 22, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |
| Efficient Hyperparameter Optimization of Deep Learning Algorithms Using Deterministic RBF Surrogates | Jul 28, 2016 | Bayesian Optimization, Gaussian Processes | Code Available | 0 | 5 |
| mlrMBO: A Modular Framework for Model-Based Optimization of Expensive Black-Box Functions | Mar 9, 2017 | Bayesian Optimization, regression | Code Available | 0 | 5 |
| On the Performance of Differential Evolution for Hyperparameter Tuning | Apr 15, 2019 | Bayesian Optimization, BIG-bench Machine Learning | Code Available | 0 | 5 |
| Efficient Evolutionary Methods for Game Agent Optimisation: Model-Based is Best | Jan 3, 2019 | SMAC, SMAC+ | Code Available | 0 | 5 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Mar 27, 2025 | MuJoCo, SMAC | Code Available | 0 | 5 |
| Effects of Spectral Normalization in Multi-agent Reinforcement Learning | Dec 10, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Code Available | 0 | 5 |