| Hierarchical Expert Prompt for Large-Language-Model: An Approach Defeat Elite AI in TextStarCraft II for the First Time | Feb 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 2 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Nov 8, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 2 |
| Efficient Episodic Memory Utilization of Cooperative Multi-Agent Reinforcement Learning | Mar 2, 2024 | DecoderMulti-agent Reinforcement Learning | CodeCode Available | 2 |
| Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach | Dec 19, 2023 | Language ModellingLarge Language Model | CodeCode Available | 2 |
| JaxMARL: Multi-Agent RL Environments and Algorithms in JAX | Nov 16, 2023 | CPUGPU | CodeCode Available | 2 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| On Efficient Reinforcement Learning for Full-length Game of StarCraft II | Sep 23, 2022 | CPUreinforcement-learning | CodeCode Available | 2 |
| AVA: Attentive VLM Agent for Mastering StarCraft II | Mar 7, 2025 | Retrieval-augmented GenerationSMAC | CodeCode Available | 1 |
| Trajectory-Class-Aware Multi-Agent Reinforcement Learning | Mar 3, 2025 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Assigning Credit with Partial Reward Decoupling in Multi-Agent Proximal Policy Optimization | Aug 8, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Group-Aware Coordination Graph for Multi-Agent Reinforcement Learning | Apr 17, 2024 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| N-Agent Ad Hoc Teamwork | Apr 16, 2024 | Autonomous DrivingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SwarmBrain: Embodied agent for real-time strategy game StarCraft II via large language models | Jan 31, 2024 | StarcraftStarcraft II | CodeCode Available | 1 |
| FoX: Formation-aware exploration in multi-agent reinforcement learning | Aug 22, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization | Jul 21, 2023 | ManagementMuJoCo | CodeCode Available | 1 |
| Semantic HELM: A Human-Readable Memory for Reinforcement Learning | Jun 15, 2023 | Dota 2Language Modelling | CodeCode Available | 1 |
| Is Centralized Training with Decentralized Execution Framework Centralized Enough for MARL? | May 27, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning | May 9, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Effective and Stable Role-Based Multi-Agent Collaboration by Structural Information Principles | Apr 3, 2023 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Feb 7, 2023 | Continuous ControlMuJoCo | CodeCode Available | 1 |
| TransfQMix: Transformers for Leveraging the Graph Structure of Multi-Agent Reinforcement Learning Problems | Jan 13, 2023 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Transformer-based Value Function Decomposition for Cooperative Multi-agent Reinforcement Learning in StarCraft | Aug 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| SC2EGSet: StarCraft II Esport Replay and Game-state Dataset | Jul 7, 2022 | StarcraftStarcraft II | CodeCode Available | 1 |
| MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer | Jun 20, 2022 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning | Mar 16, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Dec 6, 2021 | AllMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Dec 1, 2021 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Episodic Multi-agent Reinforcement Learning with Curiosity-Driven Exploration | Nov 22, 2021 | Efficient ExplorationMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Coordinated Proximal Policy Optimization | Nov 7, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| TiKick: Towards Playing Multi-agent Football Full Games from Single-agent Demonstrations | Oct 9, 2021 | Deep Reinforcement LearningStarcraft | CodeCode Available | 1 |
| Applying supervised and reinforcement learning methods to create neural-network-based agents for playing StarCraft II | Sep 26, 2021 | GPUStarcraft | CodeCode Available | 1 |
| Rethinking of AlphaStar | Aug 7, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Perceiver IO: A General Architecture for Structured Inputs & Outputs | Jul 30, 2021 | Optical Flow EstimationStarcraft | CodeCode Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| Context-Aware Sparse Deep Coordination Graphs | Jun 5, 2021 | graph constructionGraph Learning | CodeCode Available | 1 |
| Celebrating Diversity in Shared Multi-Agent Reinforcement Learning | Jun 4, 2021 | DiversityMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Cooperative Multi-Agent Reinforcement Learning with Sequential Credit Assignment | May 21, 2021 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| An Introduction of mini-AlphaStar | Apr 14, 2021 | StarcraftStarcraft II | CodeCode Available | 1 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| C-COMA: A CONTINUAL REINFORCEMENT LEARNING MODEL FOR DYNAMIC MULTIAGENT ENVIRONMENTS | Apr 5, 2021 | Continual LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games | Mar 2, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Rethinking the Implementation Matters in Cooperative Multi-Agent Reinforcement Learning | Feb 6, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game | Nov 27, 2020 | AI AgentImitation Learning | CodeCode Available | 1 |
| TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning | Nov 25, 2020 | Dota 2Multi-agent Reinforcement Learning | CodeCode Available | 1 |
| Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning | Oct 9, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| RODE: Learning Roles to Decompose Multi-Agent Tasks | Oct 4, 2020 | ClusteringStarcraft | CodeCode Available | 1 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Off-Policy Multi-Agent Decomposed Policy Gradients | Jul 24, 2020 | Multi-agent Reinforcement LearningStarcraft | CodeCode Available | 1 |