| On Reinforcement Learning for Full-length Game of StarCraft | Sep 23, 2018 | CPU, Hierarchical Reinforcement Learning | Unverified | 0 |
| On Stateful Value Factorization in Multi-Agent Reinforcement Learning | Aug 27, 2024 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Optimal Any-Angle Pathfinding on a Sphere | Apr 24, 2020 | Starcraft | Unverified | 0 |
| Optimize Neural Fictitious Self-Play in Regret Minimization Thinking | Apr 22, 2021 | Starcraft | Unverified | 0 |
| Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL | Jun 1, 2022 | Diversity, Multi-agent Reinforcement Learning | Unverified | 0 |
| POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | May 13, 2024 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning | Sep 13, 2023 | Multi-agent Reinforcement Learning, Privacy Preserving | Unverified | 0 |
| QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning | Nov 1, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning | Jun 22, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning | Sep 9, 2020 | Multi-agent Reinforcement Learning, quantile regression | Unverified | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey | Jan 27, 2020 | Cloud Computing, Decision Making | Unverified | 0 |
| Grounding Natural Language Commands to StarCraft II Game States for Narration-Guided Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning of Implicit and Explicit Control Flow in Instructions | Feb 25, 2021 | Minecraft, reinforcement-learning | Unverified | 0 |
| ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning | Feb 11, 2023 | Decision Making, reinforcement-learning | Unverified | 0 |
| Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning | Jun 15, 2022 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Revisiting the Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning | Sep 29, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents | Dec 1, 2021 | Multi-agent Reinforcement Learning, quantile regression | Unverified | 0 |
| RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents | Feb 16, 2021 | Multi-agent Reinforcement Learning, quantile regression | Unverified | 0 |
| RMIX: Risk-Sensitive Multi-Agent Reinforcement Learning | Jan 1, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Role Diversity Matters: A Study of Cooperative Training Strategies for Multi-Agent RL | Sep 29, 2021 | Diversity, Multi-agent Reinforcement Learning | Unverified | 0 |
| S2RL: Do We Really Need to Perceive All States in Deep Multi-Agent Reinforcement Learning? | Jun 20, 2022 | All, Multi-agent Reinforcement Learning | Unverified | 0 |
| SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II | Dec 24, 2020 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction | Sep 2, 2022 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients | Apr 27, 2021 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| Shapley Counterfactual Credits for Multi-Agent Reinforcement Learning | Jun 1, 2021 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning | May 13, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| SMAUG: A Sliding Multidimensional Task Window-Based MARL Framework for Adaptive Real-Time Subtask Recognition | Mar 4, 2024 | Hierarchical Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Regularized Softmax Deep Multi-Agent Q-Learning | Mar 22, 2021 | Multi-agent Reinforcement Learning, Q-Learning | Unverified | 0 |
| StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search | Jun 12, 2020 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments | Jan 9, 2024 | Imputation, Reinforcement Learning (RL) | Unverified | 0 |
| State-based Episodic Memory for Multi-Agent Reinforcement Learning | Oct 19, 2021 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Superhuman Game AI Disclosure: Expertise and Context Moderate Effects on Trust and Fairness | Jan 31, 2025 | Ethics, Fairness | Unverified | 0 |
| SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning | Mar 16, 2023 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games | Dec 8, 2022 | Continual Learning, Lifelong learning | Unverified | 0 |
| The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems | May 27, 2020 | Deep Reinforcement Learning, Starcraft | Unverified | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization | May 31, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning | Sep 28, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| Transferable Curricula through Difficulty Conditioned Generators | Jun 22, 2023 | Reinforcement Learning (RL), Starcraft | Unverified | 0 |
| Tree Search for Simultaneous Move Games via Equilibrium Approximation | Jun 14, 2024 | Starcraft | Unverified | 0 |
| Truthful Self-Play | Jun 6, 2021 | Multi-agent Reinforcement Learning, Starcraft | Unverified | 0 |
| UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning | Oct 6, 2020 | Multi-agent Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Unsupervised Hebbian Learning on Point Sets in StarCraft II | Jul 13, 2022 | Decoder, Self-Supervised Learning | Unverified | 0 |
| Value Propagation Networks | May 28, 2018 | Navigate, reinforcement-learning | Unverified | 0 |
| Variational Offline Multi-agent Skill Discovery | May 26, 2024 | Decision Making, Multi-agent Reinforcement Learning | Unverified | 0 |
| "Weak AI" is Likely to Never Become "Strong AI", So What is its Greatest Value for us? | Mar 29, 2021 | image-classification, Image Classification | Unverified | 0 |
| EXPODE: EXploiting POlicy Discrepancy for Efficient Exploration in Multi-agent Reinforcement Learning | May 30, 2023 | Efficient Exploration, Multi-agent Reinforcement Learning | Code Available | 0 |
| Clear the Fog: Combat Value Assessment in Incomplete Information Games with Convolutional Encoder-Decoders | Nov 30, 2018 | Decision Making, Decoder | Code Available | 0 |