| NQMIX: Non-monotonic Value Function Factorization for Deep Multi-Agent Reinforcement Learning | Apr 5, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Off-Beat Multi-Agent Reinforcement Learning | May 27, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Offline-to-Online Multi-Agent Reinforcement Learning with Offline Value Function Memory and Sequential Exploration | Oct 25, 2024 | Efficient ExplorationMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| On games and simulators as a platform for development of artificial intelligence for command and control | Oct 21, 2021 | Real-Time Strategy GamesStarcraft | —Unverified | 0 | 0 |
| On Reinforcement Learning for Full-length Game of StarCraft | Sep 23, 2018 | CPUHierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| On Stateful Value Factorization in Multi-Agent Reinforcement Learning | Aug 27, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Optimal Any-Angle Pathfinding on a Sphere | Apr 24, 2020 | Starcraft | —Unverified | 0 | 0 |
| Optimize Neural Fictitious Self-Play in Regret Minimization Thinking | Apr 22, 2021 | Starcraft | —Unverified | 0 | 0 |
| Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL | Jun 1, 2022 | DiversityMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning | May 13, 2024 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Privacy-Engineered Value Decomposition Networks for Cooperative Multi-Agent Reinforcement Learning | Sep 13, 2023 | Multi-agent Reinforcement LearningPrivacy Preserving | —Unverified | 0 | 0 |
| QFree: A Universal Value Function Factorization for Multi-Agent Reinforcement Learning | Nov 1, 2023 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| QTRAN++: Improved Value Transformation for Cooperative Multi-Agent Reinforcement Learning | Jun 22, 2020 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| QR-MIX: Distributional Value Function Factorisation for Cooperative Multi-Agent Reinforcement Learning | Sep 9, 2020 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 | 0 |
| Reflection of Episodes: Learning to Play Game from Expert and Self Experiences | Feb 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Reinforcement Learning-based Application Autoscaling in the Cloud: A Survey | Jan 27, 2020 | Cloud ComputingDecision Making | —Unverified | 0 | 0 |
| Reinforcement Learning for the Beginning of Starcraft II Game | Dec 14, 2020 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Learning of Implicit and Explicit Control Flow in Instructions | Feb 25, 2021 | Minecraftreinforcement-learning | —Unverified | 0 | 0 |
| ReMIX: Regret Minimization for Monotonic Value Function Factorization in Multiagent Reinforcement Learning | Feb 11, 2023 | Decision Makingreinforcement-learning | —Unverified | 0 | 0 |
| Revisiting Some Common Practices in Cooperative Multi-Agent Reinforcement Learning | Jun 15, 2022 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Revisiting the Monotonicity Constraint in Cooperative Multi-Agent Reinforcement Learning | Sep 29, 2021 | Multi-agent Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents | Dec 1, 2021 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 | 0 |
| RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents | Feb 16, 2021 | Multi-agent Reinforcement Learningquantile regression | —Unverified | 0 | 0 |