| That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities | Sep 29, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| The 2017 AIBIRDS Competition | Mar 14, 2018 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| The 37 Implementation Details of Proximal Policy Optimization | Jan 17, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| The Adversarial Resilience Learning Architecture for AI-based Modelling, Exploration, and Operation of Complex Cyber-Physical Systems | May 27, 2020 | Deep Reinforcement LearningStarcraft | —Unverified | 0 | 0 |
| The Architectural Implications of Distributed Reinforcement Learning on CPU-GPU Systems | Dec 8, 2020 | CPUDeep Reinforcement Learning | —Unverified | 0 | 0 |
| The Bottleneck Simulator: A Model-based Deep Reinforcement Learning Approach | Jul 12, 2018 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| The Case for Automatic Database Administration using Deep Reinforcement Learning | Jan 17, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| The Communication and Computation Trade-off in Wireless Semantic Communications | Apr 14, 2025 | Deep Reinforcement LearningSemantic Communication | —Unverified | 0 | 0 |
| The Cost of Learning: Efficiency vs. Efficacy of Learning-Based RRM for 6G | Nov 30, 2022 | Deep Reinforcement LearningFriction | —Unverified | 0 | 0 |
| The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jun 16, 2025 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |