| Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians | Feb 25, 2021 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Which Channel to Ask My Question? Personalized Customer Service RequestStream Routing using DeepReinforcement Learning | Nov 24, 2019 | ChatbotDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning | Feb 28, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 | 0 |
| Why Target Networks Stabilise Temporal Difference Methods | Feb 24, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks | Nov 25, 2024 | Atari Gamescounterfactual | —Unverified | 0 | 0 |
| Wind Power Forecasting Considering Data Privacy Protection: A Federated Deep Reinforcement Learning Approach | Nov 2, 2022 | Deep Reinforcement LearningFederated Learning | —Unverified | 0 | 0 |
| Winning at Any Cost -- Infringing the Cartel Prohibition With Reinforcement Learning | Jul 5, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Winning Isn't Everything: Enhancing Game Development with Intelligent Agents | Mar 25, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| Wireless Resource Allocation with Collaborative Distributed and Centralized DRL under Control Channel Attacks | Nov 16, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |