| Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter | Aug 13, 2020 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation | Mar 7, 2024 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| VoI-Driven Joint Optimization of Control and Communication in Vehicular Digital Twin Network | May 12, 2025 | Deep Reinforcement Learning | —Unverified | 0 |
| VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control | Feb 1, 2018 | Deep Reinforcement LearningDomain Adaptation | —Unverified | 0 |
| Vulcan: Solving the Steiner Tree Problem with Graph Neural Networks and Deep Reinforcement Learning | Nov 21, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Language and Culture Internalisation for Human-Like Autotelic AI | Jun 2, 2022 | AttributeCultural Vocal Bursts Intensity Prediction | —Unverified | 0 |
| WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving | Aug 27, 2021 | Atari GamesAutonomous Driving | —Unverified | 0 |
| Warm-Start AlphaZero Self-Play Search Enhancements | Apr 26, 2020 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Warmth and competence in human-agent cooperation | Jan 31, 2022 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement Learning | Jan 17, 2025 | Computational EfficiencyDeep Reinforcement Learning | —Unverified | 0 |
| Wastewater Treatment Plant Data for Nutrient Removal System | Jul 7, 2024 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems | Mar 17, 2020 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| WaveCorr: Deep Reinforcement Learning with Permutation Invariant Policy Networks for Portfolio Management | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | Jun 30, 2019 | Deep Reinforcement LearningOpen-Domain Dialog | —Unverified | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog | Jan 1, 2020 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Weakly Coupled Deep Q-Networks | Oct 28, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates | Jan 1, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Weighted Quantum Channel Compiling through Proximal Policy Optimization | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| What deep reinforcement learning tells us about human motor learning and vice-versa | Aug 23, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks | Nov 3, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning | Jan 6, 2019 | Deep Reinforcement LearningQuestion Answering | —Unverified | 0 |
| When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network | Sep 22, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning | Feb 26, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| When Multiple Agents Learn to Schedule: A Distributed Radio Resource Management Framework | Jun 20, 2019 | Deep Reinforcement LearningManagement | —Unverified | 0 |