| Visual Spatial Attention and Proprioceptive Data-Driven Reinforcement Learning for Robust Peg-in-Hole Task Under Variable Conditions | Dec 27, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator | Sep 18, 2019 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter | Aug 13, 2020 | Deep Reinforcement Learning, Object | —Unverified | 0 | 0 |
| Vlearn: Off-Policy Learning with Efficient State-Value Function Estimation | Mar 7, 2024 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| VoI-Driven Joint Optimization of Control and Communication in Vehicular Digital Twin Network | May 12, 2025 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| VR-Goggles for Robots: Real-to-sim Domain Adaptation for Visual Control | Feb 1, 2018 | Deep Reinforcement Learning, Domain Adaptation | —Unverified | 0 | 0 |
| Vulcan: Solving the Steiner Tree Problem with Graph Neural Networks and Deep Reinforcement Learning | Nov 21, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Language and Culture Internalisation for Human-Like Autotelic AI | Jun 2, 2022 | Attribute, Cultural Vocal Bursts Intensity Prediction | —Unverified | 0 | 0 |
| WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving | Aug 27, 2021 | Atari Games, Autonomous Driving | —Unverified | 0 | 0 |
| Warm-Start AlphaZero Self-Play Search Enhancements | Apr 26, 2020 | Board Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Warmth and competence in human-agent cooperation | Jan 31, 2022 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Wasserstein Adaptive Value Estimation for Actor-Critic Reinforcement Learning | Jan 17, 2025 | Computational Efficiency, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Wastewater Treatment Plant Data for Nutrient Removal System | Jul 7, 2024 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems | Mar 17, 2020 | Autonomous Vehicles, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| WaveCorr: Deep Reinforcement Learning with Permutation Invariant Policy Networks for Portfolio Management | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog | Jun 30, 2019 | Deep Reinforcement Learning, Open-Domain Dialog | —Unverified | 0 | 0 |
| Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog | Jan 1, 2020 | Deep Reinforcement Learning, OpenAI Gym | —Unverified | 0 | 0 |
| Weakly Coupled Deep Q-Networks | Oct 28, 2023 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates | Jan 1, 2021 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments | Feb 23, 2018 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Weighted Quantum Channel Compiling through Proximal Policy Optimization | Nov 3, 2021 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| What deep reinforcement learning tells us about human motor learning and vice-versa | Aug 23, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| What Robot do I Need? Fast Co-Adaptation of Morphology and Control using Graph Neural Networks | Nov 3, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| What Should I Do Now? Marrying Reinforcement Learning and Symbolic Planning | Jan 6, 2019 | Deep Reinforcement Learning, Question Answering | —Unverified | 0 | 0 |
| When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network | Sep 22, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning | Feb 26, 2020 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| When Multiple Agents Learn to Schedule: A Distributed Radio Resource Management Framework | Jun 20, 2019 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning | Sep 8, 2021 | Adversarial Attack, continuous-control | —Unverified | 0 | 0 |
| Where Off-Policy Deep Reinforcement Learning Fails | Sep 27, 2018 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Where to go: Agent Guidance with Deep Reinforcement Learning in A City-Scale Online Ride-Hailing Service | Dec 12, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians | Feb 25, 2021 | Collision Avoidance, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Which Channel to Ask My Question? Personalized Customer Service Request Stream Routing using Deep Reinforcement Learning | Nov 24, 2019 | Chatbot, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Why Do Animals Need Shaping? A Theory of Task Composition and Curriculum Learning | Feb 28, 2024 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement Learning, Off-policy evaluation | —Unverified | 0 | 0 |
| Why Target Networks Stabilise Temporal Difference Methods | Feb 24, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Why the Agent Made that Decision: Explaining Deep Reinforcement Learning with Vision Masks | Nov 25, 2024 | Atari Games, counterfactual | —Unverified | 0 | 0 |
| Wind Power Forecasting Considering Data Privacy Protection: A Federated Deep Reinforcement Learning Approach | Nov 2, 2022 | Deep Reinforcement Learning, Federated Learning | —Unverified | 0 | 0 |
| Winning at Any Cost -- Infringing the Cartel Prohibition With Reinforcement Learning | Jul 5, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Winning Isn't Everything: Enhancing Game Development with Intelligent Agents | Mar 25, 2019 | Deep Reinforcement Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Wireless Resource Allocation with Collaborative Distributed and Centralized DRL under Control Channel Attacks | Nov 16, 2024 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| WiseMove: A Framework for Safe Deep Reinforcement Learning for Autonomous Driving | Feb 11, 2019 | Autonomous Driving, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| World Models for General Surgical Grasping | May 28, 2024 | Deep Reinforcement Learning, Pose Estimation | —Unverified | 0 | 0 |
| Worst Cases Policy Gradients | Nov 9, 2019 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| XDQN: Inherently Interpretable DQN through Mimicking | Jan 8, 2023 | Deep Reinforcement Learning, Management | —Unverified | 0 | 0 |
| XLVIN: eXecuted Latent Value Iteration Nets | Oct 25, 2020 | Deep Reinforcement Learning, Graph Representation Learning | —Unverified | 0 | 0 |
| Zero-shot Deep Reinforcement Learning Driving Policy Transfer for Autonomous Vehicles based on Robust Control | Dec 7, 2018 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 | 0 |
| Zero-Shot Policy Transfer with Disentangled Attention | Sep 25, 2019 | Deep Reinforcement Learning, Domain Adaptation | —Unverified | 0 | 0 |
| Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty | May 11, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Sim-to-Real Transfer of Robot Learning with Variable Length Inputs | Sep 20, 2018 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots | Dec 10, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |