Visual Tracking by means of Deep Reinforcement Learning and an Expert Demonstrator Sep 18, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter Aug 13, 2020 Deep Reinforcement Learning Object
— Unverified 0ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data Mar 23, 2025 Reinforcement Learning (RL)
— Unverified 0Vizarel: A System to Help Better Understand RL Agents Jul 10, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0VLMLight: Traffic Signal Control via Vision-Language Meta-Control and Dual-Branch Reasoning May 26, 2025 Large Language Model Reinforcement Learning (RL)
— Unverified 0VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making May 6, 2025 Decision Making General Knowledge
— Unverified 0VLM-RL: A Unified Vision Language Models and Reinforcement Learning Framework for Safe Autonomous Driving Dec 20, 2024 Autonomous Driving Computational Efficiency
— Unverified 0VLP: Vision-Language Preference Learning for Embodied Manipulation Feb 17, 2025 Reinforcement Learning (RL)
— Unverified 0VL-SAFE: Vision-Language Guided Safety-Aware Reinforcement Learning with World Models for Autonomous Driving May 22, 2025 Autonomous Driving Reinforcement Learning (RL)
— Unverified 0VMAV-C: A Deep Attention-based Reinforcement Learning Algorithm for Model-based Control Dec 24, 2018 Deep Attention Model-based Reinforcement Learning
— Unverified 0vMFER: Von Mises-Fisher Experience Resampling Based on Uncertainty of Gradient Directions for Policy Improvement May 14, 2024 Decision Making Reinforcement Learning (RL)
— Unverified 0VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play Feb 4, 2025 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation Dec 12, 2022 Q-Learning regression
— Unverified 0Voting-Based Multi-Agent Reinforcement Learning for Intelligent IoT Jul 2, 2019 Decision Making Multi-agent Reinforcement Learning
— Unverified 0VPE: Variational Policy Embedding for Transfer Reinforcement Learning Sep 10, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0VRAIL: Vectorized Reward-based Attribution for Interpretable Learning Jun 19, 2025 Reinforcement Learning (RL)
— Unverified 0VRLS: A Unified Reinforcement Learning Scheduler for Vehicle-to-Vehicle Communications Jul 22, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning Feb 11, 2025 Decision Making reinforcement-learning
— Unverified 0Vulcan: Solving the Steiner Tree Problem with Graph Neural Networks and Deep Reinforcement Learning Nov 21, 2021 Combinatorial Optimization Deep Reinforcement Learning
— Unverified 0Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics Sep 2, 2020 Reinforcement Learning (RL)
— Unverified 0WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving Aug 27, 2021 Atari Games Autonomous Driving
— Unverified 0Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning Nov 6, 2022 Decision Making Offline RL
— Unverified 0Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap Jun 20, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Warmth and competence in human-agent cooperation Jan 31, 2022 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes Jul 3, 2024 Reinforcement Learning (RL)
— Unverified 0Warren at SemEval-2020 Task 4: ALBERT and Multi-Task Learning for Commonsense Validation Dec 1, 2020 Multi-Task Learning reinforcement-learning
— Unverified 0Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control Mar 4, 2023 MuJoCo Q-Learning
— Unverified 0Wasserstein Adversarial Imitation Learning Jun 19, 2019 Imitation Learning reinforcement-learning
— Unverified 0Wasserstein Dependency Measure for Representation Learning Mar 28, 2019 Object Recognition reinforcement-learning
— Unverified 0Wasserstein Robust Reinforcement Learning Jul 30, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Wasserstein Unsupervised Reinforcement Learning Oct 15, 2021 Hierarchical Reinforcement Learning MuJoCo
— Unverified 0Watch from sky: machine-learning-based multi-UAV network for predictive police surveillance Mar 6, 2022 BIG-bench Machine Learning reinforcement-learning
— Unverified 0Stop-and-Go: Exploring Backdoor Attacks on Deep Reinforcement Learning-based Traffic Congestion Control Systems Mar 17, 2020 Autonomous Vehicles Deep Reinforcement Learning
— Unverified 0WaveCorr: Deep Reinforcement Learning with Permutation Invariant Policy Networks for Portfolio Management Sep 29, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in Dialog Jun 30, 2019 Deep Reinforcement Learning Open-Domain Dialog
— Unverified 0Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog Jan 1, 2020 Deep Reinforcement Learning OpenAI Gym
— Unverified 0On L_2-consistency of nearest neighbor matching Feb 6, 2019 Causal Inference Domain Adaptation
— Unverified 0Weakly Supervised Disentangled Representation for Goal-conditioned Reinforcement Learning Feb 28, 2022 Position reinforcement-learning
— Unverified 0Weakly-Supervised Learning of Disentangled and Interpretable Skills for Hierarchical Reinforcement Learning Sep 29, 2021 Decoder Hierarchical Reinforcement Learning
— Unverified 0Weakly-Supervised Reinforcement Learning for Controllable Behavior Apr 6, 2020 continuous-control Continuous Control
— Unverified 0Weakly Supervised Video Summarization by Hierarchical Reinforcement Learning Jan 12, 2020 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Weakness Analysis of Cyberspace Configuration Based on Reinforcement Learning Jul 9, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Weber-Fechner Law in Temporal Difference learning derived from Control as Inference Dec 30, 2024 Reinforcement Learning (RL)
— Unverified 0WebWISE: Web Interface Control and Sequential Exploration with Large Language Models Oct 24, 2023 Imitation Learning In-Context Learning
— Unverified 0Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates Jan 1, 2021 Deep Reinforcement Learning Q-Learning
— Unverified 0Weighted Double Deep Multiagent Reinforcement Learning in Stochastic Cooperative Environments Feb 23, 2018 Deep Reinforcement Learning Q-Learning
— Unverified 0Weighted Entropy Modification for Soft Actor-Critic Nov 18, 2020 MuJoCo reinforcement-learning
— Unverified 0Weighted Likelihood Policy Search with Model Selection Dec 1, 2012 model Model Selection
— Unverified 0Weighted Maximum Entropy Inverse Reinforcement Learning Aug 20, 2022 Imitation Learning reinforcement-learning
— Unverified 0Weighted model estimation for offline model-based reinforcement learning Dec 1, 2021 Density Ratio Estimation model
— Unverified 0