Value Function Decomposition for Iterative Design of Reinforcement Learning Agents Jun 24, 2022 Decision Making reinforcement-learning
— Unverified 0Instance-dependent _-bounds for policy evaluation in tabular reinforcement learning Sep 19, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Value function interference and greedy action selection in value-based multi-objective reinforcement learning Feb 9, 2024 Multi-Objective Reinforcement Learning Q-Learning
— Unverified 0Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning Nov 4, 2021 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Dexterous In-hand Manipulation by Guiding Exploration with Simple Sub-skill Controllers Mar 6, 2023 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF May 29, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0Value of Information and Reward Specification in Active Inference and POMDPs Aug 13, 2024 Bayesian Inference Decision Making
— Unverified 0Value Penalized Q-Learning for Recommender Systems Oct 15, 2021 Offline RL Q-Learning
— Unverified 0Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning Jan 27, 2019 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Value Propagation Networks May 28, 2018 Navigate reinforcement-learning
— Unverified 0Value Pursuit Iteration Dec 1, 2012 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Value Refinement Network (VRN) Sep 29, 2021 Q-Learning Reinforcement Learning (RL)
— Unverified 0Value Summation: A Novel Scoring Function for MPC-based Model-based Reinforcement Learning Sep 16, 2022 Model-based Reinforcement Learning MuJoCo
— Unverified 0VARD: Efficient and Dense Fine-Tuning for Diffusion Models with Value-based RL May 21, 2025 Reinforcement Learning (RL)
— Unverified 0Variable Compliance Control for Robotic Peg-in-Hole Assembly: A Deep Reinforcement Learning Approach Aug 24, 2020 Deep Reinforcement Learning Position
— Unverified 0Variable Gain Gradient Descent-based Reinforcement Learning for Robust Optimal Tracking Control of Uncertain Nonlinear System with Input-Constraints Jun 15, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Variable Impedance Control in End-Effector Space: An Action Space for Reinforcement Learning in Contact-Rich Tasks Jun 20, 2019 Contact-rich Manipulation Reinforcement Learning
— Unverified 0Variance-Aware Off-Policy Evaluation with Linear Function Approximation Jun 22, 2021 Off-policy evaluation Reinforcement Learning (RL)
— Unverified 0Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs Mar 5, 2018 LEMMA reinforcement-learning
— Unverified 0Variance-aware robust reinforcement learning with linear function approximation under heavy-tailed rewards Mar 9, 2023 Decision Making regression
— Unverified 0Variance-Based Risk Estimations in Markov Processes via Transformation with State Lumping Jul 9, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Variance Reduced Advantage Estimation with δ Hindsight Credit Assignment Nov 19, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Variance-Reduced Conservative Policy Iteration Dec 12, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Variance-Reduced Off-Policy Memory-Efficient Policy Search Sep 14, 2020 Reinforcement Learning (RL) Stochastic Optimization
— Unverified 0Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient Jul 25, 2020 Q-Learning reinforcement-learning
— Unverified 0Variance Reduction for Evolution Strategies via Structured Control Variates May 29, 2019 Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization Jun 14, 2022 Policy Gradient Methods Reinforcement Learning (RL)
— Unverified 0Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines Mar 20, 2018 Deep Reinforcement Learning Policy Gradient Methods
— Unverified 0Variance Reduction for Reinforcement Learning in Input-Driven Environments Jul 6, 2018 Meta-Learning MuJoCo
— Unverified 0Variance Reduction Methods for Sublinear Reinforcement Learning Feb 26, 2018 Q-Learning reinforcement-learning
— Unverified 0Variational Adaptive-Newton Method for Explorative Learning Nov 15, 2017 Active Learning reinforcement-learning
— Unverified 0Variational Bayes: A report on approaches and applications May 26, 2019 Bayesian Inference Continual Learning
— Unverified 0Variational Bayesian Reinforcement Learning with Regret Bounds Jul 25, 2018 Q-Learning reinforcement-learning
— Unverified 0Variational Constrained Reinforcement Learning with Application to Planning at Roundabout Sep 25, 2019 Autonomous Driving reinforcement-learning
— Unverified 0Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning Oct 17, 2020 Deep Reinforcement Learning Efficient Exploration
— Unverified 0Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning Jun 2, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Variational Inference for Model-Free and Model-Based Reinforcement Learning Sep 4, 2022 Bayesian Inference Bayesian Optimization
— Unverified 0Variational Inference for Policy Gradient Feb 21, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Variational Inference MPC for Bayesian Model-based Reinforcement Learning Jul 8, 2019 Bayesian Inference Model-based Reinforcement Learning
— Unverified 0Variational Intrinsic Control Revisited Oct 7, 2020 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Variational Inverse Control with Events: A General Framework for Data-Driven Reward Definition May 29, 2018 continuous-control Continuous Control
— Unverified 0Variational Meta Reinforcement Learning for Social Robotics Jun 7, 2022 Meta Reinforcement Learning Navigate
— Unverified 0Variational Model-based Policy Optimization Jun 9, 2020 continuous-control Continuous Control
— Unverified 0Variational multiscale reinforcement learning for discovering reduced order closure models of nonlinear spatiotemporal transport systems Jul 7, 2022 Reinforcement Learning (RL)
— Unverified 0Variational oracle guiding for reinforcement learning Sep 29, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0Variational Policy Gradient Method for Reinforcement Learning with General Utilities Jul 4, 2020 reinforcement-learning Reinforcement Learning
— Unverified 0Variational quantum compiling with double Q-learning Mar 22, 2021 Q-Learning Reinforcement Learning (RL)
— Unverified 0Parametrized quantum policies for reinforcement learning Mar 9, 2021 Benchmarking reinforcement-learning
— Unverified 0Policy Gradients using Variational Quantum Circuits Mar 20, 2022 Benchmarking Quantum Machine Learning
— Unverified 0Variational Quantum Reinforcement Learning via Evolutionary Optimization Sep 1, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0