A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings Jun 16, 2022 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Understanding Deep Neural Function Approximation in Reinforcement Learning via ε-Greedy Exploration Sep 15, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization Dec 1, 2021 Model-based Reinforcement Learning reinforcement-learning
— Unverified 0Understanding & Generalizing AlphaGo Zero May 1, 2019 Decision Making reinforcement-learning
— Unverified 0Understanding Hindsight Goal Relabeling from a Divergence Minimization Perspective Sep 26, 2022 Imitation Learning Multi-Goal Reinforcement Learning
— Unverified 0The Importance of Online Data: Understanding Preference Fine-tuning via Coverage Jun 3, 2024 Reinforcement Learning (RL)
— Unverified 0Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization Mar 31, 2023 Offline RL Q-Learning
— Unverified 0Understanding Self-Predictive Learning for Reinforcement Learning Dec 6, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Understanding the Complexity Gains of Single-Task RL with a Curriculum Dec 24, 2022 Reinforcement Learning (RL)
— Unverified 0Understanding the Generalization Gap in Visual Reinforcement Learning Sep 29, 2021 Data Augmentation Deep Reinforcement Learning
— Unverified 0Understanding the Limits of Poisoning Attacks in Episodic Reinforcement Learning Aug 29, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Understanding the Pathologies of Approximate Policy Evaluation when Combined with Greedification in Reinforcement Learning Oct 28, 2020 Reinforcement Learning (RL)
— Unverified 0Understanding the Relation Between Maximum-Entropy Inverse Reinforcement Learning and Behaviour Cloning Mar 27, 2019 continuous-control Continuous Control
— Unverified 0Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning Mar 10, 2023 Deep Reinforcement Learning Diversity
— Unverified 0Understanding the World to Solve Social Dilemmas Using Multi-Agent Reinforcement Learning May 19, 2023 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning Feb 10, 2022 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence Feb 5, 2024 continuous-control Continuous Control
— Unverified 0Undirected Machine Translation with Discriminative Reinforcement Learning Apr 1, 2014 Language Modelling Machine Translation
— Unverified 0UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning Oct 6, 2020 Multi-agent Reinforcement Learning reinforcement-learning
— Unverified 0UNEX-RL: Reinforcing Long-Term Rewards in Multi-Stage Recommender Systems with UNidirectional EXecution Jan 12, 2024 Multi-agent Reinforcement Learning Recommendation Systems
— Unverified 0Reinforcement Learning in Credit Scoring and Underwriting Dec 15, 2022 Decision Making Efficient Exploration
— Unverified 0UniCon: Universal Neural Controller For Physics-based Character Motion Nov 30, 2020 Deep Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Unified Algorithms for RL with Decision-Estimation Coefficients: PAC, Reward-Free, Preference-Based Learning, and Beyond Sep 23, 2022 PAC learning Reinforcement Learning (RL)
— Unverified 0Unified Conversational Recommendation Policy Learning via Graph-based Reinforcement Learning May 20, 2021 Attribute Conversational Recommendation
— Unverified 0Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents Apr 3, 2023 Deep Reinforcement Learning Offline RL
— Unverified 0Unified Locomotion Transformer with Simultaneous Sim-to-Real Transfer for Quadrupeds Mar 12, 2025 Deep Reinforcement Learning Knowledge Distillation
— Unverified 0Unified Policy Optimization for Continuous-action Reinforcement Learning in Non-stationary Tasks and Games Aug 19, 2022 MuJoCo Reinforcement Learning (RL)
— Unverified 0Unified Reinforcement Q-Learning for Mean Field Game and Control Problems Jun 24, 2020 Q-Learning Reinforcement Learning (RL)
— Unverified 0Uniform-PAC Bounds for Reinforcement Learning with Linear Function Approximation Jun 22, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Uniform-PAC Guarantees for Model-Based RL with Bounded Eluder Dimension May 15, 2023 Open-Ended Question Answering Reinforcement Learning (RL)
— Unverified 0Uniform State Abstraction For Reinforcement Learning Apr 6, 2020 continuous-control Continuous Control
— Unverified 0Unifying Causal Inference and Reinforcement Learning using Higher-Order Category Theory Sep 13, 2022 Causal Inference reinforcement-learning
— Unverified 0Unifying Ensemble Methods for Q-learning via Social Choice Theory Feb 27, 2019 Diversity Q-Learning
— Unverified 0Unifying task specification in reinforcement learning Sep 7, 2016 reinforcement-learning Reinforcement Learning
— Unverified 0Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming Oct 30, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Universal Activation Function For Machine Learning Nov 7, 2020 BIG-bench Machine Learning General Classification
— Unverified 0Universal Agent for Disentangling Environments and Tasks Jan 1, 2018 Hierarchical Reinforcement Learning reinforcement-learning
— Unverified 0Universal Agent Mixtures and the Geometry of Intelligence Feb 13, 2023 Multi-agent Reinforcement Learning Reinforcement Learning (RL)
— Unverified 0Universal Distributional Decision-based Black-box Adversarial Attack with Reinforcement Learning Nov 15, 2022 Adversarial Attack reinforcement-learning
— Unverified 0Universal Learning Waveform Selection Strategies for Adaptive Target Tracking Feb 10, 2022 reinforcement-learning Reinforcement Learning (RL)
— Unverified 0Universal Successor Features Based Deep Reinforcement Learning for Navigation Jun 17, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Universal Successor Features for Transfer Reinforcement Learning Jan 5, 2020 MuJoCo reinforcement-learning
— Unverified 0Universal Successor Representations for Transfer Reinforcement Learning Apr 11, 2018 reinforcement-learning Reinforcement Learning
— Unverified 0Universal Trading for Order Execution with Oracle Policy Distillation Jan 28, 2021 Algorithmic Trading reinforcement-learning
— Unverified 0UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning May 20, 2025 Large Language Model Multimodal Large Language Model
— Unverified 0UniZero: Generalized and Efficient Planning with Scalable Latent World Models Jun 15, 2024 Multi-Task Learning Reinforcement Learning (RL)
— Unverified 0Unlearning Works Better Than You Think: Local Reinforcement-Based Selection of Auxiliary Objectives Apr 19, 2025 Reinforcement Learning (RL)
— Unverified 0Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem Jun 3, 2025 GPU Math
— Unverified 0Unlocking Pixels for Reinforcement Learning via Implicit Attention Feb 8, 2021 reinforcement-learning Reinforcement Learning
— Unverified 0Unlocking the Potential of Simulators: Design with RL in Mind Jun 8, 2017 Decision Making Friction
— Unverified 0