Why Online Reinforcement Learning is Causal Mar 7, 2024 counterfactual Offline RL
— Unverified 0Why Pay More When You Can Pay Less: A Joint Learning Framework for Active Feature Acquisition and Classification Sep 18, 2017 General Classification Reinforcement Learning
— Unverified 0Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. Sep 29, 2021 continuous-control Continuous Control
— Unverified 0Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters May 27, 2022 D4RL Offline RL
— Unverified 0Widely Used and Fast De Novo Drug Design by a Protein Sequence-Based Reinforcement Learning Model Aug 14, 2022 Drug Design Drug Discovery
— Unverified 0Wield: Systematic Reinforcement Learning With Progressive Randomization Sep 15, 2019 General Classification reinforcement-learning
— Unverified 0Will it Blend? Composing Value Functions in Reinforcement Learning Jul 12, 2018 Lifelong learning reinforcement-learning
— Unverified 0Wind Power Forecasting Considering Data Privacy Protection: A Federated Deep Reinforcement Learning Approach Nov 2, 2022 Deep Reinforcement Learning Federated Learning
— Unverified 0Winning at Any Cost -- Infringing the Cartel Prohibition With Reinforcement Learning Jul 5, 2021 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Winning the CityLearn Challenge: Adaptive Optimization with Evolutionary Search under Trajectory-based Guidance Dec 4, 2022 Decision Making Reinforcement Learning (RL)
— Unverified 0Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic Jan 1, 2021 Management Reinforcement Learning (RL)
— Unverified 0Wireless 2.0: Towards an Intelligent Radio Environment Empowered by Reconfigurable Meta-Surfaces and Artificial Intelligence Feb 23, 2020 Management reinforcement-learning
— Unverified 0WiseMove: A Framework for Safe Deep Reinforcement Learning for Autonomous Driving Feb 11, 2019 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation Dec 1, 2021 continuous-control Continuous Control
— Unverified 0Words as Beacons: Guiding RL Agents with High-Level Language Prompts Oct 11, 2024 Reinforcement Learning (RL)
— Unverified 0Workflow-Guided Response Generation for Task-Oriented Dialogue Nov 14, 2023 Reinforcement Learning (RL) Response Generation
— Unverified 0World Model-Based Learning for Long-Term Age of Information Minimization in Vehicular Networks May 3, 2025 Reinforcement Learning (RL) Scheduling
— Unverified 0World Models Increase Autonomy in Reinforcement Learning Aug 19, 2024 reinforcement-learning Reinforcement Learning
— Unverified 0World of Bits: An Open-Domain Platform for Web-Based Agents Aug 1, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0World Programs for Model-Based Learning and Planning in Compositional State and Action Spaces Dec 30, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0World Value Functions: Knowledge Representation for Multitask Reinforcement Learning May 18, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0Worm-level Control through Search-based Reinforcement Learning Nov 9, 2017 reinforcement-learning Reinforcement Learning
— Unverified 0Worst-Case Regret Bounds for Exploration via Randomized Value Functions Jun 7, 2019 Efficient Exploration reinforcement-learning
— Unverified 0Worst Cases Policy Gradients Nov 9, 2019 Deep Reinforcement Learning reinforcement-learning
— Unverified 0X-MEN: Guaranteed XOR-Maximum Entropy Constrained Inverse Reinforcement Learning Mar 22, 2022 reinforcement-learning Reinforcement Learning
— Unverified 0xMTF: A Formula-Free Model for Reinforcement-Learning-Based Multi-Task Fusion in Recommender Systems Apr 8, 2025 Multi-Task Learning Recommendation Systems
— Unverified 0X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real May 11, 2025 Domain Adaptation Imitation Learning
— Unverified 0Yes, Q-learning Helps Offline In-Context RL Feb 24, 2025 In-Context Reinforcement Learning MuJoCo
— Unverified 0You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL Oct 5, 2021 D4RL Offline RL
— Unverified 0You Only Live Once: Single-Life Reinforcement Learning Oct 17, 2022 continuous-control Continuous Control
— Unverified 0Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization May 19, 2025 Offline RL Portfolio Optimization
— Unverified 0Zermelo's problem: Optimal point-to-point navigation in 2D turbulent flows using Reinforcement Learning Jul 17, 2019 Navigate Reinforcement Learning
— Unverified 0Zero-Shot Action Generalization with Limited Observations Mar 11, 2025 Decision Making Reinforcement Learning (RL)
— Unverified 0Zero-Shot Generalization of Vision-Based RL Without Data Augmentation Oct 9, 2024 Data Augmentation Disentanglement
— Unverified 0Zero Shot Learning on Simulated Robots Oct 4, 2019 reinforcement-learning Reinforcement Learning
— Unverified 0Zero-Shot Policy Transfer with Disentangled Attention Sep 25, 2019 Deep Reinforcement Learning Domain Adaptation
— Unverified 0Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning Oct 1, 2022 Disentanglement Meta Reinforcement Learning
— Unverified 0PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation Jun 6, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty May 11, 2021 Decision Making Deep Reinforcement Learning
— Unverified 0Zero-Shot Reinforcement Learning with Deep Attention Convolutional Neural Networks Jan 2, 2020 Autonomous Driving Deep Attention
— Unverified 0Zero-Shot Reward Specification via Grounded Natural Language Sep 29, 2021 Reinforcement Learning (RL)
— Unverified 0Sim-to-Real Transfer of Robot Learning with Variable Length Inputs Sep 20, 2018 Decision Making Deep Reinforcement Learning
— Unverified 0Zero-shot Text Classification via Reinforced Self-training Jul 1, 2020 Classification General Classification
— Unverified 0Zero-Shot Transfer with Deictic Object-Oriented Representation in Reinforcement Learning Dec 1, 2018 Object reinforcement-learning
— Unverified 0Zero-Shot Uncertainty-Aware Deployment of Simulation Trained Policies on Real-World Robots Dec 10, 2021 continuous-control Continuous Control
— Unverified 0Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach May 3, 2024 Q-Learning reinforcement-learning
— Unverified 0Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer Feb 2, 2025 Reinforcement Learning (RL) Video Generation
— Unverified 0Zeroth-Order Optimization is Secretly Single-Step Policy Optimization Jun 17, 2025 Reinforcement Learning (RL)
— Unverified 0Zeroth-Order Supervised Policy Improvement Jun 11, 2020 continuous-control Continuous Control
— Unverified 0Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning Apr 6, 2021 Action Classification Action Detection
— Unverified 0