PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation Jun 6, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0Model-Based Reinforcement Learning with Multi-Task Offline Pretraining Jun 6, 2023 Knowledge Distillation Model-based Reinforcement Learning
Code Code Available 0Mildly Constrained Evaluation Policy for Offline Reinforcement Learning Jun 6, 2023 D4RL MuJoCo
Code Code Available 0Boosting Offline Reinforcement Learning with Action Preference Query Jun 6, 2023 Autonomous Driving D4RL
— Unverified 0CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments Jun 6, 2023 Hierarchical Reinforcement Learning Navigate
— Unverified 0A Novel Multi-Agent Deep RL Approach for Traffic Signal Control Jun 5, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0A General Perspective on Objectives of Reinforcement Learning Jun 5, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Action-Evolution Petri Nets: a Framework for Modeling and Solving Dynamic Task Assignment Problems Jun 5, 2023 Reinforcement Learning (RL)
— Unverified 0Risk-Aware Reward Shaping of Reinforcement Learning Agents for Autonomous Driving Jun 5, 2023 Autonomous Driving Motion Planning
Code Code Available 0Survival Instinct in Offline Reinforcement Learning Jun 5, 2023 Offline RL reinforcement-learning
— Unverified 0Cycle Consistency Driven Object Discovery Jun 3, 2023 Object Object Discovery
— Unverified 0Improving the generalizability and robustness of large-scale traffic signal control Jun 2, 2023 Deep Reinforcement Learning Distributional Reinforcement Learning
— Unverified 0Interpretable and Explainable Logical Policies via Neurally Guided Symbolic Abstraction Jun 2, 2023 Reinforcement Learning (RL)
— Unverified 0Efficient Reinforcement Learning with Impaired Observability: Learning to Act with Delayed and Missing State Observations Jun 2, 2023 Reinforcement Learning (RL)
— Unverified 0An Architecture for Deploying Reinforcement Learning in Industrial Environments Jun 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Deep Q-Learning versus Proximal Policy Optimization: Performance Comparison in a Material Sorting Task Jun 2, 2023 Deep Reinforcement Learning Q-Learning
— Unverified 0A Modular Test Bed for Reinforcement Learning Incorporation into Industrial Applications Jun 2, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Hyperparameters in Reinforcement Learning and How To Tune Them Jun 2, 2023 AutoML Deep Reinforcement Learning
— Unverified 0Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces Jun 2, 2023 Attribute reinforcement-learning
Code Code Available 0Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space Jun 2, 2023 Reinforcement Learning (RL)
— Unverified 0Non-stationary Reinforcement Learning under General Function Approximation Jun 1, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Normalization Enhances Generalization in Visual Reinforcement Learning Jun 1, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Heterogeneous Knowledge for Augmented Modular Reinforcement Learning Jun 1, 2023 Decision Making reinforcement-learning
— Unverified 0Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding Jun 1, 2023 Management Offline RL
— Unverified 0Identifiability and Generalizability in Constrained Inverse Reinforcement Learning Jun 1, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning Jun 1, 2023 Fairness Offline RL
— Unverified 0IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control Jun 1, 2023 D4RL Model-based Reinforcement Learning
— Unverified 0Replicability in Reinforcement Learning May 31, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL May 31, 2023 MuJoCo Reinforcement Learning (RL)
— Unverified 0Robust Reinforcement Learning Objectives for Sequential Recommender Systems May 30, 2023 Offline RL Recommendation Systems
Code Code Available 0Policy Optimization for Continuous Reinforcement Learning May 30, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0RL + Model-based Control: Using On-demand Optimal Control to Learn Versatile Legged Locomotion May 29, 2023 Reinforcement Learning (RL)
— Unverified 0Off-Policy RL Algorithms Can be Sample-Efficient for Continuous Control via Sample Multiple Reuse May 29, 2023 continuous-control Continuous Control
Code Code Available 0Towards a Better Understanding of Representation Dynamics under TD-learning May 29, 2023 Reinforcement Learning (RL) Representation Learning
— Unverified 0Bridging the Sim-to-Real Gap from the Information Bottleneck Perspective May 29, 2023 Knowledge Distillation Reinforcement Learning (RL)
Code Code Available 0RLAD: Reinforcement Learning from Pixels for Autonomous Driving in Urban Environments May 29, 2023 Autonomous Driving reinforcement-learning
— Unverified 0Potential-based Credit Assignment for Cooperative RL-based Testing of Autonomous Vehicles May 28, 2023 Autonomous Vehicles counterfactual
— Unverified 0The Curious Price of Distributional Robustness in Reinforcement Learning with a Generative Model May 26, 2023 Reinforcement Learning (RL)
— Unverified 0Reinforcement Learning with Simple Sequence Priors May 26, 2023 continuous-control Continuous Control
— Unverified 0Policy Synthesis and Reinforcement Learning for Discounted LTL May 26, 2023 PAC learning reinforcement-learning
— Unverified 0Emergent Agentic Transformer from Chain of Hindsight Experience May 26, 2023 D4RL Imitation Learning
— Unverified 0Learning Interpretable Models of Aircraft Handling Behaviour by Reinforcement Learning from Human Feedback May 26, 2023 Reinforcement Learning (RL)
— Unverified 0Distributional Reinforcement Learning with Dual Expectile-Quantile Regression May 26, 2023 Continuous Control Distributional Reinforcement Learning
— Unverified 0A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents May 26, 2023 Instruction Following Reinforcement Learning (RL)
Code Code Available 0End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes May 25, 2023 Bayesian Optimisation Inductive Bias
Code Code Available 0DPOK: Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models May 25, 2023 reinforcement-learning Reinforcement Learning (RL)
Code Code Available 0Deterministic policy gradient based optimal control with probabilistic constraints May 25, 2023 Model Predictive Control reinforcement-learning
— Unverified 0Reward-Machine-Guided, Self-Paced Reinforcement Learning May 25, 2023 reinforcement-learning Reinforcement Learning
Code Code Available 0Matrix Estimation for Offline Reinforcement Learning with Low-Rank Structure May 24, 2023 Matrix Completion reinforcement-learning
— Unverified 0Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees May 24, 2023 Reinforcement Learning (RL)
Code Code Available 0