Contrastive Preference Learning: Learning from Human Feedback without RL | Oct 20, 2023 | reinforcement-learning Reinforcement Learning (RL) | Code Available | 1
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning | Oct 19, 2023 | MuJoCo Prompt Engineering | Code Available | 1
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RL Q-Learning | Code Available | 1
SDGym: Low-Code Reinforcement Learning Environments using System Dynamics Models | Oct 19, 2023 | OpenAI Gym reinforcement-learning | Unverified | 0
On The Expressivity of Objective-Specification Formalisms in Reinforcement Learning | Oct 18, 2023 | Multi-Objective Reinforcement Learning reinforcement-learning | Unverified | 0
Learning to Optimise Climate Sensor Placement using a Transformer | Oct 18, 2023 | Deep Reinforcement Learning Management | Unverified | 0
Accelerate Presolve in Large-Scale Linear Programming via Reinforcement Learning | Oct 18, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning | Oct 18, 2023 | Policy Gradient Methods reinforcement-learning | Code Available | 0
Quality Diversity through Human Feedback: Towards Open-Ended Diversity-Driven Optimization | Oct 18, 2023 | Diversity Image Generation | Code Available | 1
Using Experience Classification for Training Non-Markovian Tasks | Oct 18, 2023 | Autonomous Driving Classification | Unverified | 0
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning | Oct 18, 2023 | Offline RL Quantization | Unverified | 0
Improving Generalization of Alignment with Human Preferences through Group Invariant Learning | Oct 18, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
Neural Packing: from Visual Sensing to Reinforcement Learning | Oct 17, 2023 | Combinatorial Optimization Motion Planning | Unverified | 0
Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning | Oct 17, 2023 | Autonomous Racing reinforcement-learning | Unverified | 0
Reinforcement learning with non-ergodic reward increments: robustness via ergodicity transformations | Oct 17, 2023 | Autonomous Driving reinforcement-learning | Code Available | 0
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Oct 16, 2023 | Chatbot Offline RL | Code Available | 0
Uncertainty-aware transfer across tasks using hybrid model-based successor feature reinforcement learning | Oct 16, 2023 | Decision Making Reinforcement Learning (RL) | Unverified | 0
Leveraging Topological Maps in Deep Reinforcement Learning for Multi-Object Navigation | Oct 16, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
AMAGO: Scalable In-Context Reinforcement Learning for Adaptive Agents | Oct 15, 2023 | In-Context Learning In-Context Reinforcement Learning | Code Available | 1
Deep Reinforcement Learning with Explicit Context Representation | Oct 15, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
LgTS: Dynamic Task Sampling using LLM-generated sub-goals for Reinforcement Learning Agents | Oct 14, 2023 | Reinforcement Learning (RL) | Unverified | 0
A Framework for Empowering Reinforcement Learning Agents with Causal Analysis: Enhancing Automated Cryptocurrency Trading | Oct 14, 2023 | Decision Making Feature Engineering | Unverified | 0
Reduced Policy Optimization for Continuous Control with Hard Constraints | Oct 14, 2023 | continuous-control Continuous Control | Code Available | 1
Hybrid Reinforcement Learning for Optimizing Pump Sustainability in Real-World Water Distribution Networks | Oct 13, 2023 | Reinforcement Learning (RL) Scheduling | Unverified | 0
Exploration with Principles for Diverse AI Supervision | Oct 13, 2023 | Reinforcement Learning (RL) Unsupervised Reinforcement Learning | Unverified | 0
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction | Oct 13, 2023 | Reinforcement Learning (RL) Unsupervised Pre-training | Code Available | 1
Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | Oct 13, 2023 | Active Learning Offline RL | Unverified | 0
Virtual Augmented Reality for Atari Reinforcement Learning | Oct 12, 2023 | Image Segmentation reinforcement-learning | Code Available | 0
Dealing with uncertainty: balancing exploration and exploitation in deep recurrent reinforcement learning | Oct 12, 2023 | Autonomous Driving reinforcement-learning | Code Available | 0
Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach | Oct 12, 2023 | Deep Reinforcement Learning Q-Learning | Code Available | 0
Novelty Detection in Reinforcement Learning with World Models | Oct 12, 2023 | Decision Making Novelty Detection | Unverified | 0
Discerning Temporal Difference Learning | Oct 12, 2023 | Reinforcement Learning (RL) | Unverified | 0
A Lightweight Calibrated Simulation Enabling Efficient Offline Learning for Optimal Control of Real Buildings | Oct 12, 2023 | Reinforcement Learning (RL) | Code Available | 0
Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RL Offline RL | Code Available | 1
Online RL in Linearly q^π-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore | Oct 11, 2023 | Reinforcement Learning (RL) | Unverified | 0
Off-Policy Evaluation for Human Feedback | Oct 11, 2023 | Off-policy evaluation Reinforcement Learning (RL) | Unverified | 0
Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking | Oct 11, 2023 | Fact Checking Misinformation | Unverified | 0
Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization | Oct 10, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | Oct 10, 2023 | Efficient Exploration Policy Gradient Methods | Unverified | 0
Spectral Entry-wise Matrix Estimation for Low-Rank Reinforcement Learning | Oct 10, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
Scalable Semantic Non-Markovian Simulation Proxy for Reinforcement Learning | Oct 10, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RL Reinforcement Learning (RL) | Unverified | 0
Aligning Language Models with Human Preferences via a Bayesian Approach | Oct 9, 2023 | Contrastive Learning Reinforcement Learning (RL) | Code Available | 1
Predictive auxiliary objectives in deep RL mimic learning in the brain | Oct 9, 2023 | Auxiliary Learning Deep Reinforcement Learning | Unverified | 0
When is Agnostic Reinforcement Learning Statistically Tractable? | Oct 9, 2023 | reinforcement-learning Reinforcement Learning | Unverified | 0
On Double Descent in Reinforcement Learning with LSTD and Random Features | Oct 9, 2023 | Deep Reinforcement Learning reinforcement-learning | Unverified | 0
Distributional Soft Actor-Critic with Three Refinements | Oct 9, 2023 | Decision Making Reinforcement Learning (RL) | Code Available | 2
Multi-timestep models for Model-based Reinforcement Learning | Oct 9, 2023 | model Model-based Reinforcement Learning | Unverified | 0
Safe Deep Policy Adaptation | Oct 8, 2023 | reinforcement-learning Reinforcement Learning | Code Available | 1
Lifelong Learning for Fog Load Balancing: A Transfer Learning Approach | Oct 8, 2023 | Lifelong learning Reinforcement Learning (RL) | Unverified | 0