QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control Jun 15, 2023 CPU Deep Reinforcement Learning
Code Code Available 2Datasets and Benchmarks for Offline Safe Reinforcement Learning Jun 15, 2023 Autonomous Driving Benchmarking
Code Code Available 2Real-Time Network-Level Traffic Signal Control: An Explicit Multiagent Coordination Method Jun 15, 2023 Reinforcement Learning (RL) Traffic Signal Control
— Unverified 0Predictive Maneuver Planning with Deep Reinforcement Learning (PMP-DRL) for comfortable and safe autonomous driving Jun 15, 2023 Autonomous Driving Deep Reinforcement Learning
— Unverified 0Langevin Thompson Sampling with Logarithmic Communication: Bandits and Reinforcement Learning Jun 15, 2023 Decision Making Multi-Armed Bandits
— Unverified 0Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources Jun 14, 2023 Offline RL reinforcement-learning
— Unverified 0Off-policy Evaluation in Doubly Inhomogeneous Environments Jun 14, 2023 Offline RL Off-policy evaluation
Code Code Available 0A reinforcement learning strategy for p-adaptation in high order solvers Jun 14, 2023 Computational Efficiency reinforcement-learning
— Unverified 0Skill-Critic: Refining Learned Skills for Hierarchical Reinforcement Learning Jun 14, 2023 Autonomous Racing Decision Making
— Unverified 0Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning Jun 14, 2023 Meta Reinforcement Learning Navigate
— Unverified 0Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective Jun 13, 2023 Learning-To-Rank Offline RL
Code Code Available 0Multi-market Energy Optimization with Renewables via Reinforcement Learning Jun 13, 2023 Deep Reinforcement Learning reinforcement-learning
— Unverified 0Can ChatGPT Enable ITS? The Case of Mixed Traffic Control via Reinforcement Learning Jun 13, 2023 General Knowledge Management
Code Code Available 0Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care Jun 13, 2023 Offline RL Q-Learning
— Unverified 0Kernelized Reinforcement Learning with Order Optimal Regret Bounds Jun 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning Jun 13, 2023 D4RL Efficient Exploration
— Unverified 0DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback Jun 13, 2023 Deep Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 0A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning Jun 13, 2023 reinforcement-learning Reinforcement Learning
— Unverified 0Galactic: Scaling End-to-End Reinforcement Learning for Rearrangement at 100k Steps-Per-Second Jun 13, 2023 GPU Reinforcement Learning (RL)
Code Code Available 1Robust Reinforcement Learning through Efficient Adversarial Herding Jun 12, 2023 MuJoCo reinforcement-learning
— Unverified 0Combining Reinforcement Learning and Barrier Functions for Adaptive Risk Management in Portfolio Optimization Jun 12, 2023 Management Portfolio Optimization
— Unverified 0Online Prototype Alignment for Few-shot Policy Transfer Jun 12, 2023 Domain Adaptation Reinforcement Learning (RL)
Code Code Available 0Diverse Projection Ensembles for Distributional Reinforcement Learning Jun 12, 2023 Distributional Reinforcement Learning Diversity
— Unverified 0Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds Jun 12, 2023 Reinforcement Learning (RL)
— Unverified 0Transcendental Idealism of Planner: Evaluating Perception from Planning Perspective for Autonomous Driving Jun 12, 2023 Autonomous Driving Autonomous Vehicles
Code Code Available 1ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles Jun 12, 2023 Offline RL reinforcement-learning
— Unverified 0Policy Regularization with Dataset Constraint for Offline Reinforcement Learning Jun 11, 2023 Offline RL reinforcement-learning
Code Code Available 1Digital Twin-Enhanced Wireless Indoor Navigation: Achieving Efficient Environment Sensing with Zero-Shot Reinforcement Learning Jun 11, 2023 Navigate reinforcement-learning
Code Code Available 1Reinforcement Learning in Robotic Motion Planning by Combined Experience-based Planning and Self-Imitation Learning Jun 11, 2023 Imitation Learning Motion Planning
— Unverified 0PEAR: Primitive enabled Adaptive Relabeling for boosting Hierarchical Reinforcement Learning Jun 10, 2023 Decision Making Hierarchical Reinforcement Learning
— Unverified 0Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel Jun 9, 2023 Decision Making reinforcement-learning
— Unverified 0The Role of Diverse Replay for Generalisation in Reinforcement Learning Jun 9, 2023 Diversity reinforcement-learning
— Unverified 0Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation Jun 9, 2023 Policy Gradient Methods reinforcement-learning
— Unverified 0On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning Jun 9, 2023 Reinforcement Learning (RL) Representation Learning
Code Code Available 1Iteratively Refined Behavior Regularization for Offline Reinforcement Learning Jun 9, 2023 D4RL Offline RL
— Unverified 0Learning Not to Spoof Jun 9, 2023 Reinforcement Learning (RL)
— Unverified 0Approximate information state based convergence analysis of recurrent Q-learning Jun 9, 2023 Q-Learning Reinforcement Learning (RL)
— Unverified 0An End-to-End Reinforcement Learning Approach for Job-Shop Scheduling Problems Based on Constraint Programming Jun 9, 2023 Combinatorial Optimization Feature Engineering
Code Code Available 1Decoupled Prioritized Resampling for Offline RL Jun 8, 2023 Offline RL Reinforcement Learning (RL)
Code Code Available 1Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning Jun 8, 2023 Decision Making Offline RL
— Unverified 0Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL Jun 7, 2023 Data Augmentation Offline RL
Code Code Available 1Timing Process Interventions with Causal Inference and Reinforcement Learning Jun 7, 2023 Causal Inference reinforcement-learning
— Unverified 0Stabilizing Contrastive RL: Techniques for Robotic Goal Reaching from Offline Data Jun 6, 2023 Contrastive Learning Data Augmentation
Code Code Available 1CAVEN: An Embodied Conversational Agent for Efficient Audio-Visual Navigation in Noisy Environments Jun 6, 2023 Hierarchical Reinforcement Learning Navigate
— Unverified 0Value Functions are Control Barrier Functions: Verification of Safe Policies using Control Theory Jun 6, 2023 Diversity Reinforcement Learning (RL)
Code Code Available 1Mildly Constrained Evaluation Policy for Offline Reinforcement Learning Jun 6, 2023 D4RL MuJoCo
Code Code Available 0Model-Based Reinforcement Learning with Multi-Task Offline Pretraining Jun 6, 2023 Knowledge Distillation Model-based Reinforcement Learning
Code Code Available 0Boosting Offline Reinforcement Learning with Action Preference Query Jun 6, 2023 Autonomous Driving D4RL
— Unverified 0PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation Jun 6, 2023 Offline RL Reinforcement Learning (RL)
— Unverified 0RLtools: A Fast, Portable Deep Reinforcement Learning Library for Continuous Control Jun 6, 2023 continuous-control Continuous Control
Code Code Available 2