| Offline Reinforcement Learning with Additional Covering Distributions | May 22, 2023 | Inductive BiasOffline RL | —Unverified | 0 |
| Offline Primal-Dual Reinforcement Learning for Linear MDPs | May 22, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation | May 22, 2023 | Imitation LearningMotion Planning | CodeCode Available | 2 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | May 18, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | May 17, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| SLiC-HF: Sequence Likelihood Calibration with Human Feedback | May 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | May 16, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage | May 16, 2023 | Offline RL | —Unverified | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | May 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Explaining RL Decisions with Trajectories | May 6, 2023 | Attributecontinuous-control | CodeCode Available | 0 |
| Masked Trajectory Models for Prediction, Representation, and Control | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Federated Ensemble-Directed Offline Reinforcement Learning | May 4, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Leveraging Factored Action Spaces for Efficient Offline Reinforcement Learning in Healthcare | May 2, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| What can online reinforcement learning with function approximation benefit from general coverage conditions? | Apr 25, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies | Apr 20, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Using Offline Data to Speed Up Reinforcement Learning in Procedurally Generated Environments | Apr 18, 2023 | Imitation LearningOffline RL | CodeCode Available | 0 |
| Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning | Apr 14, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement Learning | Apr 10, 2023 | D4RLData Augmentation | CodeCode Available | 0 |
| Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Enabling A Network AI Gym for Autonomous Cyber Agents | Apr 3, 2023 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | Mar 31, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions | Mar 30, 2023 | DiversityOffline RL | —Unverified | 0 |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Mar 28, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Optimal Transport for Offline Imitation Learning | Mar 24, 2023 | D4RLDecision Making | CodeCode Available | 1 |