| Offline Reinforcement Learning with Additional Covering Distributions | May 22, 2023 | Inductive BiasOffline RL | —Unverified | 0 |
| Offline Primal-Dual Reinforcement Learning for Linear MDPs | May 22, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| FurnitureBench: Reproducible Real-World Benchmark for Long-Horizon Complex Manipulation | May 22, 2023 | Imitation LearningMotion Planning | CodeCode Available | 2 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | May 18, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | May 17, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| SLiC-HF: Sequence Likelihood Calibration with Human Feedback | May 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | May 16, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage | May 16, 2023 | Offline RL | —Unverified | 0 |
| Towards Generalizable Reinforcement Learning for Trade Execution | May 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Explaining RL Decisions with Trajectories | May 6, 2023 | Attributecontinuous-control | CodeCode Available | 0 |