| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Sep 15, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Efficient Planning in a Compact Latent Action Space | Aug 22, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Aug 3, 2022 | Density EstimationOffline RL | CodeCode Available | 1 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Jul 20, 2022 | Imitation LearningOffline RL | CodeCode Available | 1 |
| When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning | Jun 27, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Behavior Transformers: Cloning k modes with one stone | Jun 22, 2022 | Object DetectionOffline RL | CodeCode Available | 1 |
| Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning | Jun 9, 2022 | D4RLModel-based Reinforcement Learning | CodeCode Available | 1 |
| RORL: Robust Offline Reinforcement Learning via Conservative Smoothing | Jun 6, 2022 | Decision MakingOffline RL | CodeCode Available | 1 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | May 23, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning | Apr 26, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |