| Efficient Planning in a Compact Latent Action Space | Aug 22, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Direct Preference-based Policy Optimization without Reward Modeling | Jan 30, 2023 | Contrastive LearningOffline RL | CodeCode Available | 1 |
| OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Jun 21, 2021 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Jul 20, 2022 | Imitation LearningOffline RL | CodeCode Available | 1 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | May 23, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning | Feb 23, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors | Sep 26, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Guiding Online Reinforcement Learning with Action-Free Offline Pretraining | Jan 30, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |