| Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning | Oct 25, 2022 | D4RL, Offline RL | Code Available | 1 |
| MoCoDA: Model-based Counterfactual Data Augmentation | Oct 20, 2022 | counterfactual, Data Augmentation | Code Available | 1 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RL, Offline RL | Code Available | 1 |
| Efficient Offline Policy Optimization with a Learned Model | Oct 12, 2022 | Offline RL | Code Available | 1 |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Oct 12, 2022 | D4RL, Offline RL | Code Available | 1 |
| Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning | Oct 11, 2022 | Offline RL, reinforcement-learning | Code Available | 1 |
| Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials | Oct 11, 2022 | Offline RL, Q-Learning | Code Available | 1 |
| BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous Driving, Backdoor Attack | Code Available | 1 |
| VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training | Sep 30, 2022 | Offline RL, Open-Ended Question Answering | Code Available | 1 |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Sep 29, 2022 | Computational Efficiency, D4RL | Code Available | 1 |