| When Demonstrations Meet Generative World Models: A Maximum Likelihood Framework for Offline Inverse Reinforcement Learning | Feb 15, 2023 | Autonomous Driving, continuous-control | Code Available | 1 |
| Conservative State Value Estimation for Offline Reinforcement Learning | Feb 14, 2023 | D4RL, reinforcement-learning | Code Available | 0 |
| Skill Decision Transformer | Jan 31, 2023 | D4RL, Descriptive | Code Available | 0 |
| Anti-Exploration by Random Network Distillation | Jan 31, 2023 | D4RL | Code Available | 1 |
| Improving Behavioural Cloning with Positive Unlabeled Learning | Jan 27, 2023 | Behavioural cloning, D4RL | Unverified | 0 |
| Model-based Offline Reinforcement Learning with Local Misspecification | Jan 26, 2023 | D4RL, model | Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RL, Deep Reinforcement Learning | Code Available | 1 |
| Model-based trajectory stitching for improved behavioural cloning and its applications | Dec 8, 2022 | Behavioural cloning, Benchmarking | Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RL, MuJoCo | Code Available | 0 |
| Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery | Dec 2, 2022 | D4RL, reinforcement-learning | Unverified | 0 |