| Q-value Regularized Transformer for Offline Reinforcement Learning | May 27, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Exploration and Anti-Exploration with Distributional Random Network Distillation | Jan 18, 2024 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Train Once, Get a Family: State-Adaptive Balances for Offline-to-Online Reinforcement Learning | Oct 27, 2023 | D4RLReinforcement Learning (RL) | CodeCode Available | 1 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Score Regularized Policy Optimization through Diffusion Behavior | Oct 11, 2023 | D4RL | CodeCode Available | 1 |