| Offline RL Without Off-Policy Evaluation | Jun 16, 2021 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 | 5 |
| Exploration and Anti-Exploration with Distributional Random Network Distillation | Jan 18, 2024 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Offline Reinforcement Learning with Value-based Episodic Memory | Oct 19, 2021 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Dec 7, 2024 | D4RLmodel | CodeCode Available | 1 | 5 |
| In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | May 31, 2024 | D4RLDecision Making | CodeCode Available | 1 | 5 |
| Mildly Conservative Q-Learning for Offline Reinforcement Learning | Jun 9, 2022 | D4RLQ-Learning | CodeCode Available | 1 | 5 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning | May 23, 2022 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Improving and Benchmarking Offline Reinforcement Learning Algorithms | Jun 1, 2023 | AttributeBenchmarking | CodeCode Available | 1 | 5 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 | 5 |
| Are Expressive Models Truly Necessary for Offline RL? | Dec 15, 2024 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Katakomba: Tools and Benchmarks for Data-Driven NetHack | Jun 14, 2023 | D4RLNetHack | CodeCode Available | 1 | 5 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 | 5 |
| Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning | May 31, 2024 | D4RLReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning | May 30, 2024 | D4RLreinforcement-learning | CodeCode Available | 1 | 5 |
| Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning | Apr 25, 2023 | D4RLImage Generation | CodeCode Available | 1 | 5 |
| Anti-Exploration by Random Network Distillation | Jan 31, 2023 | D4RL | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| Critic-Guided Decision Transformer for Offline Reinforcement Learning | Dec 21, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Curricular Subgoals for Inverse Reinforcement Learning | Jun 14, 2023 | Autonomous DrivingD4RL | CodeCode Available | 1 | 5 |
| Behavior Proximal Policy Optimization | Feb 22, 2023 | D4RLOffline RL | CodeCode Available | 1 | 5 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 | 5 |
| Implicit Behavioral Cloning | Sep 1, 2021 | D4RL | CodeCode Available | 1 | 5 |