| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Model-Bellman Inconsistency for Model-based Offline Reinforcement Learning | Jul 1, 2023 | D4RLmodel | CodeCode Available | 1 |
| Contrastive Energy Prediction for Exact Energy-Guided Diffusion Sampling in Offline Reinforcement Learning | Apr 25, 2023 | D4RLImage Generation | CodeCode Available | 1 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 |
| Anti-Exploration by Random Network Distillation | Jan 31, 2023 | D4RL | CodeCode Available | 1 |
| M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Dec 7, 2024 | D4RLmodel | CodeCode Available | 1 |
| Katakomba: Tools and Benchmarks for Data-Driven NetHack | Jun 14, 2023 | D4RLNetHack | CodeCode Available | 1 |
| Improving and Benchmarking Offline Reinforcement Learning Algorithms | Jun 1, 2023 | AttributeBenchmarking | CodeCode Available | 1 |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Sep 29, 2022 | Computational EfficiencyD4RL | CodeCode Available | 1 |
| Are Expressive Models Truly Necessary for Offline RL? | Dec 15, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Mar 28, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 |
| Revisiting the Minimalist Approach to Offline Reinforcement Learning | May 16, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RLDenoising | CodeCode Available | 1 |
| When should we prefer Decision Transformers for Offline Reinforcement Learning? | May 23, 2023 | D4RLImitation Learning | CodeCode Available | 1 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Implicit Behavioral Cloning | Sep 1, 2021 | D4RL | CodeCode Available | 1 |
| Diffusion Model Predictive Control | Oct 7, 2024 | D4RLmodel | —Unverified | 0 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| DiffuserLite: Towards Real-time Diffusion Planning | Jan 27, 2024 | D4RLDecision Making | —Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Boosting Offline Reinforcement Learning with Action Preference Query | Jun 6, 2023 | Autonomous DrivingD4RL | —Unverified | 0 |
| An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning | Apr 17, 2025 | D4RLreinforcement-learning | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning | Feb 5, 2024 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RLOffline RL | —Unverified | 0 |
| Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network | Feb 1, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RLDecision Making | —Unverified | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |
| Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling | May 31, 2024 | D4RLMamba | —Unverified | 0 |
| DCE: Offline Reinforcement Learning With Double Conservative Estimates | Sep 27, 2022 | Computational EfficiencyD4RL | —Unverified | 0 |
| Improving Behavioural Cloning with Positive Unlabeled Learning | Jan 27, 2023 | Behavioural cloningD4RL | —Unverified | 0 |
| Accelerating Residual Reinforcement Learning with Uncertainty Estimation | Jun 21, 2025 | D4RLreinforcement-learning | —Unverified | 0 |
| HIPODE: Enhancing Offline Reinforcement Learning with High-Quality Synthetic Data from a Policy-Decoupled Approach | Jun 10, 2023 | D4RLData Augmentation | —Unverified | 0 |
| Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Feb 7, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Oct 27, 2023 | Autonomous DrivingD4RL | —Unverified | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 |
| Learning Computational Efficient Bots with Costly Features | Aug 18, 2023 | Computational EfficiencyD4RL | —Unverified | 0 |
| Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models | May 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RLDomain Generalization | —Unverified | 0 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RLOffline RL | —Unverified | 0 |
| Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | May 30, 2024 | D4RLDecision Making | —Unverified | 0 |
| A Behavior Regularized Implicit Policy for Offline Reinforcement Learning | Feb 19, 2022 | D4RLreinforcement-learning | —Unverified | 0 |
| Hierarchical Decision Transformer | Sep 21, 2022 | D4RLreinforcement-learning | —Unverified | 0 |
| Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RLDecision Making | —Unverified | 0 |
| Flow to Control: Offline Reinforcement Learning with Lossless Primitive Discovery | Dec 2, 2022 | D4RLreinforcement-learning | —Unverified | 0 |