| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RL, Offline RL | Code Available | 0 | 5 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RL, Offline RL | Code Available | 0 | 5 |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Oct 27, 2024 | D4RL, Q-Learning | Code Available | 0 | 5 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RL, Offline RL | Code Available | 0 | 5 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RL, MuJoCo | Code Available | 0 | 5 |
| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RL, Offline RL | Code Available | 0 | 5 |
| Skill Decision Transformer | Jan 31, 2023 | D4RL, Descriptive | Code Available | 0 | 5 |
| Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning | Dec 4, 2024 | D4RL, Imitation Learning | Code Available | 0 | 5 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RL, Offline RL | Code Available | 0 | 5 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RL, Decision Making | Code Available | 0 | 5 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | May 19, 2025 | D4RL, Model-based Reinforcement Learning | Unverified | 0 | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RL, Offline RL | Unverified | 0 | 0 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RL, Offline RL | Unverified | 0 | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RL, Offline RL | Unverified | 0 | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RL, Offline RL | Unverified | 0 | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | Attribute, D4RL | Unverified | 0 | 0 |
| Accelerating Residual Reinforcement Learning with Uncertainty Estimation | Jun 21, 2025 | D4RL, reinforcement-learning | Unverified | 0 | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RL, Q-Learning | Unverified | 0 | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RL, MuJoCo | Unverified | 0 | 0 |
| Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Jul 21, 2022 | Autonomous Driving, D4RL | Unverified | 0 | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RL, Decision Making | Unverified | 0 | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RL, Offline RL | Unverified | 0 | 0 |
| An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning | Apr 17, 2025 | D4RL, reinforcement-learning | Unverified | 0 | 0 |
| A Behavior Regularized Implicit Policy for Offline Reinforcement Learning | Feb 19, 2022 | D4RL, reinforcement-learning | Unverified | 0 | 0 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | Jun 13, 2023 | D4RL, Efficient Exploration | Unverified | 0 | 0 |
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RL, Domain Generalization | Unverified | 0 | 0 |
| Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Feb 7, 2025 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Improving Behavioural Cloning with Positive Unlabeled Learning | Jan 27, 2023 | Behavioural cloning, D4RL | Unverified | 0 | 0 |
| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RL, Offline RL | Unverified | 0 | 0 |
| Boosting Offline Reinforcement Learning with Action Preference Query | Jun 6, 2023 | Autonomous Driving, D4RL | Unverified | 0 | 0 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactual, Counterfactual Reasoning | Unverified | 0 | 0 |
| CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Jun 11, 2024 | D4RL, Denoising | Unverified | 0 | 0 |
| CEIL: Generalized Contextual Imitation Learning | Jun 26, 2023 | D4RL, Imitation Learning | Unverified | 0 | 0 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RL, Decision Making | Unverified | 0 | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RL, Meta Reinforcement Learning | Unverified | 0 | 0 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive Learning, D4RL | Unverified | 0 | 0 |
| DCE: Offline Reinforcement Learning With Double Conservative Estimates | Sep 27, 2022 | Computational Efficiency, D4RL | Unverified | 0 | 0 |
| Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling | May 31, 2024 | D4RL, Mamba | Unverified | 0 | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RL, Decision Making | Unverified | 0 | 0 |
| Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning | Feb 5, 2024 | D4RL, Model-based Reinforcement Learning | Unverified | 0 | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RL, Offline RL | Unverified | 0 | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RL, Data Augmentation | Unverified | 0 | 0 |
| DiffuserLite: Towards Real-time Diffusion Planning | Jan 27, 2024 | D4RL, Decision Making | Unverified | 0 | 0 |
| Diffusion Model Predictive Control | Oct 7, 2024 | D4RL, model | Unverified | 0 | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Jul 10, 2023 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning | Feb 5, 2024 | D4RL, Q-Learning | Unverified | 0 | 0 |
| Augmenting Offline Reinforcement Learning with State-only Interactions | Feb 1, 2024 | D4RL, Data Augmentation | Unverified | 0 | 0 |
| Offline Diversity Maximization Under Imitation Constraints | Jul 21, 2023 | D4RL, Diversity | Unverified | 0 | 0 |