| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Oct 27, 2024 | D4RLQ-Learning | CodeCode Available | 0 | 5 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RLMuJoCo | CodeCode Available | 0 | 5 |
| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 | 5 |
| Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning | Dec 4, 2024 | D4RLImitation Learning | CodeCode Available | 0 | 5 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 | 5 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RLDecision Making | CodeCode Available | 0 | 5 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RLOffline RL | —Unverified | 0 | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | AttributeD4RL | —Unverified | 0 | 0 |
| Accelerating Residual Reinforcement Learning with Uncertainty Estimation | Jun 21, 2025 | D4RLreinforcement-learning | —Unverified | 0 | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RLQ-Learning | —Unverified | 0 | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 | 0 |
| Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning | Jul 21, 2022 | Autonomous DrivingD4RL | —Unverified | 0 | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RLOffline RL | —Unverified | 0 | 0 |
| An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning | Apr 17, 2025 | D4RLreinforcement-learning | —Unverified | 0 | 0 |