| Contrastive Example-Based Control | Jul 24, 2023 | Offline RL | Code Available | 0 |
| Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Sep 29, 2023 | Offline RL, reinforcement-learning | Code Available | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RL, Decision Making | Code Available | 0 |
| Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs | Jul 15, 2025 | Diversity, MMLU | Code Available | 0 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RL, Offline RL | Code Available | 0 |
| MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Dec 7, 2023 | Offline RL, reinforcement-learning | Code Available | 0 |
| DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning | Jan 17, 2024 | Offline RL, Robot Manipulation | Code Available | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Revisiting Bellman Errors for Offline Model Selection | Jan 31, 2023 | Atari Games, model | Code Available | 0 |
| Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Jun 13, 2023 | Learning-To-Rank, Offline RL | Code Available | 0 |
| Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Aug 22, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 |
| Continual Task Learning through Adaptive Policy Self-Composition | Nov 18, 2024 | Continual Learning, Offline RL | Code Available | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision Making, Offline RL | Code Available | 0 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RL, reinforcement-learning | Code Available | 0 |
| Behavior Prior Representation learning for Offline Reinforcement Learning | Nov 2, 2022 | Offline RL, reinforcement-learning | Code Available | 0 |
| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Apr 6, 2024 | D4RL, Offline RL | Code Available | 0 |
| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RL, Offline RL | Code Available | 0 |
| RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RL, reinforcement-learning | Code Available | 0 |
| RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Jun 24, 2020 | Atari Games, DQN Replay Dataset | Code Available | 0 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational Efficiency, Decision Making | Code Available | 0 |
| The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints | Oct 27, 2018 | 6D Pose Estimation using RGBD, Industrial Robots | Code Available | 0 |
| Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Jun 8, 2024 | Data Augmentation, Mamba | Code Available | 0 |
| TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents | Apr 18, 2024 | energy management, Offline RL | Code Available | 0 |
| Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Oct 28, 2023 | Offline RL, Off-policy evaluation | Code Available | 0 |