| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Multi-Game Decision Transformers | May 30, 2022 | Atari GamesOffline RL | CodeCode Available | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Oct 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | May 30, 2023 | Imitation LearningOffline RL | CodeCode Available | 0 |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Mar 5, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning | May 28, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Jan 21, 2024 | D4RLFeature Importance | CodeCode Available | 0 |
| Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Jun 20, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Jun 14, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Beyond Reward: Offline Preference-guided Policy Optimization | May 25, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-based Offline Policy Optimization with Adversarial Network | Sep 5, 2023 | modelOffline RL | CodeCode Available | 0 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | modelOffline RL | CodeCode Available | 0 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RLMuJoCo | CodeCode Available | 0 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |
| Contrastive Example-Based Control | Jul 24, 2023 | Offline RL | CodeCode Available | 0 |
| Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Sep 29, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RLDecision Making | CodeCode Available | 0 |
| Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs | Jul 15, 2025 | DiversityMMLU | CodeCode Available | 0 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Dec 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning | Jan 17, 2024 | Offline RLRobot Manipulation | CodeCode Available | 0 |
| MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Mar 30, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 |
| Revisiting Bellman Errors for Offline Model Selection | Jan 31, 2023 | Atari Gamesmodel | CodeCode Available | 0 |
| Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Jun 13, 2023 | Learning-To-RankOffline RL | CodeCode Available | 0 |
| Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Aug 22, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Continual Task Learning through Adaptive Policy Self-Composition | Nov 18, 2024 | Continual LearningOffline RL | CodeCode Available | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Behavior Prior Representation learning for Offline Reinforcement Learning | Nov 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Apr 6, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Jun 24, 2020 | Atari GamesDQN Replay Dataset | CodeCode Available | 0 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints | Oct 27, 2018 | 6D Pose Estimation using RGBDIndustrial Robots | CodeCode Available | 0 |
| Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Jun 8, 2024 | Data AugmentationMamba | CodeCode Available | 0 |
| TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents | Apr 18, 2024 | energy managementOffline RL | CodeCode Available | 0 |
| Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Oct 28, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 |