| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Multi-Game Decision Transformers | May 30, 2022 | Atari GamesOffline RL | CodeCode Available | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Feb 20, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Oct 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | May 30, 2023 | Imitation LearningOffline RL | CodeCode Available | 0 |
| COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Mar 16, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Mar 5, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning | May 28, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| Solving Offline Reinforcement Learning with Decision Tree Regression | Jan 21, 2024 | D4RLFeature Importance | CodeCode Available | 0 |
| Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Jun 20, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Jun 14, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Beyond Reward: Offline Preference-guided Policy Optimization | May 25, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-based Offline Policy Optimization with Adversarial Network | Sep 5, 2023 | modelOffline RL | CodeCode Available | 0 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | modelOffline RL | CodeCode Available | 0 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RLMuJoCo | CodeCode Available | 0 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |