| Representation Balancing Offline Model-based Reinforcement Learning | Jan 1, 2021 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL | Dec 25, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems | Jul 18, 2024 | Offline RLRecommendation Systems | CodeCode Available | 0 |
| Learning to Control Autonomous Fleets from Observation via Offline Reinforcement Learning | Feb 28, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Learning from Sparse Offline Datasets via Conservative Density Estimation | Jan 16, 2024 | D4RLDensity Estimation | CodeCode Available | 0 |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Sep 30, 2022 | Data AugmentationImage Generation | CodeCode Available | 0 |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Jul 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning | May 25, 2023 | Distributional Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Latent Safety-Constrained Policy Approach for Safe Offline Reinforcement Learning | Dec 11, 2024 | Autonomous DrivingOffline RL | CodeCode Available | 0 |
| Is Value Functions Estimation with Classification Plug-and-play for Offline Reinforcement Learning? | Jun 10, 2024 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 |
| Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Jun 10, 2025 | Computational EfficiencyD4RL | CodeCode Available | 0 |
| Offline RL With Resource Constrained Online Deployment | Oct 7, 2021 | D4RLOffline RL | CodeCode Available | 0 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Feb 28, 2025 | continuous-controlContinuous Control | CodeCode Available | 0 |
| POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning | Jan 1, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari GamesMamba | CodeCode Available | 0 |
| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| Policy Constraint by Only Support Constraint for Offline Reinforcement Learning | Mar 7, 2025 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Jan 24, 2025 | MuJoCoOffline RL | CodeCode Available | 0 |
| Explaining RL Decisions with Trajectories | May 6, 2023 | Attributecontinuous-control | CodeCode Available | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Aug 15, 2024 | Offline RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Offline Reinforcement Learning from Datasets with Structured Non-Stationarity | May 23, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |