| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Apr 6, 2024 | D4RL, Offline RL | Code Available | 0 | 5 |
| Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RL | Dec 25, 2024 | Offline RL, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Jul 23, 2023 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RL, Off-policy evaluation | Code Available | 0 | 5 |
| Offline RL With Resource Constrained Online Deployment | Oct 7, 2021 | D4RL, Offline RL | Code Available | 0 | 5 |
| Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Jun 10, 2025 | Computational Efficiency, D4RL | Code Available | 0 | 5 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Feb 11, 2025 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Mar 2, 2022 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| DR-SAC: Distributionally Robust Soft Actor-Critic for Reinforcement Learning under Uncertainty | Jun 14, 2025 | continuous-control, Continuous Control | Code Available | 0 | 5 |