| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 |
| Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Jun 13, 2023 | Learning-To-RankOffline RL | CodeCode Available | 0 |
| Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care | Jun 13, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | Jun 13, 2023 | D4RLEfficient Exploration | —Unverified | 0 |
| ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Jun 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Policy Regularization with Dataset Constraint for Offline Reinforcement Learning | Jun 11, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Iteratively Refined Behavior Regularization for Offline Reinforcement Learning | Jun 9, 2023 | D4RLOffline RL | —Unverified | 0 |
| Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning | Jun 8, 2023 | Decision MakingOffline RL | —Unverified | 0 |
| Decoupled Prioritized Resampling for Offline RL | Jun 8, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL | Jun 7, 2023 | Data AugmentationOffline RL | CodeCode Available | 1 |