| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | Oct 9, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | Oct 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL | Oct 6, 2023 | AttributeOffline RL | CodeCode Available | 1 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | Oct 6, 2023 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | Oct 2, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |