| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient ExplorationOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Sep 30, 2022 | Data AugmentationImage Generation | CodeCode Available | 0 |
| Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | Sep 18, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | Sep 14, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Task-Agnostic Learning to Accomplish New Tasks | Sep 9, 2022 | Imitation LearningOffline RL | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |