| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | Jun 9, 2021 | Offline RLOpen-Ended Question Answering | —Unverified | 0 |
| Offline Inverse Reinforcement Learning | Jun 9, 2021 | Data AugmentationImitation Learning | —Unverified | 0 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement LearningOffline RL | CodeCode Available | 1 |
| Online reinforcement learning with sparse rewards through an active inference capsule | Jun 4, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Offline Reinforcement Learning as One Big Sequence Modeling Problem | Jun 3, 2021 | Imitation LearningOffline RL | CodeCode Available | 1 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | May 17, 2021 | Offline RLQ-Learning | CodeCode Available | 1 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | modelOffline RL | CodeCode Available | 0 |