| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RL, Distributional Reinforcement Learning | Code Available | 1 |
| Offline Meta-Reinforcement Learning with Online Self-Supervision | Jul 8, 2021 | Meta Reinforcement Learning, Offline RL | Code Available | 1 |
| Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble | Jul 1, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Jun 21, 2021 | Offline RL, Reinforcement Learning (RL) | Code Available | 1 |
| Offline RL Without Off-Policy Evaluation | Jun 16, 2021 | D4RL, Offline RL | Code Available | 1 |
| Reinforcement Learning as One Big Sequence Modeling Problem | Jun 13, 2021 | Imitation Learning, Offline RL | Code Available | 1 |
| A Minimalist Approach to Offline Reinforcement Learning | Jun 12, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement Learning, Offline RL | Code Available | 1 |
| Online reinforcement learning with sparse rewards through an active inference capsule | Jun 4, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Offline Reinforcement Learning as One Big Sequence Modeling Problem | Jun 3, 2021 | Imitation Learning, Offline RL | Code Available | 1 |