| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Implicit Behavioral Cloning | Sep 1, 2021 | D4RL | CodeCode Available | 1 |
| A Pragmatic Look at Deep Imitation Learning | Aug 4, 2021 | Behavioural cloningD4RL | —Unverified | 0 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 |
| Offline RL Without Off-Policy Evaluation | Jun 16, 2021 | D4RLOffline RL | CodeCode Available | 1 |
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RLDomain Generalization | —Unverified | 0 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 |
| Reducing Conservativeness Oriented Offline Reinforcement Learning | Feb 27, 2021 | D4RLreinforcement-learning | —Unverified | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |