| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RLOffline RL | —Unverified | 0 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Offline Reinforcement Learning for Large Scale Language Action Spaces | Sep 29, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Sep 29, 2021 | Atari GamesDiagnostic | —Unverified | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Targeted Environment Design from Offline Data | Sep 29, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| The Essential Elements of Offline RL via Supervised Learning | Sep 29, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |