| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RLDomain Generalization | —Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Offline Reinforcement Learning as Anti-Exploration | Jun 11, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Corruption-Robust Offline Reinforcement Learning | Jun 11, 2021 | Adversarial RobustnessOffline RL | —Unverified | 0 |
| Offline Inverse Reinforcement Learning | Jun 9, 2021 | Data AugmentationImitation Learning | —Unverified | 0 |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | Jun 9, 2021 | Offline RLOpen-Ended Question Answering | —Unverified | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | modelOffline RL | CodeCode Available | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | May 13, 2021 | Offline RL | —Unverified | 0 |