| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RLDomain Generalization | —Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 |
| Offline Reinforcement Learning as Anti-Exploration | Jun 11, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Corruption-Robust Offline Reinforcement Learning | Jun 11, 2021 | Adversarial RobustnessOffline RL | —Unverified | 0 |
| Offline Inverse Reinforcement Learning | Jun 9, 2021 | Data AugmentationImitation Learning | —Unverified | 0 |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | Jun 9, 2021 | Offline RLOpen-Ended Question Answering | —Unverified | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | modelOffline RL | CodeCode Available | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | May 13, 2021 | Offline RL | —Unverified | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem | May 2, 2021 | Atari GamesOffline RL | —Unverified | 0 |
| Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | Mar 22, 2021 | Imitation LearningMulti-Armed Bandits | —Unverified | 0 |
| Regularized Behavior Value Estimation | Mar 17, 2021 | Offline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Fisher Divergence Critic Regularization | Mar 14, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks | Mar 11, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 |
| Instabilities of Offline RL with Pre-Trained Neural Representation | Mar 8, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Mar 5, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | Feb 23, 2021 | Continuous ControlOffline RL | —Unverified | 0 |
| Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning | Feb 22, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning | Feb 19, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators | Feb 13, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Feb 11, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Feb 2, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Offline Policy Optimization with Variance Regularization | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Robust Offline Reinforcement Learning from Low-Quality Data | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Representation Balancing Offline Model-based Reinforcement Learning | Jan 1, 2021 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Addressing Extrapolation Error in Deep Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Is Pessimism Provably Efficient for Offline RL? | Dec 30, 2020 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RLQ-Learning | CodeCode Available | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| MOReL: Model-Based Offline Reinforcement Learning | Dec 1, 2020 | modelOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning Hands-On | Nov 29, 2020 | Behavioural cloningDecision Making | —Unverified | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation LearningImitation Learning | —Unverified | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision MakingOffline RL | —Unverified | 0 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RLQ-Learning | —Unverified | 0 |
| Human-centric Dialog Training via Offline Reinforcement Learning | Oct 12, 2020 | Language ModellingOffline RL | —Unverified | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Model-Based Offline Planning | Aug 12, 2020 | modelOffline RL | —Unverified | 0 |
| Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | Aug 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RLDecision Making | —Unverified | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |