| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Feb 2, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RL, MuJoCo | Unverified | 0 |
| Offline Policy Optimization with Variance Regularization | Jan 1, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Robust Offline Reinforcement Learning from Low-Quality Data | Jan 1, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Representation Balancing Offline Model-based Reinforcement Learning | Jan 1, 2021 | model, Model-based Reinforcement Learning | Unverified | 0 |
| Addressing Extrapolation Error in Deep Offline Reinforcement Learning | Jan 1, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | Jan 1, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| Is Pessimism Provably Efficient for Offline RL? | Dec 30, 2020 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RL, Q-Learning | Code Available | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement Learning, Distributional Reinforcement Learning | Unverified | 0 |
| RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RL, reinforcement-learning | Code Available | 0 |
| MOReL: Model-Based Offline Reinforcement Learning | Dec 1, 2020 | model, Offline RL | Unverified | 0 |
| Offline Reinforcement Learning Hands-On | Nov 29, 2020 | Behavioural cloning, Decision Making | Unverified | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation Learning, Imitation Learning | Unverified | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision Making, Offline RL | Unverified | 0 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RL, reinforcement-learning | Code Available | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | Unverified | 0 |
| Human-centric Dialog Training via Offline Reinforcement Learning | Oct 12, 2020 | Language Modelling, Offline RL | Unverified | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement Learning, Offline RL | Unverified | 0 |
| Model-Based Offline Planning | Aug 12, 2020 | model, Offline RL | Unverified | 0 |
| Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | Aug 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RL, Decision Making | Unverified | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RL, reinforcement-learning | Unverified | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RL, reinforcement-learning | Unverified | 0 |