| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision Making, Offline RL | Unverified | 0 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RL, reinforcement-learning | Code Available | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RL, Q-Learning | Unverified | 0 |
| Human-centric Dialog Training via Offline Reinforcement Learning | Oct 12, 2020 | Language Modelling, Offline RL | Unverified | 0 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement Learning, Offline RL | Unverified | 0 |
| Model-Based Offline Planning | Aug 12, 2020 | model, Offline RL | Unverified | 0 |
| Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | Aug 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RL, Decision Making | Unverified | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RL, reinforcement-learning | Unverified | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RL, reinforcement-learning | Unverified | 0 |