| Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Mar 5, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | Feb 23, 2021 | Continuous ControlOffline RL | —Unverified | 0 |
| Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning | Feb 22, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning | Feb 19, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| COMBO: Conservative Offline Model-Based Policy Optimization | Feb 16, 2021 | modelOffline RL | CodeCode Available | 1 |
| PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators | Feb 13, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Feb 11, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Near-Optimal Offline Reinforcement Learning via Double Variance Reduction | Feb 2, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning | Feb 1, 2021 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Representation Balancing Offline Model-based Reinforcement Learning | Jan 1, 2021 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Uncertainty Weighted Offline Reinforcement Learning | Jan 1, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |
| Robust Offline Reinforcement Learning from Low-Quality Data | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Offline Policy Optimization with Variance Regularization | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Addressing Extrapolation Error in Deep Offline Reinforcement Learning | Jan 1, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Is Pessimism Provably Efficient for Offline RL? | Dec 30, 2020 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| POPO: Pessimistic Offline Policy Optimization | Dec 26, 2020 | Offline RLQ-Learning | CodeCode Available | 0 |
| Offline Reinforcement Learning from Images with Latent Space Models | Dec 21, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 |
| MOReL: Model-Based Offline Reinforcement Learning | Dec 1, 2020 | modelOffline RL | —Unverified | 0 |
| RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Dec 1, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Offline Reinforcement Learning Hands-On | Nov 29, 2020 | Behavioural cloningDecision Making | —Unverified | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation LearningImitation Learning | —Unverified | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision MakingOffline RL | —Unverified | 0 |
| Batch Exploration with Examples for Scalable Robotic Reinforcement Learning | Oct 22, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Oct 18, 2020 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Learning Dexterous Manipulation from Suboptimal Experts | Oct 16, 2020 | Offline RLQ-Learning | —Unverified | 0 |
| Human-centric Dialog Training via Offline Reinforcement Learning | Oct 12, 2020 | Language ModellingOffline RL | —Unverified | 0 |
| FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization | Oct 2, 2020 | Meta Reinforcement LearningMetric Learning | CodeCode Available | 1 |
| Rethinking Attention with Performers | Sep 30, 2020 | D4RLImage Generation | CodeCode Available | 2 |
| The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | Aug 16, 2020 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Offline Meta-Reinforcement Learning with Advantage Weighting | Aug 13, 2020 | Machine TranslationMeta-Learning | CodeCode Available | 1 |
| Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | Aug 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Model-Based Offline Planning | Aug 12, 2020 | modelOffline RL | —Unverified | 0 |
| EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | Jul 21, 2020 | D4RLDecision Making | —Unverified | 0 |
| Hyperparameter Selection for Offline Reinforcement Learning | Jul 17, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |
| Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | Jul 7, 2020 | Offline RLreinforcement-learning | —Unverified | 0 |
| Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention | Jun 29, 2020 | D4RLLanguage Modelling | CodeCode Available | 1 |
| Critic Regularized Regression | Jun 26, 2020 | Offline RLregression | CodeCode Available | 1 |
| RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Jun 24, 2020 | Atari GamesDQN Replay Dataset | CodeCode Available | 0 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Jun 5, 2020 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 1 |
| MOPO: Model-based Offline Policy Optimization | May 27, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| MOReL : Model-Based Offline Reinforcement Learning | May 12, 2020 | modelOffline RL | CodeCode Available | 1 |
| D4RL: Datasets for Deep Data-Driven Reinforcement Learning | Apr 15, 2020 | D4RLOffline RL | CodeCode Available | 2 |
| Reformer: The Efficient Transformer | Jan 13, 2020 | D4RLImage Generation | CodeCode Available | 2 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |