| The Challenges of Exploration for Offline Reinforcement Learning | Jan 27, 2022 | Model Predictive ControlOffline RL | —Unverified | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | modelMuJoCo | —Unverified | 0 |
| Offline Reinforcement Learning for Road Traffic Control | Jan 7, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | Dec 31, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Single-Shot Pruning for Offline Reinforcement Learning | Dec 31, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| A Validation Tool for Designing Reinforcement Learning Environments | Dec 10, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | Dec 9, 2021 | Atari GamesD4RL | —Unverified | 0 |
| Curriculum Offline Imitating Learning | Dec 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Nov 29, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | Nov 29, 2021 | Contrastive LearningDecision Making | —Unverified | 0 |
| UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | Nov 22, 2021 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | Nov 21, 2021 | Decision MakingOffline RL | —Unverified | 0 |
| A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | Nov 18, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Nov 6, 2021 | D4RLDeep Reinforcement Learning | CodeCode Available | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | Oct 17, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Value Penalized Q-Learning for Recommender Systems | Oct 15, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Representation Learning for Online and Offline RL in Low-rank MDPs | Oct 9, 2021 | Offline RLRepresentation Learning | —Unverified | 0 |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Oct 8, 2021 | Decision Makingenergy management | —Unverified | 0 |
| Offline RL With Resource Constrained Online Deployment | Oct 7, 2021 | D4RLOffline RL | CodeCode Available | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RLOffline RL | —Unverified | 0 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Offline Reinforcement Learning for Large Scale Language Action Spaces | Sep 29, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Sep 29, 2021 | Atari GamesDiagnostic | —Unverified | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RLRecommendation Systems | —Unverified | 0 |
| Targeted Environment Design from Offline Data | Sep 29, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| The Essential Elements of Offline RL via Supervised Learning | Sep 29, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| Particle Based Stochastic Policy Optimization | Sep 29, 2021 | Deep Reinforcement LearningMuJoCo Games | —Unverified | 0 |
| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | Sep 29, 2021 | Multi-Task LearningOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Resource Constrained Online Deployment | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision MakingOffline RL | —Unverified | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Policy Gradients Incorporating the Future | Aug 4, 2021 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Offline Preference-Based Apprenticeship Learning | Jul 20, 2021 | Active LearningOffline RL | —Unverified | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage | Jul 13, 2021 | Offline RLreinforcement-learning | —Unverified | 0 |
| Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | Jul 8, 2021 | Face DetectionFace Recognition | —Unverified | 0 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RLQ-Learning | —Unverified | 0 |
| Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning | Jul 3, 2021 | AttributeInductive Bias | CodeCode Available | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RLQ-Learning | —Unverified | 0 |