| Value Penalized Q-Learning for Recommender Systems | Oct 15, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Safe Driving via Expert Guided Policy Optimization | Oct 13, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Planning from Pixels in Environments with Combinatorially Hard Search Spaces | Oct 12, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Offline Reinforcement Learning with Implicit Q-Learning | Oct 12, 2021 | D4RL, Offline RL | Code Available | 1 |
| Beyond Pick-and-Place: Tackling Robotic Stacking of Diverse Shapes | Oct 12, 2021 | Offline RL, Reinforcement Learning (RL) | Code Available | 1 |
| StARformer: Transformer with State-Action-Reward Representations for Visual Reinforcement Learning | Oct 12, 2021 | Imitation Learning, Inductive Bias | Code Available | 1 |
| Representation Learning for Online and Offline RL in Low-rank MDPs | Oct 9, 2021 | Offline RL, Representation Learning | Unverified | 0 |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Oct 8, 2021 | Decision Making, energy management | Unverified | 0 |
| Offline RL With Resource Constrained Online Deployment | Oct 7, 2021 | D4RL, Offline RL | Code Available | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | Oct 5, 2021 | D4RL, Offline RL | Unverified | 0 |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroid door-cloned, Adroid door-human | Code Available | 1 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Oct 2, 2021 | Offline RL, reinforcement-learning | Code Available | 0 |
| Offline Reinforcement Learning with Reverse Model-based Imagination | Oct 1, 2021 | Data Augmentation, model | Code Available | 1 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Particle Based Stochastic Policy Optimization | Sep 29, 2021 | Deep Reinforcement Learning, MuJoCo Games | Unverified | 0 |
| Variational oracle guiding for reinforcement learning | Sep 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 |
| Offline Reinforcement Learning with Resource Constrained Online Deployment | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 |
| Offline Reinforcement Learning with In-sample Q-Learning | Sep 29, 2021 | D4RL, Offline RL | Code Available | 1 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | Sep 29, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Offline Reinforcement Learning for Large Scale Language Action Spaces | Sep 29, 2021 | Language Modeling, Language Modelling | Unverified | 0 |
| Pareto Policy Pool for Model-based Offline Reinforcement Learning | Sep 29, 2021 | D4RL, Offline RL | Unverified | 0 |
| Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | Sep 29, 2021 | Offline RL, Recommendation Systems | Unverified | 0 |
| Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | Sep 29, 2021 | Multi-Task Learning, Offline RL | Unverified | 0 |