| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | Sep 29, 2021 | Atari Games, Decision Making | Unverified | 0 |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Sep 29, 2021 | Atari Games, Diagnostic | Unverified | 0 |
| Targeted Environment Design from Offline Data | Sep 29, 2021 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| The Essential Elements of Offline RL via Supervised Learning | Sep 29, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| A Workflow for Offline Model-Free Robotic Reinforcement Learning | Sep 22, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | Sep 17, 2021 | Decision Making, Offline RL | Unverified | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement Learning, Offline RL | Code Available | 0 |
| Policy Gradients Incorporating the Future | Aug 4, 2021 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |
| Model Selection for Offline Reinforcement Learning: Practical Considerations for Healthcare Settings | Jul 23, 2021 | Computational Efficiency, Decision Making | Code Available | 1 |
| Offline Preference-Based Apprenticeship Learning | Jul 20, 2021 | Active Learning, Offline RL | Unverified | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage | Jul 13, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RL, Distributional Reinforcement Learning | Code Available | 1 |
| Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | Jul 8, 2021 | Face Detection, Face Recognition | Unverified | 0 |
| Offline Meta-Reinforcement Learning with Online Self-Supervision | Jul 8, 2021 | Meta Reinforcement Learning, Offline RL | Code Available | 1 |
| The Least Restriction for Offline Reinforcement Learning | Jul 5, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning | Jul 3, 2021 | Attribute, Inductive Bias | Code Available | 0 |
| Offline-to-Online Reinforcement Learning via Balanced Replay and Pessimistic Q-Ensemble | Jul 1, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement Learning, Offline RL | Unverified | 0 |
| OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation | Jun 21, 2021 | Offline RL, Reinforcement Learning (RL) | Code Available | 1 |
| Boosting Offline Reinforcement Learning with Residual Generative Modeling | Jun 19, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| Offline RL Without Off-Policy Evaluation | Jun 16, 2021 | D4RL, Offline RL | Code Available | 1 |
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | Jun 16, 2021 | D4RL, Domain Generalization | Unverified | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement Learning, Mixture-of-Experts | Unverified | 0 |
| Reinforcement Learning as One Big Sequence Modeling Problem | Jun 13, 2021 | Imitation Learning, Offline RL | Code Available | 1 |
| A Minimalist Approach to Offline Reinforcement Learning | Jun 12, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Corruption-Robust Offline Reinforcement Learning | Jun 11, 2021 | Adversarial Robustness, Offline RL | Unverified | 0 |
| Offline Reinforcement Learning as Anti-Exploration | Jun 11, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | Jun 9, 2021 | Offline RL, Open-Ended Question Answering | Unverified | 0 |
| Offline Inverse Reinforcement Learning | Jun 9, 2021 | Data Augmentation, Imitation Learning | Unverified | 0 |
| Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Jun 7, 2021 | Multi-agent Reinforcement Learning, Offline RL | Code Available | 1 |
| Online reinforcement learning with sparse rewards through an active inference capsule | Jun 4, 2021 | Offline RL, reinforcement-learning | Code Available | 1 |
| Offline Reinforcement Learning as One Big Sequence Modeling Problem | Jun 3, 2021 | Imitation Learning, Offline RL | Code Available | 1 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari Games, D4RL | Code Available | 1 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | Jun 1, 2021 | Offline RL, Recommendation Systems | Unverified | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian Optimization, Model-based Reinforcement Learning | Unverified | 0 |
| Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | May 17, 2021 | Offline RL, Q-Learning | Code Available | 1 |
| Model-Based Offline Planning with Trajectory Pruning | May 16, 2021 | model, Offline RL | Code Available | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | May 13, 2021 | Offline RL | Unverified | 0 |
| Interpretable performance analysis towards offline reinforcement learning: A dataset perspective | May 12, 2021 | Offline RL, Q-Learning | Unverified | 0 |
| InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem | May 2, 2021 | Atari Games, Offline RL | Unverified | 0 |
| Online and Offline Reinforcement Learning by Planning with a Learned Model | Apr 13, 2021 | Atari Games, Continuous Control | Code Available | 1 |
| Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | Mar 22, 2021 | Imitation Learning, Multi-Armed Bandits | Unverified | 0 |
| Regularized Behavior Value Estimation | Mar 17, 2021 | Offline RL | Unverified | 0 |
| Offline Reinforcement Learning with Fisher Divergence Critic Regularization | Mar 14, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks | Mar 11, 2021 | Offline RL, reinforcement-learning | Unverified | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous Driving, D4RL | Unverified | 0 |
| Instabilities of Offline RL with Pre-Trained Neural Representation | Mar 8, 2021 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 |