| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Oct 16, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Oct 12, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Efficient Offline Policy Optimization with a Learned Model | Oct 12, 2022 | Offline RL | CodeCode Available | 1 |
| Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials | Oct 11, 2022 | Offline RLQ-Learning | CodeCode Available | 1 |
| Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning | Oct 11, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient ExplorationOffline RL | —Unverified | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous DrivingBackdoor Attack | CodeCode Available | 1 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Sep 30, 2022 | Data AugmentationImage Generation | CodeCode Available | 0 |
| VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training | Sep 30, 2022 | Offline RLOpen-Ended Question Answering | CodeCode Available | 1 |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Sep 29, 2022 | Computational EfficiencyD4RL | CodeCode Available | 1 |
| Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | Sep 18, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Sep 15, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | Sep 14, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Task-Agnostic Learning to Accomplish New Tasks | Sep 9, 2022 | Imitation LearningOffline RL | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |
| Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments | Aug 23, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Efficient Planning in a Compact Latent Action Space | Aug 22, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning | Aug 12, 2022 | D4RLOffline RL | CodeCode Available | 2 |
| Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity | Aug 11, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Aug 3, 2022 | Density EstimationOffline RL | CodeCode Available | 1 |
| Offline Reinforcement Learning at Multiple Frequencies | Jul 26, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations | Jul 20, 2022 | Imitation LearningOffline RL | CodeCode Available | 1 |
| BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion | Jul 16, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| GriddlyJS: A Web IDE for Reinforcement Learning | Jul 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Offline Equilibrium Finding | Jul 12, 2022 | Offline RL | CodeCode Available | 0 |
| Offline RL Policies Should be Trained to be Adaptive | Jul 5, 2022 | Offline RL | —Unverified | 0 |
| An Empirical Study of Implicit Regularization in Deep Offline RL | Jul 5, 2022 | Offline RL | —Unverified | 0 |
| Prompting Decision Transformer for Few-Shot Policy Generalization | Jun 27, 2022 | Few-Shot LearningInductive Bias | —Unverified | 0 |
| When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning | Jun 27, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Behavior Transformers: Cloning k modes with one stone | Jun 22, 2022 | Object DetectionOffline RL | CodeCode Available | 1 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Makingmodel | —Unverified | 0 |
| Bootstrapped Transformer for Offline Reinforcement Learning | Jun 17, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning | Jun 17, 2022 | Few-Shot LearningOffline RL | CodeCode Available | 2 |
| Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Jun 16, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Contrastive Learning as Goal-Conditioned Reinforcement Learning | Jun 15, 2022 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Jun 14, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Provably Efficient Offline Reinforcement Learning with Trajectory-Wise Reward | Jun 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Provable Benefit of Multitask Representation Learning in Reinforcement Learning | Jun 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Federated Offline Reinforcement Learning | Jun 11, 2022 | Offline RLPrivacy Preserving | —Unverified | 0 |
| Large-Scale Retrieval for Reinforcement Learning | Jun 10, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations | Jun 9, 2022 | Benchmarkingcontinuous-control | CodeCode Available | 2 |
| Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning | Jun 9, 2022 | D4RLModel-based Reinforcement Learning | CodeCode Available | 1 |