| Boosting Offline Reinforcement Learning via Data Rebalancing | Oct 17, 2022 | D4RLOffline RL | —Unverified | 0 |
| Data-Efficient Pipeline for Offline Reinforcement Learning with Limited Data | Oct 16, 2022 | Model SelectionOffline RL | —Unverified | 0 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Mutual Information Regularized Offline Reinforcement Learning | Oct 14, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Oct 13, 2022 | D4RLOffline RL | CodeCode Available | 0 |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Oct 12, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Efficient Offline Policy Optimization with a Learned Model | Oct 12, 2022 | Offline RL | CodeCode Available | 1 |
| Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials | Oct 11, 2022 | Offline RLQ-Learning | CodeCode Available | 1 |
| Reliable Conditioning of Behavioral Cloning for Offline Reinforcement Learning | Oct 11, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| The Role of Coverage in Online Reinforcement Learning | Oct 9, 2022 | Efficient ExplorationOffline RL | —Unverified | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| BAFFLE: Hiding Backdoors in Offline Reinforcement Learning Datasets | Oct 7, 2022 | Autonomous DrivingBackdoor Attack | CodeCode Available | 1 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| S2P: State-conditioned Image Synthesis for Data Augmentation in Offline Reinforcement Learning | Sep 30, 2022 | Data AugmentationImage Generation | CodeCode Available | 0 |
| VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training | Sep 30, 2022 | Offline RLOpen-Ended Question Answering | CodeCode Available | 1 |
| Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling | Sep 29, 2022 | Computational EfficiencyD4RL | CodeCode Available | 1 |
| Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | Sep 18, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Can Offline Reinforcement Learning Help Natural Language Understanding? | Sep 15, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Optimistic Curiosity Exploration and Conservative Exploitation with Linear Reward Shaping | Sep 15, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation | Sep 14, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Task-Agnostic Learning to Accomplish New Tasks | Sep 9, 2022 | Imitation LearningOffline RL | —Unverified | 0 |
| Q-learning Decision Transformer: Leveraging Dynamic Programming for Conditional Sequence Modelling in Offline RL | Sep 8, 2022 | D4RLOffline RL | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |
| Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments | Aug 23, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Efficient Planning in a Compact Latent Action Space | Aug 22, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |