| Skill Decision Transformer | Jan 31, 2023 | D4RLDescriptive | CodeCode Available | 0 |
| Direct Preference-based Policy Optimization without Reward Modeling | Jan 30, 2023 | Contrastive LearningOffline RL | CodeCode Available | 1 |
| Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Jan 30, 2023 | Data AugmentationFeature Engineering | CodeCode Available | 0 |
| Guiding Online Reinforcement Learning with Action-Free Offline Pretraining | Jan 30, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Learning to View: Decision Transformers for Active Object Detection | Jan 23, 2023 | Active Object DetectionMotion Planning | —Unverified | 0 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives | Jan 3, 2023 | Offline RLRecommendation Systems | —Unverified | 0 |
| Benchmarks and Algorithms for Offline Preference-Based Reward Learning | Jan 3, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Offline Policy Optimization in RL with Variance Regularizaton | Dec 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Representation Learning in Deep RL via Discrete Information Bottleneck | Dec 28, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |