| Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization | Mar 28, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Optimal Transport for Offline Imitation Learning | Mar 24, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| DataLight: Offline Data-Driven Traffic Signal Control | Mar 20, 2023 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| The In-Sample Softmax for Offline Reinforcement Learning | Feb 28, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Neural Laplace Control for Continuous-time Delayed Systems | Feb 24, 2023 | Model Predictive ControlOffline RL | CodeCode Available | 1 |
| Behavior Proximal Policy Optimization | Feb 22, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Swapped goal-conditioned offline reinforcement learning | Feb 17, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Dual RL: Unification and New Methods for Reinforcement and Imitation Learning | Feb 16, 2023 | Imitation LearningOffline RL | CodeCode Available | 1 |
| Direct Preference-based Policy Optimization without Reward Modeling | Jan 30, 2023 | Contrastive LearningOffline RL | CodeCode Available | 1 |
| Guiding Online Reinforcement Learning with Action-Free Offline Pretraining | Jan 30, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Extreme Q-Learning: MaxEnt RL without Entropy | Jan 5, 2023 | D4RLDeep Reinforcement Learning | CodeCode Available | 1 |
| Offline Reinforcement Learning for Visual Navigation | Dec 16, 2022 | NavigateOffline RL | CodeCode Available | 1 |
| Efficient Reinforcement Learning Through Trajectory Generation | Nov 30, 2022 | LEMMAOffline RL | CodeCode Available | 1 |
| One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning | Nov 30, 2022 | AllDecision Making | CodeCode Available | 1 |
| Masked Autoencoding for Scalable and Generalizable Decision Making | Nov 23, 2022 | Decision MakingOffline RL | CodeCode Available | 1 |
| Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows | Nov 20, 2022 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size | Nov 20, 2022 | Offline RL | CodeCode Available | 1 |
| Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information | Oct 31, 2022 | Offline RLReinforcement Learning (RL) | CodeCode Available | 1 |
| Leveraging Demonstrations with Latent Space Priors | Oct 26, 2022 | Offline RL | CodeCode Available | 1 |
| Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning | Oct 25, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| MoCoDA: Model-based Counterfactual Data Augmentation | Oct 20, 2022 | counterfactualData Augmentation | CodeCode Available | 1 |
| A Policy-Guided Imitation Approach for Offline Reinforcement Learning | Oct 15, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories | Oct 12, 2022 | D4RLOffline RL | CodeCode Available | 1 |
| Efficient Offline Policy Optimization with a Learned Model | Oct 12, 2022 | Offline RL | CodeCode Available | 1 |