| Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning | Oct 27, 2023 | Autonomous DrivingD4RL | —Unverified | 0 |
| CROP: Conservative Reward for Model-based Offline Policy Optimization | Oct 26, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation | Oct 25, 2023 | Contrastive Learningmodel | —Unverified | 0 |
| Finetuning Offline World Models in the Real World | Oct 24, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Oct 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Towards Robust Offline Reinforcement Learning under Diverse Data Corruption | Oct 19, 2023 | Offline RLQ-Learning | CodeCode Available | 1 |
| Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning | Oct 18, 2023 | Offline RLQuantization | —Unverified | 0 |
| Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Oct 16, 2023 | ChatbotOffline RL | CodeCode Available | 0 |
| End-to-end Offline Reinforcement Learning for Glycemia Control | Oct 16, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | Oct 13, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Offline Retraining for Online RL: Decoupled Policy Learning to Mitigate Exploration Bias | Oct 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | Oct 9, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | Oct 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL | Oct 6, 2023 | AttributeOffline RL | CodeCode Available | 1 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RLDecision Making | CodeCode Available | 1 |
| Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | Oct 6, 2023 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | Oct 2, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Sep 29, 2023 | Image GenerationOffline RL | CodeCode Available | 1 |
| Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Sep 29, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Uncertainty-Aware Decision Transformer for Stochastic Driving Environments | Sep 28, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Zero-Shot Reinforcement Learning from Low Quality Data | Sep 26, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | Sep 24, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Counterfactual Conservative Q Learning for Offline Multi-agent Reinforcement Learning | Sep 22, 2023 | counterfactualMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Robotic Offline RL from Internet Videos via Value-Function Pre-Training | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | Sep 18, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | Sep 16, 2023 | D4RLmodel | —Unverified | 0 |
| Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | Sep 14, 2023 | Data AugmentationOffline RL | —Unverified | 0 |
| VAPOR: Legged Robot Navigation in Outdoor Vegetation Using Offline Reinforcement Learning | Sep 14, 2023 | Offline RLreinforcement-learning | CodeCode Available | 1 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Sep 12, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning | Sep 6, 2023 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Model-based Offline Policy Optimization with Adversarial Network | Sep 5, 2023 | modelOffline RL | CodeCode Available | 0 |
| Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance | Sep 4, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | Aug 31, 2023 | D4RLOffline RL | —Unverified | 0 |
| Reinforced Self-Training (ReST) for Language Modeling | Aug 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World | Aug 15, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | Aug 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation | Jul 26, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Contrastive Example-Based Control | Jul 24, 2023 | Offline RL | CodeCode Available | 0 |
| A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning | Jul 24, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Jul 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Self-Assembling Artificial Neural Networks through Neural Developmental Programs | Jul 17, 2023 | Offline RL | CodeCode Available | 1 |
| Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning | Jul 13, 2023 | BenchmarkingOffline RL | CodeCode Available | 1 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |