| Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments | Oct 13, 2023 | Active LearningOffline RL | —Unverified | 0 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | Oct 9, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | Oct 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | Oct 6, 2023 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | Oct 2, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Sep 29, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Uncertainty-Aware Decision Transformer for Stochastic Driving Environments | Sep 28, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | Sep 24, 2023 | Autonomous DrivingOffline RL | —Unverified | 0 |
| Robotic Offline RL from Internet Videos via Value-Function Pre-Training | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions | Sep 18, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | Sep 16, 2023 | D4RLmodel | —Unverified | 0 |
| Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | Sep 14, 2023 | Data AugmentationOffline RL | —Unverified | 0 |
| Model-based Offline Policy Optimization with Adversarial Network | Sep 5, 2023 | modelOffline RL | CodeCode Available | 0 |
| Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance | Sep 4, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | Aug 31, 2023 | D4RLOffline RL | —Unverified | 0 |
| Reinforced Self-Training (ReST) for Language Modeling | Aug 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Real Robot Challenge 2022: Learning Dexterous Manipulation from Offline Data in the Real World | Aug 15, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | Aug 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation | Jul 26, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Contrastive Example-Based Control | Jul 24, 2023 | Offline RL | CodeCode Available | 0 |
| A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning | Jul 24, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Jul 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Jul 10, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Goal-Conditioned Predictive Coding for Offline Reinforcement Learning | Jul 7, 2023 | Decision MakingOffline RL | —Unverified | 0 |
| Offline Reinforcement Learning with Imbalanced Datasets | Jul 6, 2023 | D4RLOffline RL | —Unverified | 0 |
| LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | Jul 5, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | Jun 27, 2023 | D4RLOffline RL | —Unverified | 0 |
| Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization | Jun 26, 2023 | Offline RLTest-time Adaptation | —Unverified | 0 |
| ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | Jun 26, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching | Jun 24, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data | Jun 24, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | Jun 23, 2023 | Imitation LearningOffline RL | —Unverified | 0 |
| Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap | Jun 20, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Automatic Trade-off Adaptation in Offline RL | Jun 16, 2023 | Offline RL | —Unverified | 0 |
| Semi-Offline Reinforcement Learning for Optimized Text Generation | Jun 16, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| 2vec: Policy Representations with Successor Features | Jun 16, 2023 | Offline RL | —Unverified | 0 |
| Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization | Jun 15, 2023 | ManagementMulti-agent Reinforcement Learning | —Unverified | 0 |
| Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources | Jun 14, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Off-policy Evaluation in Doubly Inhomogeneous Environments | Jun 14, 2023 | Offline RLOff-policy evaluation | CodeCode Available | 0 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | Jun 13, 2023 | D4RLEfficient Exploration | —Unverified | 0 |
| Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Jun 13, 2023 | Learning-To-RankOffline RL | CodeCode Available | 0 |
| Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care | Jun 13, 2023 | Offline RLQ-Learning | —Unverified | 0 |
| ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Jun 12, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |