| Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Jun 10, 2024 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-control, Continuous Control | —Unverified | 0 |
| A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | Jun 13, 2023 | D4RL, Efficient Exploration | —Unverified | 0 |
| Advancing RAN Slicing with Offline Reinforcement Learning | Dec 16, 2023 | Management, Offline RL | —Unverified | 0 |
| ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data | Nov 8, 2022 | Offline RL | —Unverified | 0 |
| Diffusion Self-Weighted Guidance for Offline Reinforcement Learning | May 23, 2025 | Offline RL, reinforcement-learning | —Unverified | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Jul 10, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactual, Counterfactual Reasoning | —Unverified | 0 |
| A Dual Approach to Imitation Learning from Observations with Offline Datasets | Jun 13, 2024 | Imitation Learning, Offline RL | —Unverified | 0 |
| Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | Dec 15, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 |