| Deploying Offline Reinforcement Learning with Human Feedback | Mar 13, 2023 | Decision MakingModel Selection | —Unverified | 0 |
| Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | Feb 12, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Large-Scale Retrieval for Reinforcement Learning | Jun 10, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding | Jun 1, 2023 | ManagementOffline RL | —Unverified | 0 |
| Bi-Level Offline Policy Optimization with Limited Exploration | Oct 10, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning | Apr 14, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Offline Actor-Critic Reinforcement Learning Scales to Large Models | Feb 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies | May 22, 2025 | Offline RLQ-Learning | —Unverified | 0 |
| Offline Policy Optimization with Variance Regularization | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning | Jul 10, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | Jan 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Dual Approach to Imitation Learning from Observations with Offline Datasets | Jun 13, 2024 | Imitation LearningOffline RL | —Unverified | 0 |
| Language-Conditioned Offline RL for Multi-Robot Navigation | Jul 29, 2024 | Offline RLRobot Navigation | —Unverified | 0 |
| Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation | Mar 26, 2025 | D4RLData Augmentation | —Unverified | 0 |
| DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning | Feb 23, 2021 | Continuous ControlOffline RL | —Unverified | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | Nov 2, 2021 | D4RLData Augmentation | —Unverified | 0 |
| Model-enhanced Contrastive Reinforcement Learning for Sequential Recommendation | Oct 25, 2023 | Contrastive Learningmodel | —Unverified | 0 |
| Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning | Jun 10, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL | Jun 1, 2022 | D4RLOffline RL | —Unverified | 0 |
| Deep RL with Hierarchical Action Exploration for Dialogue Generation | Mar 22, 2023 | Dialogue GenerationOffline RL | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RLKolmogorov-Arnold Networks | —Unverified | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| Is Pessimism Provably Efficient for Offline RL? | Dec 30, 2020 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications | Feb 15, 2023 | Decision MakingManagement | —Unverified | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 |