| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RL, Offline RL | —Unverified | 0 | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | Oct 7, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed Bandits, Offline RL | —Unverified | 0 | 0 |
| Settling the Sample Complexity of Model-Based Offline Reinforcement Learning | Apr 11, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Sep 29, 2021 | Atari Games, Diagnostic | —Unverified | 0 | 0 |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Oct 8, 2021 | Decision Making, energy management | —Unverified | 0 | 0 |
| Single-Shot Pruning for Offline Reinforcement Learning | Dec 31, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual Learning, Offline RL | —Unverified | 0 | 0 |
| Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Aug 28, 2024 | Drone navigation, Offline RL | —Unverified | 0 | 0 |
| SLiC-HF: Sequence Likelihood Calibration with Human Feedback | May 17, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | Jan 16, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Oct 21, 2024 | Continual Learning, Lifelong learning | —Unverified | 0 | 0 |
| Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Jul 17, 2024 | Autonomous Driving, Decision Making | —Unverified | 0 | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RL, Imitation Learning | —Unverified | 0 | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RL, Offline RL | —Unverified | 0 | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RL, Q-Learning | —Unverified | 0 | 0 |
| State Regularized Policy Optimization on Data with Dynamics Shift | Jun 6, 2023 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments | Aug 23, 2022 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Nov 11, 2024 | Offline RL | —Unverified | 0 | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Survival Instinct in Offline Reinforcement Learning | Jun 5, 2023 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RL, Decision Making | —Unverified | 0 | 0 |