| Launchpad: Learning to Schedule Using Offline and Online RL Methods | Dec 1, 2022 | Deep Reinforcement LearningOffline RL | —Unverified | 0 |
| Representation Learning for Online and Offline RL in Low-rank MDPs | Oct 9, 2021 | Offline RLRepresentation Learning | —Unverified | 0 |
| Representation Learning in Deep RL via Discrete Information Bottleneck | Dec 28, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Representation Matters: Offline Pretraining for Sequential Decision Making | Feb 11, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning | Feb 3, 2025 | Meta-LearningOffline RL | —Unverified | 0 |
| Rethinking Decision Transformer via Hierarchical Reinforcement Learning | Nov 1, 2023 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 |
| Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints | Dec 28, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | May 17, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning | Feb 15, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning | May 6, 2024 | Offline RL | —Unverified | 0 |
| Robotic Offline RL from Internet Videos via Value-Function Pre-Training | Sep 22, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Robust Bandwidth Estimation for Real-Time Communication with Offline Reinforcement Learning | Jul 8, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling | Jul 5, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Robust Offline Reinforcement Learning from Low-Quality Data | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | —Unverified | 0 |
| Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization | Nov 27, 2024 | Computational EfficiencyOffline RL | —Unverified | 0 |
| S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning | Mar 10, 2021 | Autonomous DrivingD4RL | —Unverified | 0 |
| Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving | Oct 31, 2023 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Scaling Offline RL via Efficient and Expressive Shortcut Models | May 28, 2025 | Offline RLreinforcement-learning | —Unverified | 0 |
| Scaling Vision-and-Language Navigation With Offline RL | Mar 27, 2024 | Offline RLVision and Language Navigation | —Unverified | 0 |
| Selective Uncertainty Propagation in Offline RL | Feb 1, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning | Oct 6, 2023 | Multi-agent Reinforcement LearningOffline RL | —Unverified | 0 |
| Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning | Nov 29, 2023 | AstronomyOffline RL | —Unverified | 0 |
| Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models | Jun 6, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Semi-gradient DICE for Offline Constrained Reinforcement Learning | Jun 10, 2025 | Offline RLOff-policy evaluation | —Unverified | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration | Oct 7, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Settling the Communication Complexity for Distributed Offline Reinforcement Learning | Feb 10, 2022 | Multi-Armed BanditsOffline RL | —Unverified | 0 |
| Settling the Sample Complexity of Model-Based Offline Reinforcement Learning | Apr 11, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| Should I Run Offline Reinforcement Learning or Behavioral Cloning? | Sep 29, 2021 | Atari GamesDiagnostic | —Unverified | 0 |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | Oct 8, 2021 | Decision Makingenergy management | —Unverified | 0 |
| Single-Shot Pruning for Offline Reinforcement Learning | Dec 31, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Data-Incremental Continual Offline Reinforcement Learning | Apr 19, 2024 | Continual LearningOffline RL | —Unverified | 0 |
| Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | Aug 28, 2024 | Drone navigationOffline RL | —Unverified | 0 |
| SLiC-HF: Sequence Likelihood Calibration with Human Feedback | May 17, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Solving Continual Offline Reinforcement Learning with Decision Transformer | Jan 16, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces | Oct 21, 2024 | Continual LearningLifelong learning | —Unverified | 0 |
| Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | Jul 17, 2024 | Autonomous DrivingDecision Making | —Unverified | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RLImitation Learning | —Unverified | 0 |
| State Advantage Weighting for Offline RL | Oct 9, 2022 | D4RLOffline RL | —Unverified | 0 |
| State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 |
| State Regularized Policy Optimization on Data with Dynamics Shift | Jun 6, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments | Aug 23, 2022 | Decision MakingOffline RL | —Unverified | 0 |
| Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC | Nov 11, 2024 | Offline RL | —Unverified | 0 |
| Striving for Simplicity in Off-Policy Deep Reinforcement Learning | Sep 25, 2019 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RLOffline RL | —Unverified | 0 |
| Survival Instinct in Offline Reinforcement Learning | Jun 5, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |