| Offline Primal-Dual Reinforcement Learning for Linear MDPs | May 22, 2023 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes | Nov 28, 2022 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation | Oct 30, 2024 | Offline RLQ-Learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning as Anti-Exploration | Jun 11, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Offline Reinforcement Learning at Multiple Frequencies | Jul 26, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline reinforcement learning for job-shop scheduling problems | Oct 21, 2024 | Combinatorial OptimizationDeep Learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning for Large Scale Language Action Spaces | Sep 29, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Offline Reinforcement Learning for Road Traffic Control | Jan 7, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets | Nov 19, 2023 | ManagementOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | Nov 21, 2021 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning Hands-On | Nov 29, 2020 | Behavioural cloningDecision Making | —Unverified | 0 | 0 |
| Offline Reinforcement Learning Under Value and Density-Ratio Realizability: The Power of Gaps | Mar 25, 2022 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Realizability and Single-policy Concentrability | Feb 9, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Differential Privacy | Jun 2, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes | Sep 18, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Differentiable Function Approximation is Provably Efficient | Oct 3, 2022 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Additional Covering Distributions | May 22, 2023 | Inductive BiasOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Imbalanced Datasets | Jul 6, 2023 | D4RLOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Behavioral Supervisor Tuning | Apr 25, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Adaptive Behavior Regularization | Nov 15, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Causal Structured World Models | Jun 3, 2022 | Model-based Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Closed-Form Policy Improvement Operators | Nov 29, 2022 | D4RLForm | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Discrete Diffusion Skills | Mar 26, 2025 | DecoderOffline RL | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Fisher Divergence Critic Regularization | Mar 14, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Offline Reinforcement Learning with Resource Constrained Online Deployment | Sep 29, 2021 | D4RLOffline RL | —Unverified | 0 | 0 |
| Offline RL Policies Should be Trained to be Adaptive | Jul 5, 2022 | Offline RL | —Unverified | 0 | 0 |
| Offline RL via Feature-Occupancy Gradient Ascent | May 22, 2024 | Offline RL | —Unverified | 0 | 0 |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Oct 31, 2023 | Autonomous NavigationOffline RL | —Unverified | 0 | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari GamesOffline RL | —Unverified | 0 | 0 |
| Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator | Apr 23, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Trajectory Generalization for Offline Reinforcement Learning | Apr 16, 2024 | D4RLData Augmentation | —Unverified | 0 | 0 |
| OffRIPP: Offline RL-based Informative Path Planning | Sep 25, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot LearningImitation Learning | —Unverified | 0 | 0 |
| Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks | Mar 11, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation | Nov 23, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement LearningMixture-of-Experts | —Unverified | 0 | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision MakingDiversity | —Unverified | 0 | 0 |
| On the Role of Discount Factor in Offline Reinforcement Learning | Jun 7, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples | Mar 7, 2023 | Offline RLOff-policy evaluation | —Unverified | 0 | 0 |
| On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures | Jan 3, 2025 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Preference-Based Apprenticeship Learning | Jul 20, 2021 | Active LearningOffline RL | —Unverified | 0 | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation LearningImitation Learning | —Unverified | 0 | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | May 27, 2024 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Nov 1, 2022 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL | Jun 26, 2025 | Offline RL | —Unverified | 0 | 0 |
| Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | modelOffline RL | —Unverified | 0 | 0 |
| Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | Aug 27, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous DrivingOffline RL | —Unverified | 0 | 0 |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Nov 3, 2022 | Model SelectionOffline RL | —Unverified | 0 | 0 |