| Offline Preference-Based Apprenticeship Learning | Jul 20, 2021 | Active LearningOffline RL | —Unverified | 0 | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation LearningImitation Learning | —Unverified | 0 | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | May 27, 2024 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Nov 1, 2022 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RLOffline RL | —Unverified | 0 | 0 |
| Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL | Jun 26, 2025 | Offline RL | —Unverified | 0 | 0 |
| Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | modelOffline RL | —Unverified | 0 | 0 |
| Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | Aug 27, 2024 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous DrivingOffline RL | —Unverified | 0 | 0 |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Nov 3, 2022 | Model SelectionOffline RL | —Unverified | 0 | 0 |