| Offline RL Policies Should be Trained to be Adaptive | Jul 5, 2022 | Offline RL | —Unverified | 0 | 0 |
| Offline RL via Feature-Occupancy Gradient Ascent | May 22, 2024 | Offline RL | —Unverified | 0 | 0 |
| Offline RL with Observation Histories: Analyzing and Improving Sample Complexity | Oct 31, 2023 | Autonomous Navigation, Offline RL | —Unverified | 0 | 0 |
| Offline RL With Realistic Datasets: Heteroskedasticity and Support Constraints | Nov 2, 2022 | Atari Games, Offline RL | —Unverified | 0 | 0 |
| Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator | Apr 23, 2025 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Trajectory Generalization for Offline Reinforcement Learning | Apr 16, 2024 | D4RL, Data Augmentation | —Unverified | 0 | 0 |
| OffRIPP: Offline RL-based Informative Path Planning | Sep 25, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | Feb 5, 2025 | Few-Shot Learning, Imitation Learning | —Unverified | 0 | 0 |
| Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks | Mar 11, 2021 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation | Nov 23, 2022 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | Jun 15, 2021 | Deep Reinforcement Learning, Mixture-of-Experts | —Unverified | 0 | 0 |
| On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond | Jan 6, 2024 | Decision Making, Diversity | —Unverified | 0 | 0 |
| On the Role of Discount Factor in Offline Reinforcement Learning | Jun 7, 2022 | D4RL, Offline RL | —Unverified | 0 | 0 |
| On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples | Mar 7, 2023 | Offline RL, Off-policy evaluation | —Unverified | 0 | 0 |
| On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures | Jan 3, 2025 | Offline RL, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Offline Preference-Based Apprenticeship Learning | Jul 20, 2021 | Active Learning, Offline RL | —Unverified | 0 | 0 |
| OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning | Oct 26, 2020 | Few-Shot Imitation Learning, Imitation Learning | —Unverified | 0 | 0 |
| OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators | May 27, 2024 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian | Nov 1, 2022 | Decision Making, Offline RL | —Unverified | 0 | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RL, Offline RL | —Unverified | 0 | 0 |
| Optimal Single-Policy Sample Complexity and Transient Coverage for Average-Reward Offline RL | Jun 26, 2025 | Offline RL | —Unverified | 0 | 0 |
| Optimistic Model Rollouts for Pessimistic Offline Policy Optimization | Jan 11, 2024 | model, Offline RL | —Unverified | 0 | 0 |
| Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | Aug 27, 2024 | Offline RL, reinforcement-learning | —Unverified | 0 | 0 |
| Optimizing Trajectories for Highway Driving with Offline Reinforcement Learning | Mar 21, 2022 | Autonomous Driving, Offline RL | —Unverified | 0 | 0 |
| Oracle Inequalities for Model Selection in Offline Reinforcement Learning | Nov 3, 2022 | Model Selection, Offline RL | —Unverified | 0 | 0 |