| Preference Elicitation for Offline Reinforcement Learning | Jun 26, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning | May 29, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Preserving Expert-Level Privacy in Offline Reinforcement Learning | Nov 18, 2024 | Offline RLreinforcement-learning | —Unverified | 0 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning | May 9, 2025 | D4RLOffline RL | —Unverified | 0 |
| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | Jun 27, 2023 | D4RLOffline RL | —Unverified | 0 |
| PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement | Nov 26, 2024 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Prompting Decision Transformer for Few-Shot Policy Generalization | Jun 27, 2022 | Few-Shot LearningInductive Bias | —Unverified | 0 |
| Provable Benefit of Multitask Representation Learning in Reinforcement Learning | Jun 13, 2022 | Offline RLreinforcement-learning | —Unverified | 0 |
| What can online reinforcement learning with function approximation benefit from general coverage conditions? | Apr 25, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 |
| Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation | Feb 25, 2023 | Offline RLQ-Learning | —Unverified | 0 |