| Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning | Nov 6, 2022 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap | Jun 20, 2023 | Offline RLReinforcement Learning (RL) | —Unverified | 0 | 0 |
| What are the Statistical Limits of Offline RL with Linear Function Approximation? | Oct 22, 2020 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| What Matters for Batch Online Reinforcement Learning in Robotics? | May 12, 2025 | Imitation LearningOffline RL | —Unverified | 0 | 0 |
| When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning? | Apr 12, 2022 | Atari GamesDiagnostic | —Unverified | 0 | 0 |
| Which Features are Best for Successor Features? | Feb 15, 2025 | Offline RL | —Unverified | 0 | 0 |
| Why Online Reinforcement Learning is Causal | Mar 7, 2024 | counterfactualOffline RL | —Unverified | 0 | 0 |
| Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters | May 27, 2022 | D4RLOffline RL | —Unverified | 0 | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |