SOTAVerified

Offline RL

Papers

Showing 441450 of 755 papers

TitleStatusHype
Wall Street Tree Search: Risk-Aware Planning for Offline Reinforcement Learning0
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap0
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
What Matters for Batch Online Reinforcement Learning in Robotics?0
When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?0
Which Features are Best for Successor Features?0
Why Online Reinforcement Learning is Causal0
Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters.0
Why So Pessimistic? Estimating Uncertainties for Offline RL through Ensembles, and Why Their Independence Matters0
Yes, Q-learning Helps Offline In-Context RL0
Show:102550
← PrevPage 45 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified