SOTAVerified

Offline RL

Papers

Showing 551-560 of 755 papers

| Title | Status | Hype |
|---|---|---|
| On the Role of Discount Factor in Offline Reinforcement Learning | | 0 |
| RORL: Robust Offline Reinforcement Learning via Conservative Smoothing | Code | 1 |
| Offline RL for Natural Language Generation with Implicit Language Q Learning | Code | 2 |
| Offline Reinforcement Learning with Causal Structured World Models | | 0 |
| Offline Reinforcement Learning with Differential Privacy | | 0 |
| Model Generation with Provable Coverability for Offline Reinforcement Learning | | 0 |
| Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL | | 0 |
| Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game | | 0 |
| You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | | 0 |
| Multi-Game Decision Transformers | Code | 0 |

Benchmark Results

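| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |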