SOTAVerified

Offline RL

Papers

Showing 421–430 of 755 papers

Title | Status | Hype
Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear q^π-Realizability and Concentrability | | 0
Trajectory-wise Iterative Reinforcement Learning Framework for Auto-bidding | | 0
Transferred Q-learning | | 0
UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | | 0
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | | 0
Uncertainty-Aware Decision Transformer for Stochastic Driving Environments | | 0
Uncertainty-aware Distributional Offline Reinforcement Learning | | 0
Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | | 0
Uncertainty Weighted Offline Reinforcement Learning | | 0
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified