SOTAVerified

Offline RL

Papers

Showing 511-520 of 755 papers

Title | Status | Hype
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | Code | 0
Robust Reinforcement Learning Objectives for Sequential Recommender Systems | Code | 0
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism | | 0
Beyond Reward: Offline Preference-guided Policy Optimization | Code | 0
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning | Code | 0
Offline Primal-Dual Reinforcement Learning for Linear MDPs | | 0
Offline Reinforcement Learning with Additional Covering Distributions | | 0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | | 0
SLiC-HF: Sequence Likelihood Calibration with Human Feedback | | 0
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified