SOTAVerified

Offline RL

Papers

Showing 2130 of 755 papers

TitleStatusHype
Enhanced DACER Algorithm with High Diffusion Efficiency0
SOReL and TOReL: Two Methods for Fully Offline Reinforcement LearningCode0
Scaling Offline RL via Efficient and Expressive Shortcut Models0
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RLCode0
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning0
Diffusion Self-Weighted Guidance for Offline Reinforcement Learning0
PyTupli: A Scalable Infrastructure for Collaborative Offline Reinforcement Learning ProjectsCode0
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only0
Offline Guarded Safe Reinforcement Learning for Medical Treatment Optimization Strategies0
Unearthing Gems from Stones: Policy Optimization with Negative Sample Augmentation for LLM Reasoning0
Show:102550
← PrevPage 3 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified