SOTAVerified

Offline RL

Papers

Showing 191–200 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Critic Regularized Regression | Code | 1 |
| Conservative Q-Learning for Offline Reinforcement Learning | Code | 1 |
| Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Code | 1 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Code | 1 |
| MOPO: Model-based Offline Policy Optimization | Code | 1 |
| MOReL : Model-Based Offline Reinforcement Learning | Code | 1 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Code | 1 |
| An Optimistic Perspective on Offline Reinforcement Learning | Code | 1 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | | 0 |
| Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs | Code | 0 |

Benchmark Results

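| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |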