SOTAVerified

Offline RL

Papers

Showing 351360 of 755 papers

TitleStatusHype
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive RecommendationCode1
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning0
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning0
Offline Reinforcement Learning with Imbalanced Datasets0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning0
Model-Bellman Inconsistency for Model-based Offline Reinforcement LearningCode1
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer0
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching0
Show:102550
← PrevPage 36 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified