SOTAVerified

Offline RL

Papers

Showing 3140 of 755 papers

TitleStatusHype
Diffusion Guidance Is a Controllable Policy Improvement OperatorCode2
Offline RL for Natural Language Generation with Implicit Language Q LearningCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RLCode1
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive RecommendationCode1
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement LearningCode1
Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced DatasetsCode1
AdaCat: Adaptive Categorical Discretization for Autoregressive ModelsCode1
Consistency Models as a Rich and Efficient Policy Class for Reinforcement LearningCode1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction EstimationCode1
Show:102550
← PrevPage 4 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified