SOTAVerified

Offline RL

Papers

Showing 31-40 of 755 papers

Title | Status | Hype
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | Code | 2
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Code | 2
Offline RL for Natural Language Generation with Implicit Language Q Learning | Code | 2
All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL | Code | 1
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Code | 1
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Code | 1
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Code | 1
AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Code | 1
Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Code | 1
COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified