SOTAVerified

Offline RL

Papers

Showing 681690 of 755 papers

TitleStatusHype
Particle Based Stochastic Policy Optimization0
Pareto Policy Pool for Model-based Offline Reinforcement Learning0
Uncertainty Regularized Policy Learning for Offline Reinforcement Learning0
Variational oracle guiding for reinforcement learning0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning0
Offline Reinforcement Learning with Resource Constrained Online Deployment0
Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters.0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Show:102550
← PrevPage 69 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified