SOTAVerified

Offline RL

Papers

Showing 676700 of 755 papers

TitleStatusHype
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning0
Targeted Environment Design from Offline Data0
The Essential Elements of Offline RL via Supervised Learning0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games0
Particle Based Stochastic Policy Optimization0
Pareto Policy Pool for Model-based Offline Reinforcement Learning0
Uncertainty Regularized Policy Learning for Offline Reinforcement Learning0
Variational oracle guiding for reinforcement learning0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning0
Offline Reinforcement Learning with Resource Constrained Online Deployment0
Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters.0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
DCUR: Data Curriculum for Teaching via Samples with Reinforcement LearningCode0
Policy Gradients Incorporating the Future0
Offline Preference-Based Apprenticeship Learning0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning0
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage0
Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning0
The Least Restriction for Offline Reinforcement Learning0
Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement LearningCode0
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL0
Boosting Offline Reinforcement Learning with Residual Generative Modeling0
Show:102550
← PrevPage 28 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified