SOTAVerified

Offline RL

Papers

Showing 651-675 of 755 papers

| Title | Status | Hype |
| The Challenges of Exploration for Offline Reinforcement Learning | | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | | 0 |
| Offline Reinforcement Learning for Road Traffic Control | | 0 |
| Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | | 0 |
| Single-Shot Pruning for Offline Reinforcement Learning | | 0 |
| A Validation Tool for Designing Reinforcement Learning Environments | | 0 |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | | 0 |
| Curriculum Offline Imitating Learning | | 0 |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | | 0 |
| UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | | 0 |
| Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | | 0 |
| A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | | 0 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Code | 0 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | | 0 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | | 0 |
| Value Penalized Q-Learning for Recommender Systems | | 0 |
| Representation Learning for Online and Offline RL in Low-rank MDPs | | 0 |
| Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | | 0 |
| Offline RL With Resource Constrained Online Deployment | Code | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | | 0 |
| BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Code | 0 |
| Offline Reinforcement Learning for Large Scale Language Action Spaces | | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | | 0 |
| Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |