SOTAVerified

Offline RL

Papers

Showing 651-700 of 755 papers

Title | Status | Hype
The Challenges of Exploration for Offline Reinforcement Learning | - | 0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | - | 0
Offline Reinforcement Learning for Road Traffic Control | - | 0
Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | - | 0
Single-Shot Pruning for Offline Reinforcement Learning | - | 0
A Validation Tool for Designing Reinforcement Learning Environments | - | 0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | - | 0
Curriculum Offline Imitating Learning | - | 0
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | - | 0
UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | - | 0
Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | - | 0
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | - | 0
d3rlpy: An Offline Deep Reinforcement Learning Library | Code | 0
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | - | 0
Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | - | 0
Value Penalized Q-Learning for Recommender Systems | - | 0
Representation Learning for Online and Offline RL in Low-rank MDPs | - | 0
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters | - | 0
Offline RL With Resource Constrained Online Deployment | Code | 0
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | - | 0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Code | 0
Offline Reinforcement Learning for Large Scale Language Action Spaces | - | 0
Reward Shifting for Optimistic Exploration and Conservative Exploitation | - | 0
Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers | - | 0
Should I Run Offline Reinforcement Learning or Behavioral Cloning? | - | 0
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning | - | 0
Targeted Environment Design from Offline Data | - | 0
The Essential Elements of Offline RL via Supervised Learning | - | 0
CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | - | 0
Particle Based Stochastic Policy Optimization | - | 0
Pareto Policy Pool for Model-based Offline Reinforcement Learning | - | 0
Uncertainty Regularized Policy Learning for Offline Reinforcement Learning | - | 0
Variational oracle guiding for reinforcement learning | - | 0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning | - | 0
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning | - | 0
Offline Reinforcement Learning with Resource Constrained Online Deployment | - | 0
Why so pessimistic? Estimating uncertainties for offline RL through ensembles, and why their independence matters. | - | 0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | - | 0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | - | 0
DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Code | 0
Policy Gradients Incorporating the Future | - | 0
Offline Preference-Based Apprenticeship Learning | - | 0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | - | 0
Pessimistic Model-based Offline Reinforcement Learning under Partial Coverage | - | 0
Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | - | 0
The Least Restriction for Offline Reinforcement Learning | - | 0
Optimality Inductive Biases and Agnostic Guidelines for Offline Reinforcement Learning | Code | 0
Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | - | 0
Boosting Offline Reinforcement Learning with Residual Generative Modeling | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | - | Unverified
2 | ADMPO | Average Reward | 81 | - | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | - | Unverified