SOTAVerified

Offline RL

Papers

Showing 601-625 of 755 papers

| Title | Status | Hype |
| --- | --- | --- |
| Can Wikipedia Help Offline Reinforcement Learning? | Code | 1 |
| The Challenges of Exploration for Offline Reinforcement Learning | | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | | 0 |
| Offline Reinforcement Learning for Road Traffic Control | | 0 |
| Single-Shot Pruning for Offline Reinforcement Learning | | 0 |
| Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning | | 0 |
| RvS: What is Essential for Offline RL via Supervised Learning? | Code | 1 |
| A Validation Tool for Designing Reinforcement Learning Environments | | 0 |
| DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization | | 0 |
| Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks | Code | 1 |
| Curriculum Offline Imitating Learning | | 0 |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0 |
| Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions | | 0 |
| Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification | Code | 1 |
| UMBRELLA: Uncertainty-Aware Model-Based Offline Reinforcement Learning Leveraging Planning | | 0 |
| Offline Reinforcement Learning: Fundamental Barriers for Value Function Approximation | | 0 |
| A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | | 0 |
| A Dataset Perspective on Offline Reinforcement Learning | Code | 1 |
| d3rlpy: An Offline Deep Reinforcement Learning Library | Code | 0 |
| RLDS: an Ecosystem to Generate, Share and Use Datasets in Reinforcement Learning | Code | 1 |
| Curriculum Offline Imitation Learning | Code | 1 |
| Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics | | 0 |
| False Correlation Reduction for Offline Reinforcement Learning | Code | 1 |
| Offline Reinforcement Learning with Value-based Episodic Memory | Code | 1 |
| Towards Instance-Optimal Offline Reinforcement Learning with Pessimism | | 0 |

Benchmark Results

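| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |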