SOTAVerified

Offline RL

Papers

Showing 726-750 of 755 papers

Title | Status | Hype
What are the Statistical Limits of Offline RL with Linear Function Approximation? | | 0
Batch Exploration with Examples for Scalable Robotic Reinforcement Learning | Code | 1
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Code | 0
Learning Dexterous Manipulation from Suboptimal Experts | | 0
Human-centric Dialog Training via Offline Reinforcement Learning | | 0
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization | Code | 1
Rethinking Attention with Performers | Code | 2
The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line | | 0
Offline Meta-Reinforcement Learning with Advantage Weighting | Code | 1
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | | 0
Model-Based Offline Planning | | 0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL | | 0
Hyperparameter Selection for Offline Reinforcement Learning | | 0
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning | | 0
Transformers are RNNs: Fast Autoregressive Transformers with Linear Attention | Code | 1
Critic Regularized Regression | Code | 1
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Code | 0
Conservative Q-Learning for Offline Reinforcement Learning | Code | 1
Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization | Code | 1
Acme: A Research Framework for Distributed Reinforcement Learning | Code | 1
MOPO: Model-based Offline Policy Optimization | Code | 1
MOReL : Model-Based Offline Reinforcement Learning | Code | 1
D4RL: Datasets for Deep Data-Driven Reinforcement Learning | Code | 2
Reformer: The Efficient Transformer | Code | 2
An Optimistic Perspective on Offline Deep Reinforcement Learning | Code | 1
Page 30 of 31

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified