SOTAVerified

Offline RL

Papers

Showing 741750 of 755 papers

TitleStatusHype
What are the Statistical Limits of Offline RL with Linear Function Approximation?0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPsCode0
Learning Dexterous Manipulation from Suboptimal Experts0
Human-centric Dialog Training via Offline Reinforcement Learning0
The reinforcement learning-based multi-agent cooperative approach for the adaptive speed regulation on a metallurgical pickling line0
Model-Based Offline Planning0
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL0
Hyperparameter Selection for Offline Reinforcement Learning0
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning0
Show:102550
← PrevPage 75 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified