SOTAVerified

Offline RL

Papers

Showing 611–620 of 755 papers

Title | Status | Hype
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning | | 0
Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity | | 0
π2vec: Policy Representations with Successor Features | | 0
Planning to Go Out-of-Distribution in Offline-to-Online Reinforcement Learning | | 0
Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone | | 0
Policy-Based Trajectory Clustering in Offline Reinforcement Learning | | 0
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | | 0
Policy Gradients Incorporating the Future | | 0
Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation | | 0
Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning | | 0
Page 62 of 76

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified