SOTAVerified

Offline RL

Papers

Showing 681690 of 755 papers

TitleStatusHype
Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning0
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game0
Near-Optimal Offline Reinforcement Learning via Double Variance Reduction0
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning0
Neural Network Approximation for Pessimistic Offline Reinforcement Learning0
Off-dynamics Conditional Diffusion Planners0
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control0
Offline Actor-Critic Reinforcement Learning Scales to Large Models0
Offline Evaluation for Reinforcement Learning-based Recommendation: A Critical Issue and Some Alternatives0
Offline Fictitious Self-Play for Competitive Games0
Show:102550
← PrevPage 69 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified