SOTAVerified

Offline RL

Papers

Showing 621-630 of 755 papers

| Title | Status | Hype |
| --- | --- | --- |
| Preference Elicitation for Offline Reinforcement Learning | | 0 |
| Preferred-Action-Optimized Diffusion Policies for Offline Reinforcement Learning | | 0 |
| Preserving Expert-Level Privacy in Offline Reinforcement Learning | | 0 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning | | 0 |
| Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning | | 0 |
| PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement | | 0 |
| Prompting Decision Transformer for Few-Shot Policy Generalization | | 0 |
| Provable Benefit of Multitask Representation Learning in Reinforcement Learning | | 0 |
| What can online reinforcement learning with function approximation benefit from general coverage conditions? | | 0 |
| Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation | | 0 |

Benchmark Results

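| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |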