SOTAVerified

Offline RL

Papers

Showing 701-710 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | | 0 |
| On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning | | 0 |
| Offline Reinforcement Learning as Anti-Exploration | | 0 |
| Corruption-Robust Offline Reinforcement Learning | | 0 |
| Offline Inverse Reinforcement Learning | | 0 |
| Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning | | 0 |
| Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning | | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | | 0 |
| Model-Based Offline Planning with Trajectory Pruning | Code | 0 |
| Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | | 0 |
Page 71 of 76

Benchmark Results

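| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |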