SOTAVerified

Offline RL

Papers

Showing 751–755 of 755 papers

| Title | Status | Hype |
|---|---|---|
| A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning | Code | 0 |
| Behavior Estimation from Multi-Source Data for Offline Reinforcement Learning | Code | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Code | 0 |
| Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0 |
| Robust Reinforcement Learning Objectives for Sequential Recommender Systems | Code | 0 |

Benchmark Results

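| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |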