SOTAVerified

Offline RL

Papers

Showing 261270 of 755 papers

TitleStatusHype
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning0
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only0
A Fast Convergence Theory for Offline Decision Making0
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning0
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer0
Efficient Imitation Learning with Conservative World Models0
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning0
Dual Generator Offline Reinforcement Learning0
Show:102550
← PrevPage 27 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified