SOTAVerified

Offline RL

Papers

Showing 561570 of 755 papers

TitleStatusHype
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning0
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization0
DRDT3: Diffusion-Refined Decision Test-Time Training Model0
Dual Generator Offline Reinforcement Learning0
Efficient Imitation Learning with Conservative World Models0
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL0
Enabling A Network AI Gym for Autonomous Cyber Agents0
Show:102550
← PrevPage 57 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified