SOTAVerified

Offline RL

Papers

Showing 321–330 of 755 papers

| Title | Status | Hype |
|---|---|---|
| Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | | 0 |
| Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Code | 0 |
| D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | | 0 |
| Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | | 0 |
| Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | | 0 |
| Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | | 0 |
| Language-Conditioned Offline RL for Multi-Robot Navigation | | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Code | 0 |
| ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems | Code | 0 |
| Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning | | 0 |

Benchmark Results

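| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |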