SOTAVerified

Offline RL

Papers

Showing 3140 of 755 papers

TitleStatusHype
Think-J: Learning to Think for Generative LLM-as-a-JudgeCode0
Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization0
Prior-Guided Diffusion Planning for Offline Reinforcement Learning0
ImagineBench: Evaluating Reinforcement Learning with Large Language Model RolloutsCode1
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data0
Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL0
What Matters for Batch Online Reinforcement Learning in Robotics?0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains0
Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach0
Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning0
Show:102550
← PrevPage 4 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified