SOTAVerified

Offline RL

Papers

Showing 231-240 of 755 papers

Title | Status | Hype
Off-policy Evaluation in Doubly Inhomogeneous Environments | Code | 0
Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning | Code | 0
Offline Equilibrium Finding | Code | 0
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees | Code | 0
Building Persona Consistent Dialogue Agents with Offline Reinforcement Learning | Code | 0
AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | Code | 0
Mutual Information Regularized Offline Reinforcement Learning | Code | 0
Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Code | 0
Multi-Game Decision Transformers | Code | 0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Code | 0
Page 24 of 76

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 |  | Unverified
2 | ADMPO | Average Reward | 81 |  | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 |  | Unverified