SOTAVerified

Offline RL

Papers

Showing 676–700 of 755 papers

Title | Status | Hype
On Multi-objective Policy Optimization as a Tool for Reinforcement Learning: Case Studies in Offline RL and Finetuning |  | 0
Reinforcement Learning as One Big Sequence Modeling Problem | Code | 1
A Minimalist Approach to Offline Reinforcement Learning | Code | 1
Corruption-Robust Offline Reinforcement Learning |  | 0
Offline Reinforcement Learning as Anti-Exploration |  | 0
Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning |  | 0
Offline Inverse Reinforcement Learning |  | 0
Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning | Code | 1
Online reinforcement learning with sparse rewards through an active inference capsule | Code | 1
Offline Reinforcement Learning as One Big Sequence Modeling Problem | Code | 1
Decision Transformer: Reinforcement Learning via Sequence Modeling | Code | 1
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning |  | 0
Revisiting Design Choices in Offline Model Based Reinforcement Learning |  | 0
Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning | Code | 1
Model-Based Offline Planning with Trajectory Pruning | Code | 0
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings |  | 0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective |  | 0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem |  | 0
Online and Offline Reinforcement Learning by Planning with a Learned Model | Code | 1
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism |  | 0
Regularized Behavior Value Estimation |  | 0
Offline Reinforcement Learning with Fisher Divergence Critic Regularization |  | 0
Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks |  | 0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning |  | 0
Instabilities of Offline RL with Pre-Trained Neural Representation |  | 0
Page 28 of 31

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 |  | Unverified
2 | ADMPO | Average Reward | 81 |  | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 |  | Unverified