SOTAVerified

Offline RL

Papers

Showing 351375 of 755 papers

TitleStatusHype
Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive RecommendationCode1
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning0
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning0
Offline Reinforcement Learning with Imbalanced Datasets0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning0
Model-Bellman Inconsistency for Model-based Offline Reinforcement LearningCode1
Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer0
Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching0
Offline Policy Evaluation for Reinforcement Learning with Adaptively Collected Data0
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning0
Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement LearningCode1
Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory WeightingCode1
Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap0
2vec: Policy Representations with Successor Features0
Automatic Trade-off Adaptation in Offline RL0
Semi-Offline Reinforcement Learning for Optimized Text GenerationCode0
Offline Multi-Agent Reinforcement Learning with Coupled Value Factorization0
Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources0
Off-policy Evaluation in Doubly Inhomogeneous EnvironmentsCode0
Unified Off-Policy Learning to Rank: a Reinforcement Learning PerspectiveCode0
Pruning the Way to Reliable Policies: A Multi-Objective Deep Q-Learning Approach to Critical Care0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles0
Show:102550
← PrevPage 15 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified