SOTAVerified

Offline RL

Papers

Showing 476-500 of 755 papers

Title | Status | Hype
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems | | 0
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | | 0
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs | | 0
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data | | 0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning | | 0
A Strong Baseline for Batch Imitation Learning | | 0
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | | 0
A Survey on Model-based Reinforcement Learning | | 0
A Fast Convergence Theory for Offline Decision Making | | 0
Augmenting Offline RL with Unlabeled Data | | 0
Automatic Trade-off Adaptation in Offline RL | | 0
A Validation Tool for Designing Reinforcement Learning Environments | | 0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | | 0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | | 0
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion | | 0
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | | 0
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | | 0
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL | | 0
Behavior Regularized Offline Reinforcement Learning | | 0
Behaviour Discovery and Attribution for Explainable Reinforcement Learning | | 0
Bellman Residual Orthogonalization for Offline Reinforcement Learning | | 0
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation | | 0
Benchmarks and Algorithms for Offline Preference-Based Reward Learning | | 0
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators | | 0
Bi-Level Offline Policy Optimization with Limited Exploration | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified