SOTAVerified

Offline RL

Papers

Showing 451500 of 755 papers

TitleStatusHype
You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments0
You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL0
Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization0
PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation0
Prior-Guided Diffusion Planning for Offline Reinforcement Learning0
How to Provably Improve Return Conditioned Supervised Learning?0
Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation0
Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation0
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning0
A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies0
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning0
AdaCred: Adaptive Causal Decision Transformers with Feature Crediting0
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning0
Adaptive Q-learning for Interaction-Limited Reinforcement Learning0
Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets0
Addressing Extrapolation Error in Deep Offline Reinforcement Learning0
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning0
A Dual Approach to Imitation Learning from Observations with Offline Datasets0
Advancing RAN Slicing with Offline Reinforcement Learning0
Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning0
A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
Task-Agnostic Learning to Accomplish New Tasks0
Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning0
An Empirical Study of Implicit Regularization in Deep Offline RL0
An Offline Reinforcement Learning Algorithm Customized for Multi-Task Fusion in Large-Scale Recommender Systems0
A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning0
A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Linear MDPs0
ARMOR: A Model-based Framework for Improving Arbitrary Baseline Policies with Offline Data0
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning0
A Strong Baseline for Batch Imitation Learning0
A Survey of Zero-shot Generalisation in Deep Reinforcement Learning0
A Survey on Model-based Reinforcement Learning0
A Fast Convergence Theory for Offline Decision Making0
Augmenting Offline RL with Unlabeled Data0
Automatic Trade-off Adaptation in Offline RL0
A Validation Tool for Designing Reinforcement Learning Environments0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models0
BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion0
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning0
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning0
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL0
Behavior Regularized Offline Reinforcement Learning0
Behaviour Discovery and Attribution for Explainable Reinforcement Learning0
Bellman Residual Orthogonalization for Offline Reinforcement Learning0
Benchmarking Offline Reinforcement Learning Algorithms for E-Commerce Order Fraud Evaluation0
Benchmarks and Algorithms for Offline Preference-Based Reward Learning0
Benchmarks for Reinforcement Learning with Biased Offline Data and Imperfect Simulators0
Bi-Level Offline Policy Optimization with Limited Exploration0
Show:102550
← PrevPage 10 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified