SOTAVerified

Offline RL

Papers

Showing 451–475 of 755 papers

| Title | Status | Hype |
|---|---|---|
| You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments | | 0 |
| You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL | | 0 |
| Your Offline Policy is Not Trustworthy: Bilevel Reinforcement Learning for Sequential Portfolio Optimization | | 0 |
| PEARL: Zero-shot Cross-task Preference Alignment and Robust Reward Learning for Robotic Manipulation | | 0 |
| Prior-Guided Diffusion Planning for Offline Reinforcement Learning | | 0 |
| How to Provably Improve Return Conditioned Supervised Learning? | | 0 |
| Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | | 0 |
| Accelerating Offline Reinforcement Learning Application in Real-Time Bidding and Recommendation: Potential Use of Simulation | | 0 |
| Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning | | 0 |
| A Conservative Q-Learning approach for handling distribution shift in sepsis treatment strategies | | 0 |
| Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning | | 0 |
| AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | | 0 |
| Adaptive Policy Learning for Offline-to-Online Reinforcement Learning | | 0 |
| Adaptive Q-learning for Interaction-Limited Reinforcement Learning | | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | | 0 |
| Addressing Extrapolation Error in Deep Offline Reinforcement Learning | | 0 |
| ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning | | 0 |
| A Dual Approach to Imitation Learning from Observations with Offline Datasets | | 0 |
| Advancing RAN Slicing with Offline Reinforcement Learning | | 0 |
| Adversarially Trained Weighted Actor-Critic for Safe Offline Reinforcement Learning | | 0 |
| A Fully Data-Driven Approach for Realistic Traffic Signal Control Using Offline Reinforcement Learning | | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | | 0 |
| Task-Agnostic Learning to Accomplish New Tasks | | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | | 0 |
| An Empirical Study of Implicit Regularization in Deep Offline RL | | 0 |
Page 19 of 31

Benchmark Results

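| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |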