SOTAVerified

Offline RL

Papers

Showing 701–750 of 755 papers

Title | Status | Hype
A Survey on Offline Reinforcement Learning: Taxonomy, Review, and Open Problems | Code | 0
Skill Decision Transformer | Code | 0
Multi-Game Decision Transformers | Code | 0
Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic Spaces | Code | 0
Corruption-Robust Offline Reinforcement Learning with General Function Approximation | Code | 0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement Learning | Code | 0
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? | Code | 0
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning Attacks | Code | 0
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulator | Code | 0
SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning | Code | 0
Model-based Offline Reinforcement Learning with Count-based Conservatism | Code | 0
Solving Offline Reinforcement Learning with Decision Tree Regression | Code | 0
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity | Code | 0
Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning | Code | 0
Beyond Reward: Offline Preference-guided Policy Optimization | Code | 0
SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Code | 0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics Belief | Code | 0
Model-based Offline Policy Optimization with Adversarial Network | Code | 0
Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Code | 0
Stabilizing Extreme Q-learning by Maclaurin Expansion | Code | 0
Model-Based Offline Planning with Trajectory Pruning | Code | 0
Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination | Code | 0
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Code | 0
Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies | Code | 0
Contrastive Example-Based Control | Code | 0
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness | Code | 0
Diffusion Models as Optimizers for Efficient Planning in Offline RL | Code | 0
Step-wise Policy for Rare-tool Knowledge (SPaRK): Offline RL that Drives Diverse Tool Use in LLMs | Code | 0
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Code | 0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator | Code | 0
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy Learning | Code | 0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from Observations | Code | 0
Revisiting Bellman Errors for Offline Model Selection | Code | 0
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective | Code | 0
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Code | 0
Continual Task Learning through Adaptive Policy Self-Composition | Code | 0
Learning Versatile Skills with Curriculum Masking | Code | 0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPs | Code | 0
Behavior Prior Representation learning for Offline Reinforcement Learning | Code | 0
Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Code | 0
Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | Code | 0
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning | Code | 0
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning | Code | 0
Decision Transformer under Random Frame Dropping | Code | 0
Learning to Reach Goals via Diffusion | Code | 0
The CoSTAR Block Stacking Dataset: Learning with Workspace Constraints | Code | 0
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Code | 0
TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents | Code | 0
Robust Offline Reinforcement learning with Heavy-Tailed Rewards | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified