SOTAVerified

Offline RL

Papers

Showing 326350 of 755 papers

TitleStatusHype
Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning0
Exclusively Penalized Q-learning for Offline Reinforcement Learning0
Evaluation-Time Policy Switching for Offline Reinforcement Learning0
Evaluation of Active Feature Acquisition Methods for Static Feature Settings0
Equivariant Offline Reinforcement Learning0
Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning0
Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning0
ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles0
Confidence-Conditioned Value Functions for Offline Reinforcement Learning0
Enhancing Reinforcement Learning Through Guided Search0
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits0
A Validation Tool for Designing Reinforcement Learning Environments0
Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention0
Enhanced DACER Algorithm with High Diffusion Efficiency0
Energy-Weighted Flow Matching for Offline Reinforcement Learning0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning0
Automatic Trade-off Adaptation in Offline RL0
End-to-end Offline Reinforcement Learning for Glycemia Control0
Show:102550
← PrevPage 14 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified