SOTAVerified

Offline RL

Papers

Showing 501-525 of 755 papers

Title | Status | Hype
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills | | 0
Boosting Offline Reinforcement Learning via Data Rebalancing | | 0
Boosting Offline Reinforcement Learning with Residual Generative Modeling | | 0
Bootstrapped Transformer for Offline Reinforcement Learning | | 0
BRAC+: Going Deeper with Behavior Regularized Offline Reinforcement Learning | | 0
Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism | | 0
Bridging the Gap Between Offline and Online Reinforcement Learning Evaluation Methodologies | | 0
Budgeting Counterfactual for Offline RL | | 0
Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | | 0
Enhancing Video Analytics Accuracy via Real-time Automated Camera Parameter Tuning | | 0
Can Offline Reinforcement Learning Help Natural Language Understanding? | | 0
Causal prompting model-based offline reinforcement learning | | 0
CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | | 0
Optimal Uniform OPE and Model-based Offline Reinforcement Learning in Time-Homogeneous, Reward-Free and Task-Agnostic Settings | | 0
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer | | 0
CLUE: Calibrated Latent Guidance for Offline Reinforcement Learning | | 0
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization | | 0
Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | | 0
Confidence-Conditioned Value Functions for Offline Reinforcement Learning | | 0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | | 0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | | 0
Context-Former: Stitching via Latent Conditioned Sequence Modeling | | 0
Contextual Transformer for Offline Meta Reinforcement Learning | | 0
Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | | 0
Contrastive Learning as Goal-Conditioned Reinforcement Learning | | 0
Page 21 of 31

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified