SOTAVerified

Offline RL

Papers

Showing 326350 of 755 papers

TitleStatusHype
MOORL: A Framework for Integrating Offline-Online Reinforcement Learning0
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning0
Learning to Clarify by Reinforcement Learning Through Reward-Weighted Fine-Tuning0
Dialogue Evaluation with Offline Reinforcement Learning0
Addressing Extrapolation Error in Deep Offline Reinforcement Learning0
Learning Pseudometric-based Action Representations for Offline Reinforcement Learning0
Learning Value Functions from Undirected State-only Experience0
Boosting Offline Reinforcement Learning with Residual Generative Modeling0
Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization0
Learning Dexterous Manipulation from Suboptimal Experts0
Boosting Offline Reinforcement Learning for Autonomous Driving with Hierarchical Latent Skills0
Model Generation with Provable Coverability for Offline Reinforcement Learning0
Launchpad: Learning to Schedule Using Offline and Online RL Methods0
Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments0
DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning0
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments0
Deploying Offline Reinforcement Learning with Human Feedback0
Large-Scale Retrieval for Reinforcement Learning0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding0
Bi-Level Offline Policy Optimization with Limited Exploration0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning0
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game0
Large Language Model driven Policy Exploration for Recommender Systems0
Show:102550
← PrevPage 14 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified