SOTAVerified

Offline RL

Papers

Showing 341350 of 755 papers

TitleStatusHype
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments0
Deploying Offline Reinforcement Learning with Human Feedback0
Large-Scale Retrieval for Reinforcement Learning0
Delphic Offline Reinforcement Learning under Nonidentifiable Hidden Confounding0
Bi-Level Offline Policy Optimization with Limited Exploration0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning0
Nearly Minimax Optimal Offline Reinforcement Learning with Linear Function Approximation: Single-Agent MDP and Markov Game0
Large Language Model driven Policy Exploration for Recommender Systems0
Show:102550
← PrevPage 35 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified