SOTAVerified

Offline RL

Papers

Showing 661-670 of 755 papers

Title | Status | Hype
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble | | 0
LLQL: Logistic Likelihood Q-Learning for Reinforcement Learning | | 0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments | | 0
Measurement Scheduling for ICU Patients with Offline Reinforcement Learning | | 0
Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning | | 0
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning | | 0
Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization | | 0
Model-Based Offline Planning | | 0
Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation | | 0
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified