SOTAVerified

Offline RL

Papers

Showing 641650 of 755 papers

TitleStatusHype
Real-World Fluid Directed Rigid Body Control via Deep Reinforcement Learning0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage0
Regularized Behavior Value Estimation0
Reinforced Self-Training (ReST) for Language Modeling0
Reinforcement Learning: An Overview0
Reinforcement Learning-based Recommender Systems with Large Language Models for State Reward and Action Modeling0
Reinforcement Learning for Individual Optimal Policy from Heterogeneous Data0
Reinforcement Learning with Human Feedback: Learning Dynamic Choices via Pessimism0
Reliable validation of Reinforcement Learning Benchmarks0
Show:102550
← PrevPage 65 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified