SOTAVerified

Offline RL

Papers

Showing 211220 of 755 papers

TitleStatusHype
Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning0
Constraints Penalized Q-learning for Safe Offline Reinforcement Learning0
Align Your Intents: Offline Imitation Learning via Optimal Transport0
Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning0
Conservative Data Sharing for Multi-Task Offline Reinforcement Learning0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions0
Confidence-Conditioned Value Functions for Offline Reinforcement Learning0
Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits0
A Validation Tool for Designing Reinforcement Learning Environments0
Show:102550
← PrevPage 22 of 76Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified