SOTAVerified

Offline RL

Papers

Showing 51-75 of 755 papers

| Title | Status | Hype |
| --- | --- | --- |
| NeoRL-2: Near Real-World Benchmarks for Offline Reinforcement Learning with Extended Realistic Scenarios | Code | 1 |
| Behaviour Discovery and Attribution for Explainable Reinforcement Learning | | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | | 0 |
| The Pitfalls of Imitation Learning when Actions are Continuous | | 0 |
| Policy Regularization on Globally Accessible States in Cross-Dynamics Reinforcement Learning | | 0 |
| Policy Constraint by Only Support Constraint for Offline Reinforcement Learning | Code | 0 |
| Energy-Weighted Flow Matching for Offline Reinforcement Learning | | 0 |
| What Makes a Good Diffusion Planner for Decision Making? | Code | 2 |
| Scalable Decision-Making in Stochastic Environments through Learned Temporal Abstraction | Code | 0 |
| Yes, Q-learning Helps Offline In-Context RL | | 0 |
| Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | | 0 |
| Which Features are Best for Successor Features? | | 0 |
| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | | 0 |
| Active Advantage-Aligned Online Reinforcement Learning with Offline Data | Code | 0 |
| Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | | 0 |
| OmniRL: In-Context Reinforcement Learning by Large-Scale Meta-Training in Randomized Worlds | | 0 |
| Policy-Guided Causal State Representation for Offline Reinforcement Learning Recommendation | | 0 |
| Flow Q-Learning | Code | 3 |
| GNN-DT: Graph Neural Network Enhanced Decision Transformer for Efficient Optimization in Dynamic Environments | Code | 1 |
| Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning | | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | | 0 |
| Data Center Cooling System Optimization Using Offline Reinforcement Learning | | 0 |
| Fat-to-Thin Policy Optimization: Offline RL with Sparse Policies | Code | 0 |
| Large Language Model driven Policy Exploration for Recommender Systems | | 0 |

Benchmark Results

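| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | KFC | Average Reward | 81.8 | | Unverified |
| 2 | ADMPO | Average Reward | 81 | | Unverified |
| 3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified |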