SOTAVerified

Offline RL

Papers

Showing 626650 of 755 papers

TitleStatusHype
Improving Offline Reinforcement Learning with Inaccurate Simulators0
Improving Offline RL by Blending Heuristics0
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning0
Instabilities of Offline RL with Pre-Trained Neural Representation0
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning0
Integrating Domain Knowledge for handling Limited Data in Offline RL0
Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba0
Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm0
IntelliLung: Advancing Safe Mechanical Ventilation using Offline RL with Hybrid Actions and Clinically Aligned Rewards0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective0
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory0
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control0
Is Conditional Generative Modeling all you need for Decision-Making?0
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective0
Is Pessimism Provably Efficient for Offline RL?0
KAN v.s. MLP for Offline Reinforcement Learning0
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL0
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics0
Language-Conditioned Offline RL for Multi-Robot Navigation0
Large Language Model driven Policy Exploration for Recommender Systems0
Large-Scale Retrieval for Reinforcement Learning0
Show:102550
← PrevPage 26 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified