SOTAVerified

Offline RL

Papers

Showing 601650 of 755 papers

TitleStatusHype
Uncertainty Estimation Using Riemannian Model~Dynamics for Offline Reinforcement Learning0
Generalize by Touching: Tactile Ensemble Skill Transfer for Robotic Furniture Assembly0
Generative Probabilistic Planning for Optimizing Supply Chain Networks0
GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning0
Goal-Conditioned Data Augmentation for Offline Reinforcement Learning0
Learning Goal-Conditioned Policies from Sub-Optimal Offline Data via Metric Learning0
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning0
Graph Decision Transformer0
GriddlyJS: A Web IDE for Reinforcement Learning0
Guided Data Augmentation for Offline Reinforcement Learning and Imitation Learning0
H2O+: An Improved Framework for Hybrid Offline-and-Online RL with Dynamics Gaps0
Harnessing Density Ratios for Online Reinforcement Learning0
H-GAP: Humanoid Control with a Generalist Planner0
How to Leverage Unlabeled Data in Offline Reinforcement Learning0
How to Spend Your Robot Time: Bridging Kickstarting and Offline Reinforcement Learning for Vision-based Robotic Manipulation0
Human-centric Dialog Training via Offline Reinforcement Learning0
Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance0
Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
Hyperparameter Selection for Offline Reinforcement Learning0
Implicit Offline Reinforcement Learning via Supervised Learning0
Importance of Empirical Sample Complexity Analysis for Offline Reinforcement Learning0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Reinforcement Learning0
Improving Multi-Step Reasoning Abilities of Large Language Models with Direct Advantage Policy Optimization0
Improving Offline Reinforcement Learning with Inaccurate Simulators0
Improving Offline RL by Blending Heuristics0
Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions0
InferNet for Delayed Reinforcement Tasks: Addressing the Temporal Credit Assignment Problem0
Iteratively Refined Behavior Regularization for Offline Reinforcement Learning0
Instabilities of Offline RL with Pre-Trained Neural Representation0
Instructed Diffuser with Temporal Condition Guidance for Offline Reinforcement Learning0
Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning0
Integrating Domain Knowledge for handling Limited Data in Offline RL0
Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba0
Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm0
IntelliLung: Advancing Safe Mechanical Ventilation using Offline RL with Hybrid Actions and Clinically Aligned Rewards0
Interpretable performance analysis towards offline reinforcement learning: A dataset perspective0
Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory0
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control0
Is Conditional Generative Modeling all you need for Decision-Making?0
Is Inverse Reinforcement Learning Harder than Standard Reinforcement Learning? A Theoretical Perspective0
Is Pessimism Provably Efficient for Offline RL?0
KAN v.s. MLP for Offline Reinforcement Learning0
Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL0
Koopman Q-learning: Offline Reinforcement Learning via Symmetries of Dynamics0
Language-Conditioned Offline RL for Multi-Robot Navigation0
Large Language Model driven Policy Exploration for Recommender Systems0
Large-Scale Retrieval for Reinforcement Learning0
Show:102550
← PrevPage 13 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified