SOTAVerified

Offline RL

Papers

Showing 126-150 of 755 papers

Title | Status | Hype
OffRIPP: Offline RL-based Informative Path Planning | | 0
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm | | 0
KAN v.s. MLP for Offline Reinforcement Learning | | 0
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | | 0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | | 0
The Role of Deep Learning Regularizations on Actors in Offline RL | Code | 0
Tractable Offline Learning of Regular Decision Processes | | 0
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Code | 2
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning | | 0
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning | | 0
Unsupervised-to-Online Reinforcement Learning | | 0
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | | 0
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning | Code | 0
Domain Adaptation for Offline Reinforcement Learning with Limited Samples | | 0
Preference-Guided Reflective Sampling for Aligning Language Models | Code | 0
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning Benchmarks | Code | 2
Offline Model-Based Reinforcement Learning with Anti-Exploration | | 0
Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba | | 0
Enhancing Reinforcement Learning Through Guided Search | | 0
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds | | 0
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning | | 0
Experimental evaluation of offline reinforcement learning for HVAC control in buildings | Code | 0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs | | 0
Consistent time travel for realistic interactions with historical data: reinforcement learning for market making | | 0
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | KFC | Average Reward | 81.8 | | Unverified
2 | ADMPO | Average Reward | 81 | | Unverified
3 | Decision Transformer (DT) | Average Reward | 73.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ParPI | D4RL Normalized Score | 151.4 | | Unverified