SOTAVerified

Offline RL

Papers

Showing 101150 of 755 papers

TitleStatusHype
Doubly Mild Generalization for Offline Reinforcement LearningCode1
Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC0
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback0
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning0
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsCode0
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationCode0
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation0
LongReward: Improving Long-context Large Language Models with AI FeedbackCode2
Offline Reinforcement Learning with OOD State Correction and OOD Action SuppressionCode1
Learning Versatile Skills with Curriculum MaskingCode0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
Offline reinforcement learning for job-shop scheduling problems0
Steering Your Generalists: Improving Robotic Foundation Models via Value GuidanceCode1
Off-dynamics Conditional Diffusion Planners0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement LearningCode0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm0
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare0
The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability0
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization0
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model PretrainingCode1
DMC-VB: A Benchmark for Representation Learning for Control with Visual DistractorsCode1
OffRIPP: Offline RL-based Informative Path Planning0
Development and Validation of Heparin Dosing Policies Using an Offline Reinforcement Learning Algorithm0
KAN v.s. MLP for Offline Reinforcement Learning0
Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention0
The Role of Deep Learning Regularizations on Actors in Offline RLCode0
Tractable Offline Learning of Regular Decision Processes0
Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy OptimizationCode2
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning0
Optimization Solution Functions as Deterministic Policies for Offline Reinforcement Learning0
Unsupervised-to-Online Reinforcement Learning0
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning0
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement LearningCode0
Domain Adaptation for Offline Reinforcement Learning with Limited Samples0
Preference-Guided Reflective Sampling for Aligning Language ModelsCode0
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning BenchmarksCode2
Offline Model-Based Reinforcement Learning with Anti-Exploration0
Integrating Multi-Modal Input Token Mixer Into Mamba-Based Decision Models: Decision MetaMamba0
Enhancing Reinforcement Learning Through Guided Search0
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds0
D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning0
Experimental evaluation of offline reinforcement learning for HVAC control in buildingsCode0
Hybrid Reinforcement Learning Breaks Sample Size Barriers in Linear MDPs0
Consistent time travel for realistic interactions with historical data: reinforcement learning for market making0
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning0
Show:102550
← PrevPage 3 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified