SOTAVerified

Offline RL

Papers

Showing 101125 of 755 papers

TitleStatusHype
Doubly Mild Generalization for Offline Reinforcement LearningCode1
Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC0
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback0
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning0
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsCode0
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationCode0
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation0
LongReward: Improving Long-context Large Language Models with AI FeedbackCode2
Offline Reinforcement Learning with OOD State Correction and OOD Action SuppressionCode1
Learning Versatile Skills with Curriculum MaskingCode0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
Offline reinforcement learning for job-shop scheduling problems0
Steering Your Generalists: Improving Robotic Foundation Models via Value GuidanceCode1
Off-dynamics Conditional Diffusion Planners0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement LearningCode0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm0
Offline Inverse Constrained Reinforcement Learning for Safe-Critical Decision Making in Healthcare0
The Smart Buildings Control Suite: A Diverse Open Source Benchmark to Evaluate and Scale HVAC Control Policies for Sustainability0
ComaDICE: Offline Cooperative Multi-Agent Reinforcement Learning with Stationary Distribution Shift Regularization0
Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model PretrainingCode1
DMC-VB: A Benchmark for Representation Learning for Control with Visual DistractorsCode1
Show:102550
← PrevPage 5 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified