SOTAVerified

Offline RL

Papers

Showing 251275 of 755 papers

TitleStatusHype
Offline RL With Resource Constrained Online DeploymentCode0
Bridging Distributionally Robust Learning and Offline RL: An Approach to Mitigate Distribution Shift and Partial Data CoverageCode0
COPA: Certifying Robust Policies for Offline Reinforcement Learning against Poisoning AttacksCode0
Offline Reinforcement Learning from Datasets with Structured Non-StationarityCode0
BRAC+: Improved Behavior Regularized Actor Critic for Offline Reinforcement LearningCode0
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement LearningCode0
DiffClone: Enhanced Behaviour Cloning in Robotics with Diffusion-Driven Policy LearningCode0
Offline Equilibrium FindingCode0
A Connection between One-Step Regularization and Critic Regularization in Reinforcement LearningCode0
Offline Data Enhanced On-Policy Policy Gradient with Provable GuaranteesCode0
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationCode0
Optimistic Critic Reconstruction and Constrained Fine-Tuning for General Offline-to-Online RLCode0
Beyond Reward: Offline Preference-guided Policy OptimizationCode0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic SpacesCode0
DeepAveragers: Offline Reinforcement Learning by Solving Derived Non-Parametric MDPsCode0
Decision Transformer under Random Frame DroppingCode0
Two-step reinforcement learning for model-free redesign of nonlinear optimal regulatorCode0
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RLCode0
DCUR: Data Curriculum for Teaching via Samples with Reinforcement LearningCode0
Model-based Offline Policy Optimization with Adversarial NetworkCode0
Model-Based Offline Planning with Trajectory PruningCode0
Model-Based Offline Reinforcement Learning with Pessimism-Modulated Dynamics BeliefCode0
MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman OperatorCode0
Mildly Constrained Evaluation Policy for Offline Reinforcement LearningCode0
Model-based Offline Reinforcement Learning with Count-based ConservatismCode0
Show:102550
← PrevPage 11 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified