SOTAVerified

Offline RL

Papers

Showing 501525 of 755 papers

TitleStatusHype
Launchpad: Learning to Schedule Using Offline and Online RL Methods0
Representation Learning for Online and Offline RL in Low-rank MDPs0
Representation Learning in Deep RL via Discrete Information Bottleneck0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning0
Revisiting Design Choices in Offline Model Based Reinforcement Learning0
Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints0
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning0
Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning0
Reward Shifting for Optimistic Exploration and Conservative Exploitation0
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training0
Robust Bandwidth Estimation for Real-Time Communication with Offline Reinforcement Learning0
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling0
Robust Offline Reinforcement Learning from Low-Quality Data0
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving0
Scaling Offline RL via Efficient and Expressive Shortcut Models0
Scaling Vision-and-Language Navigation With Offline RL0
Selective Uncertainty Propagation in Offline RL0
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning0
Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning0
Show:102550
← PrevPage 21 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified