SOTAVerified

Offline RL

Papers

Showing 351400 of 755 papers

TitleStatusHype
Representation Learning in Deep RL via Discrete Information Bottleneck0
Representation Matters: Offline Pretraining for Sequential Decision Making0
Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning0
Rethinking Decision Transformer via Hierarchical Reinforcement Learning0
Revisiting Design Choices in Offline Model Based Reinforcement Learning0
Offline Reinforcement Learning via Linear-Programming with Error-Bound Induced Constraints0
Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning0
Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning0
Reward Shifting for Optimistic Exploration and Conservative Exploitation0
Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning0
Robotic Offline RL from Internet Videos via Value-Function Pre-Training0
Robust Bandwidth Estimation for Real-Time Communication with Offline Reinforcement Learning0
Robust Decision Transformer: Tackling Data Corruption in Offline RL via Sequence Modeling0
Robust Offline Reinforcement Learning from Low-Quality Data0
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation0
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization0
S4RL: Surprisingly Simple Self-Supervision for Offline Reinforcement Learning0
Safety-aware Causal Representation for Trustworthy Offline Reinforcement Learning in Autonomous Driving0
Scaling Offline RL via Efficient and Expressive Shortcut Models0
Scaling Vision-and-Language Navigation With Offline RL0
Selective Uncertainty Propagation in Offline RL0
Self-Confirming Transformer for Belief-Conditioned Adaptation in Offline Multi-Agent Reinforcement Learning0
Self-Driving Telescopes: Autonomous Scheduling of Astronomical Observation Campaigns with Offline Reinforcement Learning0
Self-Play with Adversarial Critic: Provable and Scalable Offline Alignment for Language Models0
Semi-gradient DICE for Offline Constrained Reinforcement Learning0
Semi-supervised Offline Reinforcement Learning with Pre-trained Decision Transformers0
SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets0
Improving Offline-to-Online Reinforcement Learning with Q Conditioned State Entropy Exploration0
Settling the Communication Complexity for Distributed Offline Reinforcement Learning0
Settling the Sample Complexity of Model-Based Offline Reinforcement Learning0
Should I Run Offline Reinforcement Learning or Behavioral Cloning?0
Showing Your Offline Reinforcement Learning Work: Online Evaluation Budget Matters0
Single-Shot Pruning for Offline Reinforcement Learning0
Data-Incremental Continual Offline Reinforcement Learning0
Skills Regularized Task Decomposition for Multi-task Offline Reinforcement Learning0
SLiC-HF: Sequence Likelihood Calibration with Human Feedback0
Solving Continual Offline Reinforcement Learning with Decision Transformer0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning0
SR-Reward: Taking The Path More Traveled0
State Advantage Weighting for Offline RL0
State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning0
State Regularized Policy Optimization on Data with Dynamics Shift0
Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments0
Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC0
Striving for Simplicity in Off-Policy Deep Reinforcement Learning0
SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning0
Survival Instinct in Offline Reinforcement Learning0
Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning0
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach0
Show:102550
← PrevPage 8 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified