SOTAVerified

Offline RL

Papers

Showing 526550 of 755 papers

TitleStatusHype
Minimax-Optimal Reward-Agnostic Exploration in Reinforcement Learning0
Uncertainty-driven Trajectory Truncation for Data Augmentation in Offline Reinforcement LearningCode0
Unified Emulation-Simulation Training Environment for Autonomous Cyber Agents0
Enabling A Network AI Gym for Autonomous Cyber Agents0
Understanding Reinforcement Learning Algorithms: The Progress from Basic Q-learning to Proximal Policy Optimization0
MAHALO: Unifying Offline Reinforcement Learning and Imitation Learning from ObservationsCode0
Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions0
Deep RL with Hierarchical Action Exploration for Dialogue Generation0
Adaptive Policy Learning for Offline-to-Online Reinforcement Learning0
Deploying Offline Reinforcement Learning with Human Feedback0
Graph Decision Transformer0
Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning0
On the Sample Complexity of Vanilla Model-Based Offline Reinforcement Learning with Dependent Samples0
Learning to Influence Human Behavior with Offline Reinforcement Learning0
Decision Transformer under Random Frame DroppingCode0
Learning to Control Autonomous Fleets from Observation via Offline Reinforcement LearningCode0
The Provable Benefits of Unsupervised Data Sharing for Offline Reinforcement Learning0
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation0
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function ApproximationCode0
Deep Offline Reinforcement Learning for Real-world Treatment Optimization Applications0
Language Decision Transformers with Exponential Tilt for Interactive Text Environments0
A Strong Baseline for Batch Imitation Learning0
Offline Minimax Soft-Q-learning Under Realizability and Partial Coverage0
Selective Uncertainty Propagation in Offline RL0
Revisiting Bellman Errors for Offline Model SelectionCode0
Show:102550
← PrevPage 22 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified