SOTAVerified

Offline RL

Papers

Showing 401425 of 755 papers

TitleStatusHype
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning0
Diffusion Policies for Out-of-Distribution Generalization in Offline Reinforcement Learning0
Diffusion Self-Weighted Guidance for Offline Reinforcement Learning0
Discovering Multiple Solutions from a Single Task in Offline Reinforcement Learning0
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity0
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation0
Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches0
Domain Adaptation for Offline Reinforcement Learning with Limited Samples0
Domain Generalization for Robust Model-Based Offline Reinforcement Learning0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning0
Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage0
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization0
DRDT3: Diffusion-Refined Decision Test-Time Training Model0
Dual Generator Offline Reinforcement Learning0
Efficient Imitation Learning with Conservative World Models0
Efficient Online RL Fine Tuning with Offline Pre-trained Policy Only0
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL0
Empowering Embodied Visual Tracking with Visual Foundation Models and Offline RL0
Enabling A Network AI Gym for Autonomous Cyber Agents0
End-to-End Offline Goal-Oriented Dialog Policy Learning via Policy Gradient0
End-to-end Offline Reinforcement Learning for Glycemia Control0
Energy-Weighted Flow Matching for Offline Reinforcement Learning0
Enhanced DACER Algorithm with High Diffusion Efficiency0
Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention0
Show:102550
← PrevPage 17 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified