SOTAVerified

Offline RL

Papers

Showing 276300 of 755 papers

TitleStatusHype
Reinforcement Learning: An Overview0
Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting0
Improving Dynamic Object Interactions in Text-to-Video Generation with AI Feedback0
Robust Offline Reinforcement Learning with Linearly Structured f-Divergence Regularization0
LLM-Based Offline Learning for Embodied Agents via Consistency-Guided Reward Ensemble0
PROGRESSOR: A Perceptually Guided Reward Estimator with Self-Supervised Online Refinement0
Continual Task Learning through Adaptive Policy Self-CompositionCode0
Preserving Expert-Level Privacy in Offline Reinforcement Learning0
Navigation with QPHIL: Quantizing Planner for Hierarchical Implicit Q-Learning0
Streetwise Agents: Empowering Offline RL Policies to Outsmart Exogenous Stochastic Disturbances in RTC0
OffLight: An Offline Multi-Agent Reinforcement Learning Framework for Traffic Signal Control0
Real-World Offline Reinforcement Learning from Vision Language Model Feedback0
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning0
Uncertainty-based Offline Variational Bayesian Reinforcement Learning for Robustness under Diverse Data CorruptionsCode0
Offline Reinforcement Learning and Sequence Modeling for Downlink Link Adaptation0
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network SimulationCode0
Learning Versatile Skills with Curriculum MaskingCode0
Offline reinforcement learning for job-shop scheduling problems0
Solving Continual Offline RL through Selective Weights Activation on Aligned Spaces0
Off-dynamics Conditional Diffusion Planners0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement LearningCode0
Diffusion-Based Offline RL for Improved Decision-Making in Augmented ARC Task0
DIAR: Diffusion-model-guided Implicit Q-learning with Adaptive Revaluation0
Multi-Objective-Optimization Multi-AUV Assisted Data Collection Framework for IoUT Based on Offline Reinforcement Learning0
Integrating Reinforcement Learning and Large Language Models for Crop Production Process Management Optimization and Control through A New Knowledge-Based Deep Learning Paradigm0
Show:102550
← PrevPage 12 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1KFCAverage Reward81.8Unverified
2ADMPOAverage Reward81Unverified
3Decision Transformer (DT)Average Reward73.5Unverified
#ModelMetricClaimedVerifiedStatus
1ParPID4RL Normalized Score151.4Unverified