| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RLOffline RL | —Unverified | 0 |
| Accelerating Residual Reinforcement Learning with Uncertainty Estimation | Jun 21, 2025 | D4RLreinforcement-learning | —Unverified | 0 |
| CAWR: Corruption-Averse Advantage-Weighted Regression for Robust Policy Optimization | Jun 18, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| MOORL: A Framework for Integrating Offline-Online Reinforcement Learning | Jun 11, 2025 | D4RLDeep Reinforcement Learning | —Unverified | 0 |
| Policy-Based Trajectory Clustering in Offline Reinforcement Learning | Jun 10, 2025 | ClusteringD4RL | —Unverified | 0 |
| Offline RL with Smooth OOD Generalization in Convex Hull and its Neighborhood | Jun 10, 2025 | Computational EfficiencyD4RL | CodeCode Available | 0 |
| STITCH-OPE: Trajectory Stitching with Guided Diffusion for Off-Policy Evaluation | May 27, 2025 | D4RLDenoising | —Unverified | 0 |
| Learning to Trust Bellman Updates: Selective State-Adaptive Regularization for Offline RL | May 26, 2025 | D4RLOffline RL | CodeCode Available | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLmodel | —Unverified | 0 |
| Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Imagination-Limited Q-Learning for Offline Reinforcement Learning | May 18, 2025 | D4RLQ-Learning | —Unverified | 0 |
| Beyond the Known: Decision Making with Counterfactual Reasoning Decision Transformer | May 14, 2025 | counterfactualCounterfactual Reasoning | CodeCode Available | 0 |
| Pretraining a Shared Q-Network for Data-Efficient Offline Reinforcement Learning | May 9, 2025 | D4RLOffline RL | —Unverified | 0 |
| Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach | May 8, 2025 | D4RLDecision Making | —Unverified | 0 |
| Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning | May 3, 2025 | D4RLOffline RL | —Unverified | 0 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RLMuJoCo | CodeCode Available | 0 |
| An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning | Apr 17, 2025 | D4RLreinforcement-learning | —Unverified | 0 |
| VIPO: Value Function Inconsistency Penalized Offline Reinforcement Learning | Apr 16, 2025 | D4RLOffline RL | —Unverified | 0 |
| Decision SpikeFormer: Spike-Driven Transformer for Decision Making | Apr 4, 2025 | D4RLDecision Making | —Unverified | 0 |
| Model-Based Offline Reinforcement Learning with Adversarial Data Augmentation | Mar 26, 2025 | D4RLData Augmentation | —Unverified | 0 |
| Diverse Transformer Decoding for Offline Reinforcement Learning Using Financial Algorithmic Approaches | Feb 13, 2025 | D4RLOffline RL | —Unverified | 0 |
| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Feb 10, 2025 | CPUD4RL | CodeCode Available | 1 |
| Skill Expansion and Composition in Parameter Space | Feb 9, 2025 | D4RL | CodeCode Available | 2 |
| Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning | Feb 7, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Flow Q-Learning | Feb 4, 2025 | Action GenerationD4RL | CodeCode Available | 3 |
| Learning from Suboptimal Data in Continuous Control via Auto-Regressive Soft Q-Network | Feb 1, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Projection Implicit Q-Learning with Support Constraint for Offline Reinforcement Learning | Jan 15, 2025 | D4RLQ-Learning | —Unverified | 0 |
| DRDT3: Diffusion-Refined Decision Test-Time Training Model | Jan 12, 2025 | D4RLOffline RL | —Unverified | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RLDiversity | —Unverified | 0 |
| SR-Reward: Taking The Path More Traveled | Jan 4, 2025 | D4RLImitation Learning | —Unverified | 0 |
| Goal-Conditioned Data Augmentation for Offline Reinforcement Learning | Dec 29, 2024 | D4RLData Augmentation | —Unverified | 0 |
| ACL-QL: Adaptive Conservative Level in Q-Learning for Offline Reinforcement Learning | Dec 22, 2024 | D4RLQ-Learning | —Unverified | 0 |
| Are Expressive Models Truly Necessary for Offline RL? | Dec 15, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Dec 7, 2024 | D4RLmodel | CodeCode Available | 1 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RLOffline RL | —Unverified | 0 |
| Learning on One Mode: Addressing Multi-Modality in Offline Reinforcement Learning | Dec 4, 2024 | D4RLImitation Learning | CodeCode Available | 0 |
| Enhancing Decision Transformer with Diffusion-Based Trajectory Branch Generation | Nov 18, 2024 | D4RLReinforcement Learning (RL) | —Unverified | 0 |
| Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning | Nov 7, 2024 | D4RLreinforcement-learning | CodeCode Available | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RLGeneral Reinforcement Learning | CodeCode Available | 0 |
| NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation | Oct 30, 2024 | D4RLManagement | CodeCode Available | 0 |
| Offline Behavior Distillation | Oct 30, 2024 | D4RLReinforcement Learning (RL) | CodeCode Available | 0 |
| Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning | Oct 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Q-Distribution guided Q-learning for offline reinforcement learning: Uncertainty penalized Q-value via consistency model | Oct 27, 2024 | D4RLQ-Learning | CodeCode Available | 0 |
| SAMG: State-Action-Aware Offline-to-Online Reinforcement Learning with Offline Model Guidance | Oct 24, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | ClusteringD4RL | —Unverified | 0 |
| Rethinking Optimal Transport in Offline Reinforcement Learning | Oct 17, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RLModel-based Reinforcement Learning | CodeCode Available | 0 |
| Diffusion Model Predictive Control | Oct 7, 2024 | D4RLmodel | —Unverified | 0 |
| KAN v.s. MLP for Offline Reinforcement Learning | Sep 15, 2024 | D4RLKolmogorov-Arnold Networks | —Unverified | 0 |
| Planning Transformer: Long-Horizon Offline Reinforcement Learning with Planning Tokens | Sep 14, 2024 | D4RLreinforcement-learning | —Unverified | 0 |