| Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL | Jun 8, 2024 | Data Augmentation, Mamba | Code Available | 0 | 5 |
| On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency | Mar 3, 2022 | Offline RL, reinforcement-learning | Code Available | 0 | 5 |
| From Novelty to Imitation: Self-Distilled Rewards for Offline Reinforcement Learning | Jul 17, 2025 | D4RL, Offline RL | Unverified | 0 | 0 |
| FOSP: Fine-tuning Offline Safe Policy through World Models | Jul 6, 2024 | Model-based Reinforcement Learning, Offline RL | Unverified | 0 | 0 |
| Contrastive Value Learning: Implicit Models for Simple Offline RL | Nov 3, 2022 | continuous-control, Continuous Control | Unverified | 0 | 0 |
| Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | Feb 6, 2025 | Dataset Generation, MuJoCo | Unverified | 0 | 0 |
| Flow-Based Single-Step Completion for Efficient and Expressive Policy Learning | Jun 26, 2025 | Action Generation, Decision Making | Unverified | 0 | 0 |
| Flexible Blood Glucose Control: Offline Reinforcement Learning from Human Feedback | Jan 27, 2025 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Contrastive Learning as Goal-Conditioned Reinforcement Learning | Jun 15, 2022 | Contrastive Learning, Data Augmentation | Unverified | 0 | 0 |
| BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning | Jul 15, 2024 | Model-based Reinforcement Learning, Offline RL | Unverified | 0 | 0 |
| Finetuning Offline World Models in the Real World | Oct 24, 2023 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions | Mar 30, 2023 | Diversity, Offline RL | Unverified | 0 | 0 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive Learning, D4RL | Unverified | 0 | 0 |
| Finer Behavioral Foundation Models via Auto-Regressive Features and Advantage Weighting | Dec 5, 2024 | D4RL, Offline RL | Unverified | 0 | 0 |
| Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching | Jun 24, 2023 | Imitation Learning, Offline RL | Unverified | 0 | 0 |
| BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion | Jul 16, 2022 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices | Feb 8, 2024 | Federated Learning, Offline RL | Unverified | 0 | 0 |
| Federated Offline Reinforcement Learning | Jun 11, 2022 | Offline RL, Privacy Preserving | Unverified | 0 | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RL, Meta Reinforcement Learning | Unverified | 0 | 0 |
| Feasibility-Aware Pessimistic Estimation: Toward Long-Horizon Safety in Offline RL | May 13, 2025 | Offline RL, Safe Reinforcement Learning | Unverified | 0 | 0 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RL, Decision Making | Unverified | 0 | 0 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | May 18, 2023 | MuJoCo, Offline RL | Unverified | 0 | 0 |
| AdaCred: Adaptive Causal Decision Transformers with Feature Crediting | Dec 19, 2024 | Attribute, Imitation Learning | Unverified | 0 | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCo, Offline RL | Unverified | 0 | 0 |
| Exploring the Potential of Offline RL for Reasoning in LLMs: A Preliminary Study | May 4, 2025 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Exploiting Generalization in Offline Reinforcement Learning via Unseen State Augmentations | Aug 7, 2023 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Constraints Penalized Q-learning for Safe Offline Reinforcement Learning | Jul 19, 2021 | Offline RL, Q-Learning | Unverified | 0 | 0 |
| Exclusively Penalized Q-learning for Offline Reinforcement Learning | May 23, 2024 | Offline RL, Q-Learning | Unverified | 0 | 0 |
| Evaluation-Time Policy Switching for Offline Reinforcement Learning | Mar 15, 2025 | Behavioural cloning, Offline RL | Unverified | 0 | 0 |
| Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Dec 6, 2023 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Equivariant Offline Reinforcement Learning | Jun 20, 2024 | Offline RL, Q-Learning | Unverified | 0 | 0 |
| Equivariant Data Augmentation for Generalization in Offline Reinforcement Learning | Sep 14, 2023 | Data Augmentation, Offline RL | Unverified | 0 | 0 |
| Environment Transformer and Policy Optimization for Model-Based Offline Reinforcement Learning | Mar 7, 2023 | Continuous Control, Offline RL | Unverified | 0 | 0 |
| Ensemble Successor Representations for Task Generalization in Offline-to-Online Reinforcement Learning | May 12, 2024 | Offline RL, Reinforcement Learning (RL) | Unverified | 0 | 0 |
| Conservative Data Sharing for Multi-Task Offline Reinforcement Learning | Sep 16, 2021 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement Learning, Distributional Reinforcement Learning | Unverified | 0 | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RL, Decision Making | Unverified | 0 | 0 |
| Achieving Fairness in Multi-Agent Markov Decision Processes Using Reinforcement Learning | Jun 1, 2023 | Fairness, Offline RL | Unverified | 0 | 0 |
| ENOTO: Improving Offline-to-Online Reinforcement Learning with Q-Ensembles | Jun 12, 2023 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Confidence-Conditioned Value Functions for Offline Reinforcement Learning | Dec 8, 2022 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Enhancing Reinforcement Learning Through Guided Search | Aug 19, 2024 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Enhancing Pre-Trained Decision Transformers with Prompt-Tuning Bandits | Feb 7, 2025 | Informativeness, Offline RL | Unverified | 0 | 0 |
| A Validation Tool for Designing Reinforcement Learning Environments | Dec 10, 2021 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective | Feb 17, 2025 | Bayesian Optimization, model | Unverified | 0 | 0 |
| Enhancing Cross-domain Pre-Trained Decision Transformers with Adaptive Attention | Sep 11, 2024 | Offline RL | Unverified | 0 | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | Denoising, Imitation Learning | Unverified | 0 | 0 |
| Energy-Weighted Flow Matching for Offline Reinforcement Learning | Mar 6, 2025 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | model, MuJoCo | Unverified | 0 | 0 |
| Automatic Trade-off Adaptation in Offline RL | Jun 16, 2023 | Offline RL | Unverified | 0 | 0 |
| End-to-end Offline Reinforcement Learning for Glycemia Control | Oct 16, 2023 | Offline RL, reinforcement-learning | Unverified | 0 | 0 |