| Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning | Sep 12, 2024 | D4RLOffline RL | —Unverified | 0 |
| The Role of Deep Learning Regularizations on Actors in Offline RL | Sep 11, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Forward KL Regularized Preference Optimization for Aligning Diffusion Policies | Sep 9, 2024 | D4RLDecision Making | —Unverified | 0 |
| SUMO: Search-Based Uncertainty Estimation for Model-Based Offline Reinforcement Learning | Aug 23, 2024 | D4RLOffline RL | —Unverified | 0 |
| Offline Model-Based Reinforcement Learning with Anti-Exploration | Aug 20, 2024 | D4RLmodel | —Unverified | 0 |
| SelfBC: Self Behavior Cloning for Offline Reinforcement Learning | Aug 4, 2024 | AttributeD4RL | —Unverified | 0 |
| Diffusion Models as Optimizers for Efficient Planning in Offline RL | Jul 23, 2024 | D4RLDecision Making | CodeCode Available | 0 |
| Offline Reinforcement Learning with Imputed Rewards | Jul 15, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Jul 12, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning | Jun 30, 2024 | D4RLOffline RL | —Unverified | 0 |
| Binary Reward Labeling: Bridging Offline Preference and Reward-Based Reinforcement Learning | Jun 14, 2024 | D4RLOffline RL | —Unverified | 0 |
| DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets | Jun 13, 2024 | D4RLOffline RL | —Unverified | 0 |
| Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning | Jun 12, 2024 | D4RLMuJoCo | CodeCode Available | 0 |
| CDSA: Conservative Denoising Score-based Algorithm for Offline Reinforcement Learning | Jun 11, 2024 | D4RLDenoising | —Unverified | 0 |
| PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Jun 10, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Stabilizing Extreme Q-learning by Maclaurin Expansion | Jun 7, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Strategically Conservative Q-Learning | Jun 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| UDQL: Bridging The Gap between MSE Loss and The Optimal Value Function in Offline Reinforcement Learning | Jun 5, 2024 | D4RLOffline RL | —Unverified | 0 |
| Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling | May 31, 2024 | D4RLMamba | —Unverified | 0 |
| Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning | May 31, 2024 | D4RLReinforcement Learning (RL) | CodeCode Available | 1 |
| In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | May 31, 2024 | D4RLDecision Making | CodeCode Available | 1 |
| Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | May 30, 2024 | D4RLDecision Making | —Unverified | 0 |
| Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models | May 30, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning | May 30, 2024 | D4RLreinforcement-learning | CodeCode Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RLDenoising | CodeCode Available | 1 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Q-value Regularized Transformer for Offline Reinforcement Learning | May 27, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation | May 23, 2024 | D4RLDecision Making | CodeCode Available | 0 |
| State-Constrained Offline Reinforcement Learning | May 23, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| Task-agnostic Decision Transformer for Multi-type Agent Control with Federated Split Training | May 22, 2024 | AI AgentAutonomous Driving | —Unverified | 0 |
| Towards Robust Policy: Enhancing Offline Reinforcement Learning with Adversarial Attacks and Defenses | May 18, 2024 | D4RLOffline RL | —Unverified | 0 |
| Reinformer: Max-Return Sequence Modeling for Offline RL | May 14, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Decision Mamba Architectures | May 13, 2024 | D4RLImitation Learning | CodeCode Available | 0 |
| Improving Offline Reinforcement Learning with Inaccurate Simulators | May 7, 2024 | D4RLGenerative Adversarial Network | —Unverified | 0 |
| Offline Trajectory Generalization for Offline Reinforcement Learning | Apr 16, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Regularized Conditional Diffusion Model for Multi-Task Preference Alignment | Apr 7, 2024 | D4RLDecision Making | —Unverified | 0 |
| Compositional Conservatism: A Transductive Approach in Offline Reinforcement Learning | Apr 6, 2024 | D4RLOffline RL | CodeCode Available | 0 |
| Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning | Apr 3, 2024 | D4RLreinforcement-learning | CodeCode Available | 0 |
| Simple Ingredients for Offline Reinforcement Learning | Mar 19, 2024 | D4RLreinforcement-learning | —Unverified | 0 |
| A2PO: Towards Effective Offline Reinforcement Learning from an Advantage-aware Perspective | Mar 12, 2024 | D4RLreinforcement-learning | CodeCode Available | 0 |
| Align Your Intents: Offline Imitation Learning via Optimal Transport | Feb 20, 2024 | D4RLDecision Making | —Unverified | 0 |
| SEABO: A Simple Search-Based Method for Offline Imitation Learning | Feb 6, 2024 | D4RLImitation Learning | CodeCode Available | 1 |
| Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning | Feb 6, 2024 | D4RLOffline RL | CodeCode Available | 1 |
| Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning | Feb 5, 2024 | D4RLQ-Learning | —Unverified | 0 |
| Deep autoregressive density nets vs neural ensembles for model-based offline reinforcement learning | Feb 5, 2024 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Contrastive Diffuser: Planning Towards High Return States via Contrastive Learning | Feb 5, 2024 | Contrastive LearningD4RL | —Unverified | 0 |
| DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching | Feb 4, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Augmenting Offline Reinforcement Learning with State-only Interactions | Feb 1, 2024 | D4RLData Augmentation | —Unverified | 0 |
| Context-Former: Stitching via Latent Conditioned Sequence Modeling | Jan 29, 2024 | D4RLDecision Making | —Unverified | 0 |