| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Feb 10, 2025 | CPU, D4RL | Code Available | 1 |
| Are Expressive Models Truly Necessary for Offline RL? | Dec 15, 2024 | D4RL, Offline RL | Code Available | 1 |
| M^3PC: Test-time Model Predictive Control for Pretrained Masked Trajectory Model | Dec 7, 2024 | D4RL, model | Code Available | 1 |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Jul 12, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer | Jun 10, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| Strategically Conservative Q-Learning | Jun 6, 2024 | D4RL, Offline RL | Code Available | 1 |
| Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning | May 31, 2024 | D4RL, Reinforcement Learning (RL) | Code Available | 1 |
| In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | May 31, 2024 | D4RL, Decision Making | Code Available | 1 |
| Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning | May 30, 2024 | D4RL, reinforcement-learning | Code Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RL, Denoising | Code Available | 1 |