| Diffusion Actor-Critic: Formulating Constrained Policy Iteration as Diffusion Noise Regression for Offline Reinforcement Learning | May 31, 2024 | D4RL, Reinforcement Learning (RL) | Code Available | 1 |
| In-Context Decision Transformer: Reinforcement Learning via Hierarchical Chain-of-Thought | May 31, 2024 | D4RL, Decision Making | Code Available | 1 |
| Learning from Random Demonstrations: Offline Reinforcement Learning with Importance-Sampled Diffusion Models | May 30, 2024 | D4RL, reinforcement-learning | Unverified | 0 |
| Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning | May 30, 2024 | D4RL, Decision Making | Unverified | 0 |
| Adaptive Advantage-Guided Policy Regularization for Offline Reinforcement Learning | May 30, 2024 | D4RL, reinforcement-learning | Code Available | 1 |
| Diffusion Policies creating a Trust Region for Offline Reinforcement Learning | May 30, 2024 | D4RL, Denoising | Code Available | 1 |
| AlignIQL: Policy Alignment in Implicit Q-Learning through Constrained Optimization | May 28, 2024 | D4RL, Offline RL | Code Available | 0 |
| Q-value Regularized Transformer for Offline Reinforcement Learning | May 27, 2024 | D4RL, Offline RL | Code Available | 1 |
| DIDI: Diffusion-Guided Diversity for Offline Behavioral Generation | May 23, 2024 | D4RL, Decision Making | Code Available | 0 |
| State-Constrained Offline Reinforcement Learning | May 23, 2024 | D4RL, reinforcement-learning | Unverified | 0 |