| Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-control, Continuous Control | Code Available | 2 | 5 |
| Enhancing Sample Efficiency and Exploration in Reinforcement Learning through the Integration of Diffusion Models and Proximal Policy Optimization | Sep 2, 2024 | Diversity, Offline RL | Code Available | 2 | 5 |
| Offline RL for Natural Language Generation with Implicit Language Q Learning | Jun 5, 2022 | Language Modelling, Offline RL | Code Available | 2 | 5 |
| All You Need Is Supervised Learning: From Imitation Learning to Meta-RL With Upside Down RL | Feb 24, 2022 | All, Imitation Learning | Code Available | 1 | 5 |
| Alleviating Matthew Effect of Offline Reinforcement Learning in Interactive Recommendation | Jul 10, 2023 | Decision Making, Interactive Recommendation | Code Available | 1 | 5 |
| Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning | Dec 25, 2024 | Decision Making, Offline RL | Code Available | 1 | 5 |
| Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning | Mar 9, 2023 | Offline RL, Q-Learning | Code Available | 1 | 5 |
| AdaCat: Adaptive Categorical Discretization for Autoregressive Models | Aug 3, 2022 | Density Estimation, Offline RL | Code Available | 1 | 5 |
| Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning | Sep 29, 2023 | Image Generation, Offline RL | Code Available | 1 | 5 |
| COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation | Apr 19, 2022 | Offline RL, Off-policy evaluation | Code Available | 1 | 5 |