| DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning | Oct 9, 2023 | D4RL, Offline RL | Code Available | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Oct 9, 2023 | D4RL, Model-based Reinforcement Learning | Unverified | 0 |
| Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets | Oct 6, 2023 | D4RL, Decision Making | Code Available | 1 |
| Pre-training with Synthetic Data Helps Offline Reinforcement Learning | Oct 1, 2023 | D4RL, Deep Reinforcement Learning | Code Available | 0 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | Sep 16, 2023 | D4RL, model | Unverified | 0 |
| Reasoning with Latent Diffusion in Offline Reinforcement Learning | Sep 12, 2023 | D4RL, Offline RL | Code Available | 1 |
| Multi-Objective Decision Transformers for Offline Reinforcement Learning | Aug 31, 2023 | D4RL, Offline RL | Unverified | 0 |
| Statistically Efficient Variance Reduction with Double Policy Estimation for Off-Policy Evaluation in Sequence-Modeled Reinforcement Learning | Aug 28, 2023 | D4RL, Off-policy evaluation | Unverified | 0 |
| Learning Computational Efficient Bots with Costly Features | Aug 18, 2023 | Computational Efficiency, D4RL | Unverified | 0 |
| Offline Reinforcement Learning with On-Policy Q-Function Regularization | Jul 25, 2023 | D4RL, reinforcement-learning | Unverified | 0 |