| A Strong Baseline for Batch Imitation Learning | Feb 6, 2023 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| A Survey of Zero-shot Generalisation in Deep Reinforcement Learning | Nov 18, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| A Survey on Model-based Reinforcement Learning | Jun 19, 2022 | Decision Makingmodel | —Unverified | 0 | 0 |
| A Fast Convergence Theory for Offline Decision Making | Jun 3, 2024 | Decision MakingOffline RL | —Unverified | 0 | 0 |
| Augmenting Offline RL with Unlabeled Data | Jun 11, 2024 | Offline RLTransfer Learning | —Unverified | 0 | 0 |
| Automatic Trade-off Adaptation in Offline RL | Jun 16, 2023 | Offline RL | —Unverified | 0 | 0 |
| A Validation Tool for Designing Reinforcement Learning Environments | Dec 10, 2021 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |
| Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation | Dec 16, 2020 | Deep Reinforcement LearningDistributional Reinforcement Learning | —Unverified | 0 | 0 |
| Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | May 18, 2023 | MuJoCoOffline RL | —Unverified | 0 | 0 |
| BCRLSP: An Offline Reinforcement Learning Framework for Sequential Targeted Promotion | Jul 16, 2022 | Offline RLreinforcement-learning | —Unverified | 0 | 0 |