| AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning | Aug 7, 2023 | Offline RLreinforcement-learning | CodeCode Available | 2 |
| Integrating Offline Reinforcement Learning with Transformers for Sequential Recommendation | Jul 26, 2023 | Offline RLreinforcement-learning | —Unverified | 0 |
| Contrastive Example-Based Control | Jul 24, 2023 | Offline RL | CodeCode Available | 0 |
| A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning | Jul 24, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| On the Effectiveness of Offline RL for Dialogue Response Generation | Jul 23, 2023 | Offline RLreinforcement-learning | CodeCode Available | 0 |
| Model-based Offline Reinforcement Learning with Count-based Conservatism | Jul 21, 2023 | D4RLOffline RL | CodeCode Available | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Self-Assembling Artificial Neural Networks through Neural Developmental Programs | Jul 17, 2023 | Offline RL | CodeCode Available | 1 |
| Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning | Jul 13, 2023 | BenchmarkingOffline RL | CodeCode Available | 1 |
| Budgeting Counterfactual for Offline RL | Jul 12, 2023 | counterfactualCounterfactual Reasoning | —Unverified | 0 |