| Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning | Jan 18, 2018 | Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 0 | 5 |
| Discriminative Deep Dyna-Q: Robust Planning for Dialogue Policy Learning | Aug 28, 2018 | Task-Completion Dialogue Policy Learning | Code Available | 0 | 5 |
| Switch-based Active Deep Dyna-Q: Efficient Adaptive Planning for Task-Completion Dialogue Policy Learning | Nov 19, 2018 | Active Learning, Q-Learning | Code Available | 0 | 5 |
| Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning | Oct 31, 2017 | Task-Completion Dialogue Policy Learning | Unverified | 0 | 0 |
| Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning | Apr 10, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 | 0 |
| Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network | Nov 1, 2020 | Model-based Reinforcement Learning, reinforcement-learning | Unverified | 0 | 0 |