| Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network | Nov 1, 2020 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Bayes-Adaptive Deep Model-Based Policy Optimisation | Oct 29, 2020 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |
| Planning with Exploration: Addressing Dynamics Bottleneck in Model-based Reinforcement Learning | Oct 24, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving the Exploration of Deep Reinforcement Learning in Continuous Domains using Planning for Policy Search | Oct 24, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Error Bounds of Imitating Policies and Environments | Oct 22, 2020 | Imitation LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Safety Verification of Model Based Reinforcement Learning Controllers | Oct 21, 2020 | Autonomous Drivingmodel | —Unverified | 0 |
| Uncertainty-aware Contact-safe Model-based Reinforcement Learning | Oct 16, 2020 | Contact-rich ManipulationModel-based Reinforcement Learning | —Unverified | 0 |
| Model-Based Reinforcement Learning for Type 1Diabetes Blood Glucose Control | Oct 13, 2020 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforcement Learning on Computational Resource Allocation of Cloud-based Wireless Networks | Oct 10, 2020 | CPUManagement | —Unverified | 0 |
| Trust the Model When It Is Confident: Masked Model-based Actor-Critic | Oct 10, 2020 | continuous-controlContinuous Control | —Unverified | 0 |