| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning | Nov 25, 2017 | Deep Reinforcement Learning, Holdout Set | Unverified | 0 |
| Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards | Nov 21, 2017 | Deep Reinforcement Learning, Informativeness | Unverified | 0 |
| Teaching a Machine to Read Maps with Deep Reinforcement Learning | Nov 20, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Implementing the Deep Q-Network | Nov 20, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling | Nov 20, 2017 | CPU, Deep Reinforcement Learning | Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 |
| Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning | Nov 18, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| Integrating User and Agent Models: A Deep Task-Oriented Dialogue System | Nov 10, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation | Nov 10, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Nov 7, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning | Nov 4, 2017 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning | Nov 2, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Paraphrase Generation with Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement Learning, Paraphrase Generation | Unverified | 0 |
| Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning | Nov 1, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Intelligent Parameter Tuning in Optimization-based Iterative CT Reconstruction via Deep Reinforcement Learning | Nov 1, 2017 | CT Reconstruction, Deep Reinforcement Learning | Unverified | 0 |
| TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning | Oct 31, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Automata-Guided Hierarchical Reinforcement Learning for Skill Composition | Oct 31, 2017 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Visualizing and Understanding Atari Agents | Oct 31, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Regret Minimization for Partially Observable Deep Reinforcement Learning | Oct 31, 2017 | counterfactual, Deep Reinforcement Learning | Code Available | 0 |
| Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach | Oct 30, 2017 | Deep Reinforcement Learning, Position | Code Available | 0 |
| Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning | Oct 28, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |