| Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| ES Is More Than Just a Traditional Finite-Difference Approximator | Dec 18, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement LearningEvolutionary Algorithms | CodeCode Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Multi-focus Attention Network for Efficient Deep Reinforcement Learning | Dec 13, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning Boosted by External Knowledge | Dec 12, 2017 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Simulated Autonomous Driving on Realistic Road Networks using Deep Reinforcement Learning | Dec 12, 2017 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments | Dec 11, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Learning Robust Dialog Policies in Noisy Environments | Dec 11, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Reinforced dynamics for enhanced sampling in large atomic and molecular systems | Dec 10, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| A Novel Model for Arbitration between Planning and Habitual Control Systems | Dec 6, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| A Deeper Look at Experience Replay | Dec 4, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Towards the Use of Deep Reinforcement Learning with Global Policy for Query-based Extractive Summarisation | Dec 1, 2017 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control | Nov 30, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Improved Learning in Evolution Strategies via Sparser Inter-Agent Network Topologies | Nov 30, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation | Nov 30, 2017 | Deep Reinforcement LearningDialogue Management | —Unverified | 0 |
| End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning | Nov 29, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management | Nov 29, 2017 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing | Nov 29, 2017 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for De-Novo Drug Design | Nov 29, 2017 | Deep Reinforcement LearningDrug Design | CodeCode Available | 0 |
| AI Safety Gridworlds | Nov 27, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Sepsis Treatment | Nov 27, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning | Nov 25, 2017 | Deep Reinforcement LearningHoldout Set | —Unverified | 0 |
| Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards | Nov 21, 2017 | Deep Reinforcement LearningInformativeness | —Unverified | 0 |
| Teaching a Machine to Read Maps with Deep Reinforcement Learning | Nov 20, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Implementing the Deep Q-Network | Nov 20, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling | Nov 20, 2017 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | ClassificationClassification with Costly Features | CodeCode Available | 0 |
| Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning | Nov 18, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Integrating User and Agent Models: A Deep Task-Oriented Dialogue System | Nov 10, 2017 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation | Nov 10, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Nov 7, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning | Nov 4, 2017 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning | Nov 2, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Paraphrase Generation with Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement LearningParaphrase Generation | —Unverified | 0 |
| Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning | Nov 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Intelligent Parameter Tuning in Optimization-based Iterative CT Reconstruction via Deep Reinforcement Learning | Nov 1, 2017 | CT ReconstructionDeep Reinforcement Learning | —Unverified | 0 |
| TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning | Oct 31, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Automata-Guided Hierarchical Reinforcement Learning for Skill Composition | Oct 31, 2017 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Visualizing and Understanding Atari Agents | Oct 31, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Regret Minimization for Partially Observable Deep Reinforcement Learning | Oct 31, 2017 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach | Oct 30, 2017 | Deep Reinforcement LearningPosition | CodeCode Available | 0 |
| Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning | Oct 28, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |