| Long Term Memory Network for Combinatorial Optimization Problems | Jan 1, 2018 | Combinatorial Optimization, Deep Reinforcement Learning | Unverified | 0 |
| Using Deep Reinforcement Learning to Generate Rationales for Molecules | Jan 1, 2018 | Deep Reinforcement Learning, Drug Design | Unverified | 0 |
| Latent forward model for Real-time Strategy game planning with incomplete information | Jan 1, 2018 | Atari Games, Decision Making | Unverified | 0 |
| LSD-Net: Look, Step and Detect for Joint Navigation and Multi-View Recognition with Deep Reinforcement Learning | Jan 1, 2018 | Deep Reinforcement Learning, General Classification | Unverified | 0 |
| Learning to Treat Sepsis with Multi-Output Gaussian Process Deep Recurrent Q-Networks | Jan 1, 2018 | Deep Reinforcement Learning, Gaussian Processes | Unverified | 0 |
| A dynamic game approach to training robust deep policies | Jan 1, 2018 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| PARAMETRIZED DEEP Q-NETWORKS LEARNING: PLAYING ONLINE BATTLE ARENA WITH DISCRETE-CONTINUOUS HYBRID ACTION SPACE | Jan 1, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Domain Adaptation for Deep Reinforcement Learning in Visually Distinct Games | Jan 1, 2018 | Deep Reinforcement Learning, Domain Adaptation | Unverified | 0 |
| Avoiding Catastrophic States with Intrinsic Fear | Jan 1, 2018 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Do Deep Reinforcement Learning Algorithms really Learn to Navigate? | Jan 1, 2018 | Deep Reinforcement Learning, Navigate | Unverified | 0 |
| A Hierarchical Model for Device Placement | Jan 1, 2018 | Deep Reinforcement Learning, Machine Translation | Unverified | 0 |
| Learning Robust Rewards with Adverserial Inverse Reinforcement Learning | Jan 1, 2018 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger | Dec 23, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Dec 22, 2017 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 0 |
| A Deep Policy Inference Q-Network for Multi-Agent Systems | Dec 21, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Neuroevolution: Genetic Algorithms Are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning | Dec 18, 2017 | Deep Reinforcement Learning, Evolutionary Algorithms | Code Available | 0 |
| ES Is More Than Just a Traditional Finite-Difference Approximator | Dec 18, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Towards a Deep Reinforcement Learning Approach for Tower Line Wars | Dec 17, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| AI2-THOR: An Interactive 3D Environment for Visual AI | Dec 14, 2017 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 |
| Multi-focus Attention Network for Efficient Deep Reinforcement Learning | Dec 13, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning Boosted by External Knowledge | Dec 12, 2017 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Simulated Autonomous Driving on Realistic Road Networks using Deep Reinforcement Learning | Dec 12, 2017 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| Learning Robust Dialog Policies in Noisy Environments | Dec 11, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Robust Deep Reinforcement Learning with Adversarial Attacks | Dec 11, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments | Dec 11, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Reinforced dynamics for enhanced sampling in large atomic and molecular systems | Dec 10, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| A Novel Model for Arbitration between Planning and Habitual Control Systems | Dec 6, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| A Deeper Look at Experience Replay | Dec 4, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Towards the Use of Deep Reinforcement Learning with Global Policy for Query-based Extractive Summarisation | Dec 1, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Uncertainty Estimates for Efficient Neural Network-based Dialogue Policy Optimisation | Nov 30, 2017 | Deep Reinforcement Learning, Dialogue Management | Unverified | 0 |
| Improved Learning in Evolution Strategies via Sparser Inter-Agent Network Topologies | Nov 30, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control | Nov 30, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| End-to-End Optimization of Task-Oriented Dialogue Model with Deep Reinforcement Learning | Nov 29, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Automating Vehicles by Deep Reinforcement Learning using Task Separation with Hill Climbing | Nov 29, 2017 | Autonomous Driving, Deep Reinforcement Learning | Unverified | 0 |
| A Benchmarking Environment for Reinforcement Learning Based Task Oriented Dialogue Management | Nov 29, 2017 | Benchmarking, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for De-Novo Drug Design | Nov 29, 2017 | Deep Reinforcement Learning, Drug Design | Code Available | 0 |
| Deep Reinforcement Learning for Sepsis Treatment | Nov 27, 2017 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| AI Safety Gridworlds | Nov 27, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Population Based Training of Neural Networks | Nov 27, 2017 | Deep Reinforcement Learning, Machine Translation | Code Available | 1 |
| Divide-and-Conquer Reinforcement Learning | Nov 27, 2017 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning | Nov 25, 2017 | Deep Reinforcement Learning, Holdout Set | Unverified | 0 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-control, Continuous Control | Code Available | 1 |
| Asking the Difficult Questions: Goal-Oriented Visual Question Generation via Intermediate Rewards | Nov 21, 2017 | Deep Reinforcement Learning, Informativeness | Unverified | 0 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | Classification, Classification with Costly Features | Code Available | 0 |
| Implementing the Deep Q-Network | Nov 20, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |