| Teaching a Machine to Read Maps with Deep Reinforcement Learning | Nov 20, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Deep Reinforcement Learning for Multi-Resource Multi-Machine Job Scheduling | Nov 20, 2017 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning | Nov 18, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Nov 15, 2017 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Integrating User and Agent Models: A Deep Task-Oriented Dialogue System | Nov 10, 2017 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Towards the Use of Deep Reinforcement Learning with Global Policy For Query-based Extractive Summarisation | Nov 10, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Can Deep Reinforcement Learning Solve Erdos-Selfridge-Spencer Games? | Nov 7, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning | Nov 4, 2017 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning | Nov 2, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Diagnose: Assimilating Clinical Narratives using Deep Reinforcement Learning | Nov 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Intelligent Parameter Tuning in Optimization-based Iterative CT Reconstruction via Deep Reinforcement Learning | Nov 1, 2017 | CT ReconstructionDeep Reinforcement Learning | —Unverified | 0 |
| Acquiring Target Stacking Skills by Goal-Parameterized Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Paraphrase Generation with Deep Reinforcement Learning | Nov 1, 2017 | Deep Reinforcement LearningParaphrase Generation | —Unverified | 0 |
| Regret Minimization for Partially Observable Deep Reinforcement Learning | Oct 31, 2017 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement Learning | Oct 31, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Visualizing and Understanding Atari Agents | Oct 31, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Automata-Guided Hierarchical Reinforcement Learning for Skill Composition | Oct 31, 2017 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Learning Robust Rewards with Adversarial Inverse Reinforcement Learning | Oct 30, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Predicting Head Movement in Panoramic Video: A Deep Reinforcement Learning Approach | Oct 30, 2017 | Deep Reinforcement LearningPosition | CodeCode Available | 0 |
| Eigenoption Discovery through the Deep Successor Representation | Oct 30, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Diff-DAC: Distributed Actor-Critic for Average Multitask Deep Reinforcement Learning | Oct 28, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Consequentialist conditional cooperation in social dilemmas with imperfect information | Oct 19, 2017 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Asymmetric Actor Critic for Image-Based Robot Learning | Oct 18, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| The Effects of Memory Replay in Reinforcement Learning | Oct 18, 2017 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Map-based Multi-Policy Reinforcement Learning: Enhancing Adaptability of Robots by Deep Reinforcement Learning | Oct 17, 2017 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Oct 16, 2017 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 2 |
| Deep Reinforcement Learning: Framework, Applications, and Embedded Implementations | Oct 10, 2017 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 |
| Rainbow: Combining Improvements in Deep Reinforcement Learning | Oct 6, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 3 |
| Deep Abstract Q-Networks | Oct 2, 2017 | Deep Reinforcement LearningMontezuma's Revenge | —Unverified | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| Attention-Aware Deep Reinforcement Learning for Video Face Recognition | Oct 1, 2017 | Deep Reinforcement LearningFace Recognition | —Unverified | 0 |
| Parameter Sharing Deep Deterministic Policy Gradient for Cooperative Multi-agent Reinforcement Learning | Oct 1, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Vision-based deep execution monitoring | Sep 29, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Self-supervised Deep Reinforcement Learning with Generalized Computation Graphs for Robot Navigation | Sep 29, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations | Sep 28, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep TAMER: Interactive Agent Shaping in High-Dimensional State Spaces | Sep 28, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Towards Optimally Decentralized Multi-Robot Collision Avoidance via Deep Reinforcement Learning | Sep 28, 2017 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 0 |
| Exposure: A White-Box Photo Post-Processing Framework | Sep 27, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 1 |
| Towards continuous control of flippers for a multi-terrain robot using deep reinforcement learning | Sep 25, 2017 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Unmanned Aerial Vehicle Control for Autonomous Target Following | Sep 24, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| OptLayer - Practical Constrained Optimization for Deep Reinforcement Learning in the Real World | Sep 22, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning | Sep 21, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Deep-Reinforcement Learning Approach for Software-Defined Networking Routing Optimization | Sep 20, 2017 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Dexterous Manipulation with Concept Networks | Sep 20, 2017 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Deep Reinforcement Learning for Event-Driven Multi-Agent Decision Processes | Sep 19, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Reinforcement Learning that Matters | Sep 19, 2017 | Atari GamesContinuous Control | CodeCode Available | 0 |
| Iterative Policy Learning in End-to-End Trainable Task-Oriented Neural Dialog Models | Sep 18, 2017 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Guided Deep Reinforcement Learning for Swarm Systems | Sep 18, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Reinforcement Learning for Conversational AI | Sep 15, 2017 | Deep LearningDeep Reinforcement Learning | CodeCode Available | 0 |