| An Information-Theoretic Optimality Principle for Deep Reinforcement Learning | Aug 6, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Deep Reinforcement Learning for Inquiry Dialog Policies with Logical Formula Embeddings | Aug 2, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Grounding Language for Transfer in Deep Reinforcement Learning | Aug 1, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution | Aug 1, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards | Jul 27, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| DARLA: Improving Zero-Shot Transfer in Reinforcement Learning | Jul 26, 2017 | Deep Reinforcement Learning, Domain Adaptation | Code Available | 0 |
| Direct Load Control of Thermostatically Controlled Loads Based on Sparse Observations Using Deep Reinforcement Learning | Jul 26, 2017 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| 3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds | Jul 21, 2017 | Classification, Deep Reinforcement Learning | Unverified | 0 |
| Imagination-Augmented Agents for Deep Reinforcement Learning | Jul 19, 2017 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 0 |
| On-line Building Energy Optimization using Deep Reinforcement Learning | Jul 18, 2017 | Deep Reinforcement Learning, energy management | Unverified | 0 |
| Tracking as Online Decision-Making: Learning a Policy from Streaming Videos with Reinforcement Learning | Jul 17, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Distral: Robust Multitask Reinforcement Learning | Jul 13, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Learning Macromanagement in StarCraft from Replays using Deep Learning | Jul 12, 2017 | Deep Learning, Deep Reinforcement Learning | Unverified | 0 |
| Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation | Jul 11, 2017 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Value Prediction Network | Jul 11, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Deep Q-Learning for Self-Organizing Networks Fault Management and Radio Performance Improvement | Jul 10, 2017 | Deep Reinforcement Learning, Management | Unverified | 0 |
| Deep Reinforcement Learning Attention Selection for Person Re-Identification | Jul 10, 2017 | Deep Reinforcement Learning, Person Re-Identification | Unverified | 0 |
| Solving high-dimensional partial differential equations using deep learning | Jul 9, 2017 | Deep Learning, Deep Reinforcement Learning | Code Available | 0 |
| Learning human behaviors from motion capture by adversarial imitation | Jul 7, 2017 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Maintaining cooperation in complex social dilemmas using deep reinforcement learning | Jul 4, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Hashing over Predicted Future Frames for Informed Exploration of Deep Reinforcement Learning | Jul 3, 2017 | Deep Reinforcement Learning, Efficient Exploration | Unverified | 0 |
| Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management | Jul 1, 2017 | Deep Reinforcement Learning, Dialogue Management | Unverified | 0 |
| Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning | Jul 1, 2017 | Deep Reinforcement Learning, GPU | Code Available | 0 |
| Noisy Networks for Exploration | Jun 30, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Structure Learning in Motor Control: A Deep Reinforcement Learning Model | Jun 21, 2017 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Unverified | 0 |
| Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual Learning, Deep Reinforcement Learning | Code Available | 0 |
| Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning | Jun 15, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations | Jun 15, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Parameter Space Noise for Exploration | Jun 6, 2017 | continuous-control, Continuous Control | Code Available | 0 |
| UCB Exploration via Q-Ensembles | Jun 5, 2017 | Deep Reinforcement Learning, Q-Learning | Unverified | 0 |
| Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning | Jun 1, 2017 | continuous-control, Continuous Control | Unverified | 0 |
| Reinforcement Learning for Learning Rate Control | May 31, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Fine-grained acceleration control for autonomous intersection management using deep reinforcement learning | May 30, 2017 | Autonomous Vehicles, Deep Reinforcement Learning | Unverified | 0 |
| End-to-end Active Object Tracking via Reinforcement Learning | May 30, 2017 | Deep Reinforcement Learning, Object | Unverified | 0 |
| Cross-Domain Perceptual Reward Functions | May 25, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| State Space Decomposition and Subgoal Creation for Transfer in Deep Reinforcement Learning | May 24, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Continuous State-Space Models for Optimal Sepsis Treatment - a Deep Reinforcement Learning Approach | May 23, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Enhanced Experience Replay Generation for Efficient Reinforcement Learning | May 23, 2017 | Deep Reinforcement Learning, Generative Adversarial Network | Unverified | 0 |
| Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning | May 21, 2017 | Benchmarking, Decision Making | Unverified | 0 |
| Shallow Updates for Deep Reinforcement Learning | May 21, 2017 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Learning to Factor Policies and Action-Value Functions: Factored Action Space Representations for Deep Reinforcement learning | May 20, 2017 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Atari games and Intel processors | May 19, 2017 | Atari Games, BIG-bench Machine Learning | Unverified | 0 |
| Delving into adversarial attacks on deep policies | May 18, 2017 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Efficient Parallel Methods for Deep Reinforcement Learning | May 13, 2017 | Deep Reinforcement Learning, GPU | Code Available | 0 |
| Analyzing Knowledge Transfer in Deep Q-Networks for Autonomously Handling Multiple Intersections | May 2, 2017 | Deep Reinforcement Learning, Lifelong learning | Unverified | 0 |
| Navigating Occluded Intersections with Autonomous Vehicles using Deep Reinforcement Learning | May 2, 2017 | Autonomous Vehicles, Deep Reinforcement Learning | Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 26, 2017 | Atari Games, Decision Making | Code Available | 0 |
| Molecular De Novo Design through Deep Reinforcement Learning | Apr 25, 2017 | Activity Prediction, Deep Reinforcement Learning | Code Available | 0 |
| Modular Multi-Objective Deep Reinforcement Learning with Decision Values | Apr 21, 2017 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |