| Verifiable Reinforcement Learning via Policy Extraction | May 22, 2018 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills | Apr 8, 2018 | Deep Reinforcement LearningMotion Synthesis | CodeCode Available | 1 |
| Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning | Mar 27, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous ControlDecision Making | CodeCode Available | 1 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger | Dec 23, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| AI2-THOR: An Interactive 3D Environment for Visual AI | Dec 14, 2017 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| Population Based Training of Neural Networks | Nov 27, 2017 | Deep Reinforcement LearningMachine Translation | CodeCode Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Eigenoption Discovery through the Deep Successor Representation | Oct 30, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Robust Rewards with Adversarial Inverse Reinforcement Learning | Oct 30, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations | Sep 28, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Exposure: A White-Box Photo Post-Processing Framework | Sep 27, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 1 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | Aug 17, 2017 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Lenient Multi-Agent Deep Reinforcement Learning | Jul 14, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments | Jun 7, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning | Mar 20, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation | Mar 1, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning | Nov 15, 2016 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| Sample Efficient Actor-Critic with Experience Replay | Nov 3, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 |