Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Jan 4, 2018 Continuous Control Decision Making
Code Code Available 1DeepMind Control Suite Jan 2, 2018 continuous-control Continuous Control
Code Code Available 1Deep Reinforcement Learning for List-wise Recommendations Dec 30, 2017 Deep Reinforcement Learning Recommendation Systems
Code Code Available 1Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger Dec 23, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1AI2-THOR: An Interactive 3D Environment for Visual AI Dec 14, 2017 Deep Reinforcement Learning Imitation Learning
Code Code Available 1Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm Dec 5, 2017 Game of Chess Game of Go
Code Code Available 1Time Limits in Reinforcement Learning Dec 1, 2017 General Reinforcement Learning reinforcement-learning
Code Code Available 1Plan, Attend, Generate: Planning for Sequence-to-Sequence Models Nov 28, 2017 Question Generation Question-Generation
Code Code Available 1One-Shot Reinforcement Learning for Robot Navigation with Interactive Replay Nov 28, 2017 Navigate reinforcement-learning
Code Code Available 1Action Branching Architectures for Deep Reinforcement Learning Nov 24, 2017 continuous-control Continuous Control
Code Code Available 1Eigenoption Discovery through the Deep Successor Representation Oct 30, 2017 Atari Games Deep Reinforcement Learning
Code Code Available 1Learning Robust Rewards with Adversarial Inverse Reinforcement Learning Oct 30, 2017 Decision Making Deep Reinforcement Learning
Code Code Available 1Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations Sep 28, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1A Benchmark Environment Motivated by Industrial Control Problems Sep 27, 2017 OpenAI Gym Reinforcement Learning
Code Code Available 1Automated Cloud Provisioning on AWS using Deep Reinforcement Learning Sep 13, 2017 Cloud Computing Deep Reinforcement Learning
Code Code Available 1Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning Aug 31, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 1Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Aug 17, 2017 Atari Games continuous-control
Code Code Available 1Meta-SGD: Learning to Learn Quickly for Few-Shot Learning Jul 31, 2017 Few-Shot Learning Meta-Learning
Code Code Available 1A Distributional Perspective on Reinforcement Learning Jul 21, 2017 Atari Games reinforcement-learning
Code Code Available 1A multi-agent reinforcement learning model of common-pool resource appropriation Jul 20, 2017 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 1Lenient Multi-Agent Deep Reinforcement Learning Jul 14, 2017 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 1Emergence of Locomotion Behaviours in Rich Environments Jul 7, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 1Hindsight Experience Replay Jul 5, 2017 Reinforcement Learning Reinforcement Learning (RL)
Code Code Available 1A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem Jun 30, 2017 Deep Reinforcement Learning Management
Code Code Available 1Value-Decomposition Networks For Cooperative Multi-Agent Learning Jun 16, 2017 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments Jun 7, 2017 Deep Reinforcement Learning Multi-agent Reinforcement Learning
Code Code Available 1Thinking Fast and Slow with Deep Learning and Tree Search May 23, 2017 Decision Making Deep Learning
Code Code Available 1ParlAI: A Dialog Research Software Platform May 18, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 1A Deep Reinforced Model for Abstractive Summarization May 11, 2017 Abstractive Text Summarization Decoder
Code Code Available 1Time-Contrastive Networks: Self-Supervised Learning from Video Apr 23, 2017 Metric Learning reinforcement-learning
Code Code Available 1Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning Mar 20, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1Evolution Strategies as a Scalable Alternative to Reinforcement Learning Mar 10, 2017 Atari Games MuJoCo
Code Code Available 1Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks Mar 9, 2017 Category-Agnostic Pose Estimation Few-Shot Image Classification
Code Code Available 1Robust Adversarial Reinforcement Learning Mar 8, 2017 Friction reinforcement-learning
Code Code Available 1Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation Mar 1, 2017 continuous-control Continuous Control
Code Code Available 1Stabilising Experience Replay for Deep Multi-Agent Reinforcement Learning Feb 28, 2017 Multi-agent Reinforcement Learning Q-Learning
Code Code Available 1Multi-agent Reinforcement Learning in Sequential Social Dilemmas Feb 10, 2017 Multi-agent Reinforcement Learning reinforcement-learning
Code Code Available 1An Alternative Softmax Operator for Reinforcement Learning Dec 16, 2016 Decision Making reinforcement-learning
Code Code Available 1Cryptocurrency Portfolio Management with Deep Reinforcement Learning Dec 5, 2016 Decision Making Deep Reinforcement Learning
Code Code Available 1Self-critical Sequence Training for Image Captioning Dec 2, 2016 Image Captioning Policy Gradient Methods
Code Code Available 1Neural Combinatorial Optimization with Reinforcement Learning Nov 29, 2016 Combinatorial Optimization reinforcement-learning
Code Code Available 1#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning Nov 15, 2016 Atari Games continuous-control
Code Code Available 1Sample Efficient Actor-Critic with Experience Replay Nov 3, 2016 continuous-control Continuous Control
Code Code Available 1Progressive Neural Networks Jun 15, 2016 Continual Learning reinforcement-learning
Code Code Available 1Generative Adversarial Imitation Learning Jun 10, 2016 Imitation Learning reinforcement-learning
Code Code Available 1OpenAI Gym Jun 5, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 1Deep Reinforcement Learning from Self-Play in Imperfect-Information Games Mar 3, 2016 Card Games Deep Reinforcement Learning
Code Code Available 1Continuous Deep Q-Learning with Model-based Acceleration Mar 2, 2016 continuous-control Continuous Control
Code Code Available 1Investigating practical linear temporal difference learning Feb 28, 2016 reinforcement-learning Reinforcement Learning
Code Code Available 1Asynchronous Methods for Deep Reinforcement Learning Feb 4, 2016 Atari Games CPU
Code Code Available 1