| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Model-free Deep Reinforcement Learning for Urban Autonomous Driving | Apr 20, 2019 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| On the Pitfalls of Measuring Emergent Communication | Mar 12, 2019 | Deep Reinforcement LearningFault Detection | CodeCode Available | 1 |
| Learning to Paint With Model-based Deep Reinforcement Learning | Mar 11, 2019 | Deep Reinforcement LearningPosition | CodeCode Available | 1 |
| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine | Feb 25, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity | Feb 14, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 |
| Off-Policy Deep Reinforcement Learning without Exploration | Dec 7, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Quantifying Generalization in Reinforcement Learning | Dec 6, 2018 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 |
| Exploration by Random Network Distillation | Oct 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative FilteringDecision Making | CodeCode Available | 1 |
| Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks | Oct 24, 2018 | Contact-rich ManipulationDeep Reinforcement Learning | CodeCode Available | 1 |
| Optimization of Molecules via Deep Reinforcement Learning | Oct 19, 2018 | Deep Reinforcement LearningMolecular Graph Generation | CodeCode Available | 1 |
| Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning | Oct 19, 2018 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| Visual Semantic Navigation using Scene Priors | Oct 15, 2018 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space | Oct 10, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning | Sep 24, 2018 | Deep Reinforcement LearningHuman Dynamics | CodeCode Available | 1 |
| Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning | Sep 17, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Adversarial Deep Reinforcement Learning in Portfolio Management | Aug 29, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Maximum a Posteriori Policy Optimisation | Jun 14, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Playing hard exploration games by watching YouTube | May 29, 2018 | Deep Reinforcement LearningMontezuma's Revenge | CodeCode Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text SummarizationCaption Generation | CodeCode Available | 1 |
| Verifiable Reinforcement Learning via Policy Extraction | May 22, 2018 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills | Apr 8, 2018 | Deep Reinforcement LearningMotion Synthesis | CodeCode Available | 1 |
| Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning | Mar 27, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous ControlDecision Making | CodeCode Available | 1 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |
| Whatever Does Not Kill Deep Reinforcement Learning, Makes It Stronger | Dec 23, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| AI2-THOR: An Interactive 3D Environment for Visual AI | Dec 14, 2017 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| Population Based Training of Neural Networks | Nov 27, 2017 | Deep Reinforcement LearningMachine Translation | CodeCode Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Eigenoption Discovery through the Deep Successor Representation | Oct 30, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Robust Rewards with Adversarial Inverse Reinforcement Learning | Oct 30, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations | Sep 28, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Exposure: A White-Box Photo Post-Processing Framework | Sep 27, 2017 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 1 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | Aug 17, 2017 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Lenient Multi-Agent Deep Reinforcement Learning | Jul 14, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem | Jun 30, 2017 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments | Jun 7, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning | Mar 20, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Virtual-to-real Deep Reinforcement Learning: Continuous Control of Mobile Robots for Mapless Navigation | Mar 1, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| #Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning | Nov 15, 2016 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| Sample Efficient Actor-Critic with Experience Replay | Nov 3, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 |