| Adversarial Policies: Attacking Deep Reinforcement Learning | May 25, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Model-free Deep Reinforcement Learning for Urban Autonomous Driving | Apr 20, 2019 | Autonomous Driving, Decision Making | Code Available | 1 |
| On the Pitfalls of Measuring Emergent Communication | Mar 12, 2019 | Deep Reinforcement Learning, Fault Detection | Code Available | 1 |
| Learning to Paint With Model-based Deep Reinforcement Learning | Mar 11, 2019 | Deep Reinforcement Learning, Position | Code Available | 1 |
| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-control, Continuous Control | Code Available | 1 |
| Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine | Feb 25, 2019 | continuous-control, Continuous Control | Code Available | 1 |
| CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity | Feb 14, 2019 | continuous-control, Continuous Control | Code Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Off-Policy Deep Reinforcement Learning without Exploration | Dec 7, 2018 | continuous-control, Continuous Control | Code Available | 1 |
| Quantifying Generalization in Reinforcement Learning | Dec 6, 2018 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine Learning, Decision Making | Code Available | 1 |
| Exploration by Random Network Distillation | Oct 30, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Deep Reinforcement Learning based Recommendation with Explicit User-Item Interactions Modeling | Oct 29, 2018 | Collaborative Filtering, Decision Making | Code Available | 1 |
| Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks | Oct 24, 2018 | Contact-rich Manipulation, Deep Reinforcement Learning | Code Available | 1 |
| Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning | Oct 19, 2018 | counterfactual, Counterfactual Reasoning | Code Available | 1 |
| Optimization of Molecules via Deep Reinforcement Learning | Oct 19, 2018 | Deep Reinforcement Learning, Molecular Graph Generation | Code Available | 1 |
| Visual Semantic Navigation using Scene Priors | Oct 15, 2018 | Deep Reinforcement Learning, Navigate | Code Available | 1 |
| Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space | Oct 10, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning | Sep 24, 2018 | Deep Reinforcement Learning, Human Dynamics | Code Available | 1 |
| Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning | Sep 17, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Adversarial Deep Reinforcement Learning in Portfolio Management | Aug 29, 2018 | Deep Reinforcement Learning, Management | Code Available | 1 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Maximum a Posteriori Policy Optimisation | Jun 14, 2018 | continuous-control, Continuous Control | Code Available | 1 |
| Playing hard exploration games by watching YouTube | May 29, 2018 | Deep Reinforcement Learning, Montezuma's Revenge | Code Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text Summarization, Caption Generation | Code Available | 1 |