| Adversarial Deep Reinforcement Learning in Portfolio Management | Aug 29, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| BOHB: Robust and Efficient Hyperparameter Optimization at Scale | Jul 4, 2018 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Maximum a Posteriori Policy Optimisation | Jun 14, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Playing hard exploration games by watching YouTube | May 29, 2018 | Deep Reinforcement LearningMontezuma's Revenge | CodeCode Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text SummarizationCaption Generation | CodeCode Available | 1 |
| Verifiable Reinforcement Learning via Policy Extraction | May 22, 2018 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills | Apr 8, 2018 | Deep Reinforcement LearningMotion Synthesis | CodeCode Available | 1 |
| Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning | Mar 27, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Jan 4, 2018 | Continuous ControlDecision Making | CodeCode Available | 1 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement LearningRecommendation Systems | CodeCode Available | 1 |