| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Nov 14, 2019 | MuJoCoVocal Bursts Intensity Prediction | —Unverified | 0 |
| Multi-Path Policy Optimization | Nov 11, 2019 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Asynchronous Methods for Model-Based Reinforcement Learning | Oct 28, 2019 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales | Oct 23, 2019 | MuJoCoVariational Inference | CodeCode Available | 0 |
| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Oct 10, 2019 | Hierarchical Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Multi-step Greedy Reinforcement Learning Algorithms | Oct 7, 2019 | Continuous ControlGame of Go | —Unverified | 0 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Oct 2, 2019 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Formal Language Constraints for Markov Decision Processes | Oct 2, 2019 | Atari GamesMuJoCo | CodeCode Available | 0 |