| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Tight Regret Bounds for Infinite-armed Linear Contextual Bandits | May 4, 2019 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Group Retention when Using Machine Learning in Sequential Decision Making: the Interplay between User Dynamics and Fairness | May 2, 2019 | Decision MakingFairness | —Unverified | 0 |
| Trajectory VAE for multi-modal imitation | May 1, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Understanding & Generalizing AlphaGo Zero | May 1, 2019 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision MakingQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Optimal Critical Care Pain Management with Morphine using Dueling Double-Deep Q Networks | Apr 25, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | Apr 24, 2019 | Decision Makingfeature selection | —Unverified | 0 |
| Latent Variable Algorithms for Multimodal Learning and Sensor Fusion | Apr 23, 2019 | Activity RecognitionDecision Making | —Unverified | 0 |
| The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors | Apr 22, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |