| Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed BanditsSequential Decision Making | —Unverified | 0 |
| IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL | Jul 24, 2019 | Decision MakingHierarchical Reinforcement Learning | —Unverified | 0 |
| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Reward Advancement: Transforming Policy under Maximum Causal Entropy Principle | Jul 11, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| A Scheme for Dynamic Risk-Sensitive Sequential Decision Making | Jul 9, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Thompson Sampling on Symmetric α-Stable Bandits | Jul 8, 2019 | Bayesian InferenceDecision Making | —Unverified | 0 |
| Co-training for Policy Learning | Jul 3, 2019 | Combinatorial Optimizationcontinuous-control | CodeCode Available | 0 |
| Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning | Jul 1, 2019 | Decision MakingImage Captioning | CodeCode Available | 0 |
| Exploiting Relevance for Online Decision-Making in High-Dimensions | Jul 1, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning Markov models via low-rank optimization | Jun 28, 2019 | Decision MakingSequential Decision Making | —Unverified | 0 |