| Scaling Multi-Armed Bandit Algorithms | Jul 25, 2019 | Multi-Armed Bandits, Sequential Decision Making | Unverified | 0 |
| IR-VIC: Unsupervised Discovery of Sub-goals for Transfer in RL | Jul 24, 2019 | Decision Making, Hierarchical Reinforcement Learning | Unverified | 0 |
| A Sufficient Statistic for Influence in Structured Multiagent Environments | Jul 22, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Reward Advancement: Transforming Policy under Maximum Causal Entropy Principle | Jul 11, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Scheme for Dynamic Risk-Sensitive Sequential Decision Making | Jul 9, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Thompson Sampling on Symmetric α-Stable Bandits | Jul 8, 2019 | Bayesian Inference, Decision Making | Unverified | 0 |
| Co-training for Policy Learning | Jul 3, 2019 | Combinatorial Optimization, continuous-control | Code Available | 0 |
| Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning | Jul 1, 2019 | Decision Making, Image Captioning | Code Available | 0 |
| Exploiting Relevance for Online Decision-Making in High-Dimensions | Jul 1, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Learning Markov models via low-rank optimization | Jun 28, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| A Theoretical Connection Between Statistical Physics and Reinforcement Learning | Jun 24, 2019 | Decision Making, reinforcement-learning | Unverified | 0 |
| A Hierarchical Architecture for Sequential Decision-Making in Autonomous Driving using Deep Reinforcement Learning | Jun 20, 2019 | Autonomous Driving, Decision Making | Code Available | 0 |
| Macro-action Multi-time scale Dynamic Programming for Energy Management in Buildings with Phase Change Materials | Jun 11, 2019 | Decision Making, energy management | Unverified | 0 |
| Neural Heterogeneous Scheduler | Jun 9, 2019 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Non-Stationary Reinforcement Learning: The Blessing of (More) Optimism | Jun 7, 2019 | Decision Making, reinforcement-learning | Unverified | 0 |
| Lifelong Learning with a Changing Action Set | Jun 5, 2019 | Decision Making, Lifelong learning | Code Available | 0 |
| Reinforcement Learning When All Actions are Not Always Available | Jun 5, 2019 | All, Decision Making | Code Available | 0 |
| Learning NP-Hard Multi-Agent Assignment Planning using GNN: Inference on a Random Graph and Provable Auction-Fitted Q-learning | May 29, 2019 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Learning to Discretize: Solving 1D Scalar Conservation Laws via Deep Reinforcement Learning | May 27, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Multi-hop Reading Comprehension via Deep Reinforcement Learning based Document Traversal | May 23, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Knowledge-Based Sequential Decision-Making Under Uncertainty | May 16, 2019 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Tight Regret Bounds for Infinite-armed Linear Contextual Bandits | May 4, 2019 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Group Retention when Using Machine Learning in Sequential Decision Making: the Interplay between User Dynamics and Fairness | May 2, 2019 | Decision Making, Fairness | Unverified | 0 |
| Understanding & Generalizing AlphaGo Zero | May 1, 2019 | Decision Making, reinforcement-learning | Unverified | 0 |
| Soft Q-Learning with Mutual-Information Regularization | May 1, 2019 | Decision Making, Q-Learning | Unverified | 0 |