| UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits | Apr 16, 2018 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision Making, Reinforcement Learning | Unverified | 0 |
| Hindsight is Only 50/50: Unsuitability of MDP based Approximate POMDP Solvers for Multi-resolution Information Gathering | Apr 7, 2018 | Decision Making, Imitation Learning | Unverified | 0 |
| Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection | Mar 14, 2018 | Combinatorial Optimization, Decision Making | Unverified | 0 |
| Hierarchical Imitation and Reinforcement Learning | Mar 1, 2018 | Decision Making, Imitation Learning | Unverified | 0 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Novel Approaches to Accelerating the Convergence Rate of Markov Decision Process for Search Result Diversification | Feb 23, 2018 | Decision Making, Information Retrieval | Unverified | 0 |
| Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision Making, Motion Planning | Unverified | 0 |
| MPC-Inspired Neural Network Policies for Sequential Decision Making | Feb 15, 2018 | Decision Making, Sequential Decision Making | Unverified | 0 |