| Machine Teaching for Inverse Reinforcement Learning: Algorithms and Applications | May 20, 2018 | Decision Makingreinforcement-learning | CodeCode Available | 0 |
| Fast Online Exact Solutions for Deterministic MDPs with Sparse Rewards | May 8, 2018 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 17, 2018 | Atari GamesDecision Making | —Unverified | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |
| UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits | Apr 16, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Hindsight is Only 50/50: Unsuitability of MDP based Approximate POMDP Solvers for Multi-resolution Information Gathering | Apr 7, 2018 | Decision MakingImitation Learning | —Unverified | 0 |
| Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection | Mar 14, 2018 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Hierarchical Imitation and Reinforcement Learning | Mar 1, 2018 | Decision MakingImitation Learning | —Unverified | 0 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |