| UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits | Apr 16, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Policy Gradient With Value Function Approximation For Collective Multiagent Planning | Apr 9, 2018 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Hindsight is Only 50/50: Unsuitability of MDP based Approximate POMDP Solvers for Multi-resolution Information Gathering | Apr 7, 2018 | Decision MakingImitation Learning | —Unverified | 0 |
| Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection | Mar 14, 2018 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Hierarchical Imitation and Reinforcement Learning | Mar 1, 2018 | Decision MakingImitation Learning | —Unverified | 0 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Novel Approaches to Accelerating the Convergence Rate of Markov Decision Process for Search Result Diversification | Feb 23, 2018 | Decision MakingInformation Retrieval | —Unverified | 0 |
| Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| An Anytime Algorithm for Task and Motion MDPs | Feb 16, 2018 | Decision MakingMotion Planning | —Unverified | 0 |
| MPC-Inspired Neural Network Policies for Sequential Decision Making | Feb 15, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Decomposition Methods with Deep Corrections for Reinforcement Learning | Feb 6, 2018 | Autonomous DrivingDecision Making | CodeCode Available | 0 |
| Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process | Jan 25, 2018 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Testing Optimality of Sequential Decision-Making | Jan 4, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning Structural Weight Uncertainty for Sequential Decision-Making | Dec 30, 2017 | Decision MakingMulti-Armed Bandits | CodeCode Available | 0 |
| Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward | Dec 29, 2017 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-shot Pedestrian Re-identification via Sequential Decision Making | Dec 19, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Learning Multi-Level Hierarchies with Hindsight | Dec 4, 2017 | Decision MakingHierarchical Reinforcement Learning | CodeCode Available | 1 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| SkipNet: Learning Dynamic Routing in Convolutional Networks | Nov 26, 2017 | Decision MakingReinforcement Learning | CodeCode Available | 1 |
| Classification with Costly Features using Deep Reinforcement Learning | Nov 20, 2017 | ClassificationClassification with Costly Features | CodeCode Available | 0 |
| Loss Functions for Multiset Prediction | Nov 14, 2017 | Decision MakingPrediction | —Unverified | 0 |
| Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making | Oct 31, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics | Oct 30, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Hierarchical State Abstractions for Decision-Making Problems with Computational Constraints | Oct 22, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Asymmetric Actor Critic for Image-Based Robot Learning | Oct 18, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping | Sep 14, 2017 | Decision MakingImage Cropping | CodeCode Available | 0 |
| Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks | Sep 13, 2017 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Non-Stationary Bandits with Habituation and Recovery Dynamics | Jul 26, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions | Jul 24, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Learning model-based planning from scratch | Jul 19, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Tableaux for Policy Synthesis for MDPs with PCTL* Constraints | Jun 30, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| The Theory is Predictive, but is it Complete? An Application to Human Perception of Randomness | Jun 21, 2017 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| Unlocking the Potential of Simulators: Design with RL in Mind | Jun 8, 2017 | Decision MakingFriction | —Unverified | 0 |
| A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming | Jun 5, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Boltzmann Exploration Done Right | May 29, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Thinking Fast and Slow with Deep Learning and Tree Search | May 23, 2017 | Decision MakingDeep Learning | CodeCode Available | 1 |
| Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning | May 21, 2017 | BenchmarkingDecision Making | —Unverified | 0 |
| Answer Set Programming for Non-Stationary Markov Decision Processes | May 3, 2017 | Decision Makingreinforcement-learning | —Unverified | 0 |
| On Improving Deep Reinforcement Learning for POMDPs | Apr 26, 2017 | Atari GamesDecision Making | CodeCode Available | 0 |
| Using Reinforcement Learning for Demand Response of Domestic Hot Water Buffers: a Real-Life Demonstration | Mar 16, 2017 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making | Mar 14, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Deep Robust Kalman Filter | Mar 7, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction | Mar 3, 2017 | Decision MakingDependency Parsing | —Unverified | 0 |
| Active Learning for Accurate Estimation of Linear Models | Mar 2, 2017 | Active LearningDecision Making | —Unverified | 0 |
| Tight Bounds for Bandit Combinatorial Optimization | Feb 24, 2017 | Combinatorial OptimizationDecision Making | —Unverified | 0 |
| Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning | Feb 20, 2017 | Car RacingDecision Making | —Unverified | 0 |
| Deep Reinforcement Learning for Visual Object Tracking in Videos | Jan 31, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |