| Safety-Aware Algorithms for Adversarial Contextual Bandit | Aug 1, 2017 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Non-Stationary Bandits with Habituation and Recovery Dynamics | Jul 26, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions | Jul 24, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Learning model-based planning from scratch | Jul 19, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces | Jul 8, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Tableaux for Policy Synthesis for MDPs with PCTL* Constraints | Jun 30, 2017 | Decision MakingSequential Decision Making | —Unverified | 0 |
| The Theory is Predictive, but is it Complete? An Application to Human Perception of Randomness | Jun 21, 2017 | BIG-bench Machine LearningDecision Making | —Unverified | 0 |
| Unlocking the Potential of Simulators: Design with RL in Mind | Jun 8, 2017 | Decision MakingFriction | —Unverified | 0 |
| A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming | Jun 5, 2017 | Decision MakingReinforcement Learning | —Unverified | 0 |
| Boltzmann Exploration Done Right | May 29, 2017 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |