| A Biologically Plausible Benchmark for Contextual Bandit Algorithms in Precision Oncology Using in vitro Data | Nov 11, 2019 | Benchmarking, Decision Making | Code Available | 0 |
| Adaptivity in Adaptive Submodularity | Nov 9, 2019 | Active Learning, Decision Making | Unverified | 0 |
| Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | Nov 2, 2019 | Decision Making, Sequential Decision Making | Unverified | 0 |
| Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints | Nov 2, 2019 | Bayesian Optimization, Decision Making | Unverified | 0 |
| Thompson Sampling via Local Uncertainty | Oct 30, 2019 | Decision Making, Multi-Armed Bandits | Code Available | 0 |
| Policy Learning for Malaria Control | Oct 20, 2019 | Bayesian Optimization, Decision Making | Code Available | 0 |
| Adaptive Exploration in Linear Contextual Bandit | Oct 15, 2019 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value Functions | Oct 15, 2019 | Decision Making, Decision Making Under Uncertainty | Code Available | 1 |
| MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python | Oct 4, 2019 | Decision Making, Sequential Decision Making | Code Available | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |