MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python

2019-10-04 · IEEE 31st International Conference on Tools with Artificial Intelligence, ICTAI 2019 · Code Available

Emily Strong, Bernard Kleynhans, Serdar Kadioglu



Code

github.com/fmr-llc/mabwiser

Abstract

Contextual multi-armed bandit algorithms are an effective technique for online sequential decision-making problems. Despite their popularity, off-the-shelf library support remains limited, particularly for the Python technology stack. To fill this gap, we present a system that provides context-free, parametric, and nonparametric contextual multi-armed bandit models. The available bandit policies accommodate both batch and online learning. The MABWiser system is implemented as an open-source Python library. Its design enables built-in parallelization to speed up the training and test components for scalability while ensuring the reproducibility of results. We present a running example to highlight the user-friendly nature of the public interface and discuss the simulation capability of the library for hyperparameter tuning and rapid experimentation.
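To make the distinction between batch and online learning concrete, here is a minimal sketch of a context-free epsilon-greedy bandit in plain Python. It is an illustration only and is not MABWiser's API: the class `EpsilonGreedyBandit`, its methods, and the `seed` parameter are hypothetical names chosen for this example. The seeded random generator stands in for the reproducibility goal mentioned in the abstract.

```python
import random


class EpsilonGreedyBandit:
    """Hypothetical context-free epsilon-greedy bandit (not MABWiser's API)."""

    def __init__(self, arms, epsilon=0.1, seed=0):
        self.arms = list(arms)
        self.epsilon = epsilon
        self.rng = random.Random(seed)  # fixed seed keeps runs reproducible
        self.counts = {arm: 0 for arm in self.arms}
        self.totals = {arm: 0.0 for arm in self.arms}

    def fit(self, decisions, rewards):
        """Batch learning: reset statistics, then train on the full history."""
        self.counts = {arm: 0 for arm in self.arms}
        self.totals = {arm: 0.0 for arm in self.arms}
        self.partial_fit(decisions, rewards)

    def partial_fit(self, decisions, rewards):
        """Online learning: update statistics with new observations only."""
        for arm, reward in zip(decisions, rewards):
            self.counts[arm] += 1
            self.totals[arm] += reward

    def expectations(self):
        """Mean observed reward per arm (0.0 for arms never played)."""
        return {
            arm: self.totals[arm] / self.counts[arm] if self.counts[arm] else 0.0
            for arm in self.arms
        }

    def predict(self):
        """Explore with probability epsilon, otherwise exploit the best mean."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        means = self.expectations()
        return max(self.arms, key=means.get)


# Batch training on historical data, then an online update with new feedback.
# epsilon=0.0 disables exploration so the choices below are purely greedy.
bandit = EpsilonGreedyBandit(arms=["A", "B"], epsilon=0.0, seed=42)
bandit.fit(decisions=["A", "A", "B", "B"], rewards=[1.0, 0.0, 1.0, 1.0])
first_choice = bandit.predict()
bandit.partial_fit(decisions=["B", "B", "B", "A"], rewards=[0.0, 0.0, 0.0, 1.0])
second_choice = bandit.predict()
```

In this sketch, `fit` discards earlier statistics and retrains from scratch, while `partial_fit` only adds new observations to what was already learned, so the preferred arm can change as new feedback arrives.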

Tasks

Decision Making, Sequential Decision Making
