MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for Python
Emily Strong, Bernard Kleynhans, Serdar Kadioglu
Code: github.com/fmr-llc/mabwiser
Abstract
Contextual multi-armed bandit algorithms are an effective technique for online sequential decision-making problems. Despite their popularity, off-the-shelf library support remains limited, particularly in the Python technology stack. To fill this gap, we present a system that provides context-free, parametric, and nonparametric contextual multi-armed bandit models. The available bandit policies accommodate both batch and online learning. The MABWiser system is implemented as an open-source Python library. Its design provides built-in parallelization to speed up the training and test components for scalability while ensuring the reproducibility of results. We present a running example to highlight the user-friendly nature of the public interface and discuss the simulation capability of the library for hyperparameter tuning and rapid experimentation.
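To make the distinction between batch and online learning concrete, the following is a minimal, standard-library-only sketch of a context-free epsilon-greedy bandit. It is not MABWiser's implementation or its public interface: the class name `EpsilonGreedyBandit`, its method names, and its parameters are hypothetical and chosen only for illustration. The sketch assumes that batch learning resets state and trains on historical data, that online learning updates running statistics, and that a fixed seed is enough to make exploration reproducible.

```python
import random
from collections import defaultdict


class EpsilonGreedyBandit:
    """Illustrative context-free bandit (hypothetical; not MABWiser's code)."""

    def __init__(self, arms, epsilon=0.1, seed=0):
        self.arms = list(arms)
        self.epsilon = epsilon
        self.rng = random.Random(seed)  # fixed seed -> reproducible exploration
        self.counts = defaultdict(int)    # number of times each arm was played
        self.totals = defaultdict(float)  # summed reward per arm

    def fit(self, decisions, rewards):
        """Batch learning: discard prior state, then train on historical data."""
        self.counts.clear()
        self.totals.clear()
        self.partial_fit(decisions, rewards)

    def partial_fit(self, decisions, rewards):
        """Online learning: update running statistics with new observations."""
        for arm, reward in zip(decisions, rewards):
            self.counts[arm] += 1
            self.totals[arm] += reward

    def predict_expectations(self):
        """Mean observed reward per arm (0.0 for arms never played)."""
        return {
            arm: self.totals[arm] / self.counts[arm] if self.counts[arm] else 0.0
            for arm in self.arms
        }

    def predict(self):
        """Explore with probability epsilon, otherwise exploit the best arm."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.arms)
        expectations = self.predict_expectations()
        return max(self.arms, key=expectations.get)


# Usage: epsilon=0.0 disables exploration, so the choice is purely greedy.
bandit = EpsilonGreedyBandit(arms=["A", "B"], epsilon=0.0, seed=42)
bandit.fit(decisions=["A", "A", "B", "B"], rewards=[1, 0, 1, 1])
first_choice = bandit.predict()   # B has the higher mean reward (1.0 vs 0.5)

bandit.partial_fit(decisions=["B", "B", "B"], rewards=[0, 0, 0])
second_choice = bandit.predict()  # B's mean drops to 0.4, so A is now preferred
```

Keeping only per-arm counts and reward sums means an online update costs the same per observation as batch training, which is one simple way a single policy can serve both modes.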