Exponentiated Gradient LINUCB for Contextual Multi-Armed Bandits
2013-05-10Unverified0· sign in to hype
Djallel Bouneffouf
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
We present Exponentiated Gradient LINUCB, an algorithm for con-textual multi-armed bandits. This algorithm uses Exponentiated Gradient to find the optimal exploration of the LINUCB. Within a deliberately designed offline simulation framework we conduct evaluations with real online event log data. The experimental results demonstrate that our algorithm outperforms surveyed algorithms.