Online Semi-Supervised Learning with Bandit Feedback

2020-10-23ICLR Workshop LLDUnverified0· sign in to hype

Sohini Upadhyay, Mikhail Yurochkin, Mayank Agarwal, Yasaman Khazaeni, DjallelBouneffouf

Unverified — Be the first to reproduce this paper.

Abstract

We formulate a new problem at the intersectionof semi-supervised learning and contextual bandits,motivated by several applications including clini-cal trials and ad recommendations. We demonstratehow Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted tothe new problem formulation. We also propose avariant of the linear contextual bandit with semi-supervised missing rewards imputation. We thentake the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithmsare verified on several real world datasets.

Tasks

Imputation Multi-Armed Bandits

Online Semi-Supervised Learning with Bandit Feedback

Abstract

Tasks

Reproductions