Why ADAGRAD Fails for Online Topic Modeling

2017-09-01EMNLP 2017Unverified0· sign in to hype

You Lu, Jeffrey Lund, Jordan Boyd-Graber

Unverified — Be the first to reproduce this paper.

Abstract

Online topic modeling, i.e., topic modeling with stochastic variational inference, is a powerful and efficient technique for analyzing large datasets, and ADAGRAD is a widely-used technique for tuning learning rates during online gradient optimization. However, these two techniques do not work well together. We show that this is because ADAGRAD uses accumulation of previous gradients as the learning rates' denominators. For online topic modeling, the magnitude of gradients is very large. It causes learning rates to shrink very quickly, so the parameters cannot fully converge until the training ends

Tasks

Topic Models Variational Inference

Why ADAGRAD Fails for Online Topic Modeling

Abstract

Tasks

Reproductions