Linear Learning with Sparse Data
2016-12-29
Ofer Dekel
Abstract
Linear predictors are especially useful when the data is high-dimensional and sparse. One of the standard techniques used to train a linear predictor is the Averaged Stochastic Gradient Descent (ASGD) algorithm. We present an efficient implementation of ASGD that avoids dense vector operations. We also describe a translation invariant extension called Centered Averaged Stochastic Gradient Descent (CASGD).
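
To illustrate how an averaged iterate can be maintained without dense vector operations, here is a minimal sketch. It is not necessarily the implementation described in the paper. It assumes squared loss, a constant step size `eta`, zero initial weights, and no regularization, and the names `sparse_asgd` and `dense_asgd` are hypothetical. It relies on a standard identity: if d_s is the sparse update at step s, w_T is the sum of the d_s, and u_T is the sum of s * d_s, then the average of w_1, ..., w_T equals ((T + 1) * w_T - u_T) / T. Both w and u change only on the non-zero coordinates of each example.

```python
from collections import defaultdict

def sparse_asgd(examples, eta=0.1):
    """Averaged SGD for squared loss, touching only non-zero features.

    examples: list of (features, label), where features maps index -> value.
    Returns the averaged weights as a dict (illustrative sketch only).
    """
    w = defaultdict(float)  # current weights w_t
    u = defaultdict(float)  # running sum of s * d_s over past updates
    t = 0
    for x, y in examples:
        t += 1
        pred = sum(w[i] * v for i, v in x.items())
        g = pred - y  # derivative of 0.5 * (pred - y)^2 w.r.t. pred
        for i, v in x.items():
            d = -eta * g * v  # sparse update to coordinate i
            w[i] += d
            u[i] += t * d
    if t == 0:
        return {}
    # Recover the average of w_1..w_t from w and u in one final pass.
    return {i: ((t + 1) * w[i] - u[i]) / t for i in w}

def dense_asgd(examples, dim, eta=0.1):
    """Reference version that adds the full weight vector every step."""
    w = [0.0] * dim
    total = [0.0] * dim
    for x, y in examples:
        pred = sum(w[i] * v for i, v in x.items())
        g = pred - y
        for i, v in x.items():
            w[i] -= eta * g * v
        for j in range(dim):  # dense O(dim) work per example
            total[j] += w[j]
    return [s / len(examples) for s in total]

# Toy data, made up for illustration: 4 features, at most 2 non-zero each.
examples = [
    ({0: 1.0, 3: 2.0}, 1.0),
    ({1: 1.0}, -1.0),
    ({0: 0.5, 2: 1.0}, 0.5),
    ({3: 1.0}, 2.0),
]
sparse_avg = sparse_asgd(examples, eta=0.1)
dense_avg = dense_asgd(examples, dim=4, eta=0.1)
```

The per-example cost of `sparse_asgd` is proportional to the number of non-zero features, whereas `dense_asgd` pays for the full dimension on every step. Handling regularization or a decaying step size would need additional bookkeeping that this sketch omits.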