Linear Learning with Sparse Data

Ofer Dekel

Machine Learning 2017-01-27 v2

Abstract

Linear predictors are especially useful when the data is high-dimensional and sparse. A standard technique for training a linear predictor is the Averaged Stochastic Gradient Descent (ASGD) algorithm. We present an efficient implementation of ASGD that avoids dense vector operations. We also describe a translation-invariant extension called Centered Averaged Stochastic Gradient Descent (CASGD).
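
To illustrate what avoiding dense vector operations can look like, the following is a minimal sketch of one well-known way to maintain an average of SGD iterates using only sparse updates. It is not taken from the paper and may differ from the implementation described there. It assumes squared loss, a constant step size `eta`, no regularization, and sparse examples stored as `{index: value}` dictionaries. The names `sparse_asgd`, `dense_asgd`, and the auxiliary vector `u` are hypothetical. The idea is that if step `t` (counting from 0) adds a sparse update `d_t` to `w`, then the sum of the `T` post-update iterates equals `T * w - sum_t t * d_t`, so the average can be recovered once at the end.

```python
def sparse_asgd(examples, dim, eta=0.1):
    """Averaged SGD for squared loss, touching only nonzero features per step.

    examples: list of (x, y) pairs, where x is a dict {feature_index: value}.
    Returns the average of the iterates produced after each update.
    Assumptions (not from the paper): squared loss, constant step size,
    no regularization.
    """
    w = [0.0] * dim  # current iterate
    u = [0.0] * dim  # accumulates t * d_t, with t counted from 0
    for t, (x, y) in enumerate(examples):
        pred = sum(w[i] * v for i, v in x.items())
        g = pred - y  # derivative of 0.5 * (pred - y)^2 with respect to pred
        for i, v in x.items():
            d = -eta * g * v  # sparse update for coordinate i
            w[i] += d
            u[i] += t * d
    n = len(examples)
    if n == 0:
        return w
    # sum of post-update iterates = n * w - u, so the average is w - u / n
    return [wi - ui / n for wi, ui in zip(w, u)]


def dense_asgd(examples, dim, eta=0.1):
    """Reference version that adds the full iterate to a running sum each step."""
    w = [0.0] * dim
    total = [0.0] * dim
    for x, y in examples:
        pred = sum(w[i] * v for i, v in x.items())
        g = pred - y
        for i, v in x.items():
            w[i] -= eta * g * v
        total = [s + wi for s, wi in zip(total, w)]  # O(dim) work per step
    n = len(examples)
    return [s / n for s in total] if n else total
```

In this sketch the per-step cost of `sparse_asgd` is proportional to the number of nonzero features in the example, while `dense_asgd` pays a cost proportional to `dim` on every step; the only dense pass in `sparse_asgd` is the final one. The two functions should agree up to floating-point rounding.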
