English

Linear Learning with Sparse Data

Machine Learning 2017-01-27 v2

Abstract

Linear predictors are especially useful when the data is high-dimensional and sparse. One of the standard techniques used to train a linear predictor is the Averaged Stochastic Gradient Descent (ASGD) algorithm. We present an efficient implementation of ASGD that avoids dense vector operations. We also describe a translation invariant extension called Centered Averaged Stochastic Gradient Descent (CASGD).

Keywords

Cite

@article{arxiv.1612.09147,
  title  = {Linear Learning with Sparse Data},
  author = {Ofer Dekel},
  journal= {arXiv preprint arXiv:1612.09147},
  year   = {2017}
}