Large-Scale Learning with Less RAM via Randomization

Daniel Golovin; D. Sculley; H. Brendan McMahan; Michael Young

Large-Scale Learning with Less RAM via Randomization

Machine Learning 2013-03-20 v1

Authors: Daniel Golovin , D. Sculley , H. Brendan McMahan , Michael Young

Abstract

We reduce the memory footprint of popular large-scale online learning methods by projecting our weight vector onto a coarse discrete set using randomized rounding. Compared to standard 32-bit float encodings, this reduces RAM usage by more than 50% during training and by up to 95% when making predictions from a fixed model, with almost no loss in accuracy. We also show that randomized counting can be used to implement per-coordinate learning rates, improving model quality with little additional RAM. We prove these memory-saving methods achieve regret guarantees similar to their exact variants. Empirical evaluation confirms excellent performance, dominating standard approaches across memory versus accuracy tradeoffs.

Keywords

quantization model compression large language model training

Cite

@article{arxiv.1303.4664,
  title  = {Large-Scale Learning with Less RAM via Randomization},
  author = {Daniel Golovin and D. Sculley and H. Brendan McMahan and Michael Young},
  journal= {arXiv preprint arXiv:1303.4664},
  year   = {2013}
}

Comments

Extended version of ICML 2013 paper

Large-Scale Learning with Less RAM via Randomization

Abstract

Keywords

Cite

Comments

Related papers