Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

Mark Schmidt; Reza Babanezhad; Mohamed Osama Ahmed; Aaron Defazio; Ann Clifton; Anoop Sarkar

Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

Machine Learning 2015-04-20 v1 Machine Learning Optimization and Control Computation

Authors: Mark Schmidt , Reza Babanezhad , Mohamed Osama Ahmed , Aaron Defazio , Ann Clifton , Anoop Sarkar

Abstract

We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the SAGA variant under non-uniform sampling. Our experimental results reveal that our method often significantly outperforms existing methods in terms of the training objective, and performs as well or better than optimally-tuned stochastic gradient methods in terms of test error.

Keywords

stochastic gradient descent

Cite

@article{arxiv.1504.04406,
  title  = {Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields},
  author = {Mark Schmidt and Reza Babanezhad and Mohamed Osama Ahmed and Aaron Defazio and Ann Clifton and Anoop Sarkar},
  journal= {arXiv preprint arXiv:1504.04406},
  year   = {2015}
}

Comments

AI/Stats 2015, 24 pages

Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

Abstract

Keywords

Cite

Comments

Related papers