English

Fast Distributed Gradient Methods

Information Theory 2014-04-15 v4 math.IT

Abstract

We study distributed optimization problems when NN nodes minimize the sum of their individual costs subject to a common vector variable. The costs are convex, have Lipschitz continuous gradient (with constant LL), and bounded gradient. We propose two fast distributed gradient algorithms based on the centralized Nesterov gradient algorithm and establish their convergence rates in terms of the per-node communications K\mathcal{K} and the per-node gradient evaluations kk. Our first method, Distributed Nesterov Gradient, achieves rates O(logK/K)O\left({\log \mathcal{K}}/{\mathcal{K}}\right) and O(logk/k)O\left({\log k}/{k}\right). Our second method, Distributed Nesterov gradient with Consensus iterations, assumes at all nodes knowledge of LL and μ(W)\mu(W) -- the second largest singular value of the N×NN \times N doubly stochastic weight matrix WW. It achieves rates O(1/K2ξ)O\left({1}/{\mathcal{K}^{2-\xi}}\right) and O(1/k2)O\left({1}/{k^2}\right) (ξ>0\xi>0 arbitrarily small). Further, we give with both methods explicit dependence of the convergence constants on NN and WW. Simulation examples illustrate our findings.

Keywords

Cite

@article{arxiv.1112.2972,
  title  = {Fast Distributed Gradient Methods},
  author = {Dusan Jakovetic and Joao Xavier and Jose M. F. Moura},
  journal= {arXiv preprint arXiv:1112.2972},
  year   = {2014}
}
R2 v1 2026-06-21T19:50:41.931Z