A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization

Xinlei Yi; Shengjun Zhang; Tao Yang; Tianyou Chai; Karl H. Johansson

A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization

Optimization and Control 2022-01-11 v3

Authors: Xinlei Yi , Shengjun Zhang , Tao Yang , Tianyou Chai , Karl H. Johansson

Abstract

The distributed nonconvex optimization problem of minimizing a global cost function formed by a sum of $n$ local cost functions by using local information exchange is considered. This problem is an important component of many machine learning techniques with data parallelism, such as deep learning and federated learning. We propose a distributed primal--dual stochastic gradient descent (SGD) algorithm, suitable for arbitrarily connected communication networks and any smooth (possibly nonconvex) cost functions. We show that the proposed algorithm achieves the linear speedup convergence rate $\mathcal{O}(1/\sqrt{nT})$ for general nonconvex cost functions and the linear speedup convergence rate $\mathcal{O}(1/(nT))$ when the global cost function satisfies the Polyak--{\L}ojasiewicz (P--{\L}) condition, where $T$ is the total number of iterations. We also show that the output of the proposed algorithm with constant parameters linearly converges to a neighborhood of a global optimum. We demonstrate through numerical experiments the efficiency of our algorithm in comparison with the baseline centralized SGD and recently proposed distributed SGD algorithms.

Keywords

stochastic gradient descent distributed optimization primal-dual optimization

Cite

@article{arxiv.2006.03474,
  title  = {A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization},
  author = {Xinlei Yi and Shengjun Zhang and Tao Yang and Tianyou Chai and Karl H. Johansson},
  journal= {arXiv preprint arXiv:2006.03474},
  year   = {2022}
}

Comments

arXiv admin note: text overlap with arXiv:1912.12110

A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization

Abstract

Keywords

Cite

Comments

Related papers