English

Distributed Learning with Sparse Communications by Identification

Optimization and Control 2020-06-26 v2 Distributed, Parallel, and Cluster Computing

Abstract

In distributed optimization for large-scale learning, a major performance limitation comes from the communications between the different entities. When computations are performed by workers on local data while a coordinator machine coordinates their updates to minimize a global loss, we present an asynchronous optimization algorithm that efficiently reduces the communications between the coordinator and workers. This reduction comes from a random sparsification of the local updates. We show that this algorithm converges linearly in the strongly convex case and also identifies optimal strongly sparse solutions. We further exploit this identification to propose an automatic dimension reduction, aptly sparsifying all exchanges between coordinator and workers.

Keywords

Cite

@article{arxiv.1812.03871,
  title  = {Distributed Learning with Sparse Communications by Identification},
  author = {Dmitry Grishchenko and Franck Iutzeler and Jérôme Malick and Massih-Reza Amini},
  journal= {arXiv preprint arXiv:1812.03871},
  year   = {2020}
}

Comments

v2 is a significant improvement over v1 (titled "Asynchronous Distributed Learning with Sparse Communications and Identification") with new algorithms, results, and discussions