English

Federated Composite Optimization

Machine Learning 2021-06-08 v3 Distributed, Parallel, and Cluster Computing Optimization and Control Machine Learning

Abstract

Federated Learning (FL) is a distributed learning paradigm that scales on-device learning collaboratively and privately. Standard FL algorithms such as FedAvg are primarily geared towards smooth unconstrained settings. In this paper, we study the Federated Composite Optimization (FCO) problem, in which the loss function contains a non-smooth regularizer. Such problems arise naturally in FL applications that involve sparsity, low-rank, monotonicity, or more general constraints. We first show that straightforward extensions of primal algorithms such as FedAvg are not well-suited for FCO since they suffer from the "curse of primal averaging," resulting in poor convergence. As a solution, we propose a new primal-dual algorithm, Federated Dual Averaging (FedDualAvg), which by employing a novel server dual averaging procedure circumvents the curse of primal averaging. Our theoretical analysis and empirical experiments demonstrate that FedDualAvg outperforms the other baselines.

Keywords

Cite

@article{arxiv.2011.08474,
  title  = {Federated Composite Optimization},
  author = {Honglin Yuan and Manzil Zaheer and Sashank Reddi},
  journal= {arXiv preprint arXiv:2011.08474},
  year   = {2021}
}

Comments

Accepted to ICML 2021. Code repository see https://github.com/hongliny/FCO-ICML21