Efficient Adaptive Federated Optimization

Su Hyeong Lee; Sidharth Sharma; Manzil Zaheer; Tian Li

Efficient Adaptive Federated Optimization

Machine Learning 2025-10-28 v3 Distributed, Parallel, and Cluster Computing Optimization and Control

Authors: Su Hyeong Lee , Sidharth Sharma , Manzil Zaheer , Tian Li

Abstract

Adaptive optimization is critical in federated learning, where enabling adaptivity on both the server and client sides has proven essential for achieving optimal performance. However, the scalability of such jointly adaptive systems is often hindered by resource limitations in communication and memory. In this paper, we introduce a class of efficient adaptive algorithms, named $FedAda^2$ and its enhanced version $FedAda^2$ ++, designed specifically for large-scale, cross-device federated environments. $FedAda^2$ optimizes communication efficiency by avoiding the transfer of preconditioners between the server and clients. Additionally, $FedAda^2$ ++ extends this approach by incorporating memory-efficient adaptive optimizers on the client side, further reducing on-device memory usage. Theoretically, we demonstrate that $FedAda^2$ and $FedAda^2$ ++ achieve the same convergence rates for general, non-convex objectives as its more resource-intensive counterparts that directly integrate joint adaptivity. Extensive empirical evaluations on image and text datasets demonstrate both the advantages of joint adaptivity and the effectiveness of $FedAda^2$ / $FedAda^2$ ++.

Keywords

federated learning optimization hyperparameter optimization

Cite

@article{arxiv.2410.18117,
  title  = {Efficient Adaptive Federated Optimization},
  author = {Su Hyeong Lee and Sidharth Sharma and Manzil Zaheer and Tian Li},
  journal= {arXiv preprint arXiv:2410.18117},
  year   = {2025}
}

Comments

Accepted to NeurIPS 2025

Efficient Adaptive Federated Optimization

Abstract

Keywords

Cite

Comments

Related papers