English

Distributed Random Reshuffling Methods with Improved Convergence

Optimization and Control 2025-03-18 v3 Machine Learning Multiagent Systems

Abstract

This paper proposes two distributed random reshuffling methods, namely Gradient Tracking with Random Reshuffling (GT-RR) and Exact Diffusion with Random Reshuffling (ED-RR), to solve the distributed optimization problem over a connected network, where a set of agents aim to minimize the average of their local cost functions. Both algorithms invoke random reshuffling (RR) update for each agent, inherit favorable characteristics of RR for minimizing smooth nonconvex objective functions, and improve the performance of previous distributed random reshuffling methods both theoretically and empirically. Specifically, both GT-RR and ED-RR achieve the convergence rate of O(1/[(1λ)1/3m1/3T2/3])O(1/[(1-\lambda)^{1/3}m^{1/3}T^{2/3}]) in driving the (minimum) expected squared norm of the gradient to zero, where TT denotes the number of epochs, mm is the sample size for each agent, and 1λ1-\lambda represents the spectral gap of the mixing matrix. When the objective functions further satisfy the Polyak-{\L}ojasiewicz (PL) condition, we show GT-RR and ED-RR both achieve O(1/[(1λ)mT2])O(1/[(1-\lambda)mT^2]) convergence rate in terms of the averaged expected differences between the agents' function values and the global minimum value. Notably, both results are comparable to the convergence rates of centralized RR methods (up to constant factors depending on the network topology) and outperform those of previous distributed random reshuffling algorithms.

Keywords

Cite

@article{arxiv.2306.12037,
  title  = {Distributed Random Reshuffling Methods with Improved Convergence},
  author = {Kun Huang and Linli Zhou and Shi Pu},
  journal= {arXiv preprint arXiv:2306.12037},
  year   = {2025}
}

Comments

16 pages, 8 figures, a short version would appear in IEEE TAC

R2 v1 2026-06-28T11:10:24.113Z