English

Universal and Asymptotically Optimal Data and Task Allocation in Distributed Computing

Information Theory 2026-01-12 v1 math.IT

Abstract

We study the joint minimization of communication and computation costs in distributed computing, where a master node coordinates NN workers to evaluate a function over a library of nn files. Assuming that the function is decomposed into an arbitrary subfunction set X\mathbf{X}, with each subfunction depending on dd input files, renders our distributed computing problem into a dd-uniform hypergraph edge partitioning problem wherein the edge set (subfunction set), defined by dd-wise dependencies between vertices (files) must be partitioned across NN disjoint groups (workers). The aim is to design a file and subfunction allocation, corresponding to a partition of X\mathbf{X}, that minimizes the communication cost πX\pi_{\mathbf{X}}, representing the maximum number of distinct files per server, while also minimizing the computation cost δX\delta_{\mathbf{X}} corresponding to a maximal worker subfunction load. For a broad range of parameters, we propose a deterministic allocation solution, the \emph{Interweaved-Cliques (IC) design}, whose information-theoretic-inspired interweaved clique structure simultaneously achieves order-optimal communication and computation costs, for a large class of decompositions X\mathbf{X}. This optimality is derived from our achievability and converse bounds, which reveal -- under reasonable assumptions on the density of X\mathbf{X} -- that the optimal scaling of the communication cost takes the form n/N1/dn/N^{1/d}, revealing that our design achieves the order-optimal \textit{partitioning gain} that scales as N1/dN^{1/d}, while also achieving an order-optimal computation cost. Interestingly, this order optimality is achieved in a deterministic manner, and very importantly, it is achieved blindly from X\mathbf{X}, therefore enabling multiple desired functions to be computed without reshuffling files.

Keywords

Cite

@article{arxiv.2601.05873,
  title  = {Universal and Asymptotically Optimal Data and Task Allocation in Distributed Computing},
  author = {Javad Maheri and K. K. Krishnan Namboodiri and Petros Elia},
  journal= {arXiv preprint arXiv:2601.05873},
  year   = {2026}
}

Comments

49 pages, 2 figures