English

Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects

Distributed, Parallel, and Cluster Computing 2026-03-25 v1 Performance

Abstract

Parallel applications with irregular and time-varying workloads often suffer from load imbalance. Dynamic load balancing techniques address this challenge by redistributing work during execution. We present a new type of distributed diffusion-based load balancing targeted at communication-intensive applications with persistently communicating objects. Leveraging the application's communication graph, our strategy reduces across-node communication while simultaneously distributing load effectively. We also propose an algorithmic variant for cases where the communication patterns are not readily available. We explore optimizations to our algorithm, and comparisons with other related load balancing strategies in simulation and on a Particle-in-Cell benchmark on up to 8 nodes of Perlmutter at NERSC.

Keywords

Cite

@article{arxiv.2603.23329,
  title  = {Communication-Aware Diffusion Load Balancing for Persistently Interacting Objects},
  author = {Maya Taylor and Kavitha Chandrasekar and Laxmikant V. Kale},
  journal= {arXiv preprint arXiv:2603.23329},
  year   = {2026}
}

Comments

8 pages, 6 figures. To appear in the Proceedings of PDSEC 2026 (workshop of the IEEE IPDPS 2026)