English

Dynamic Loop Scheduling Using MPI Passive-Target Remote Memory Access

Distributed, Parallel, and Cluster Computing 2019-01-10 v1

Abstract

Scientific applications often contain large computationally-intensive parallel loops. Loop scheduling techniques aim to achieve load balanced executions of such applications. For distributed-memory systems, existing dynamic loop scheduling (DLS) libraries are typically MPI-based, and employ a master-worker execution model to assign variably-sized chunks of loop iterations. The master-worker execution model may adversely impact performance due to the master-level contention. This work proposes a distributed chunk-calculation approach that does not require the master-worker execution scheme. Moreover, it considers the novel features in the latest MPI standards, such as passive-target remote memory access, shared-memory window creation, and atomic read-modify-write operations. To evaluate the proposed approach, five well-known DLS techniques, two applications, and two heterogeneous hardware setups have been considered. The DLS techniques implemented using the proposed approach outperformed their counterparts implemented using the traditional master-worker execution model.

Keywords

Cite

@article{arxiv.1901.02773,
  title  = {Dynamic Loop Scheduling Using MPI Passive-Target Remote Memory Access},
  author = {Ahmed Eleliemy and Florina M. Ciorba},
  journal= {arXiv preprint arXiv:1901.02773},
  year   = {2019}
}
R2 v1 2026-06-23T07:07:08.540Z