English

A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs

Distributed, Parallel, and Cluster Computing 2020-06-16 v1

Abstract

Finding the Eulerian circuit in graphs is a classic problem, but inadequately explored for parallel computation. With such cycles finding use in neuroscience and Internet of Things for large graphs, designing a distributed algorithm for finding the Euler circuit is important. Existing parallel algorithms are impractical for commodity clusters and Clouds. We propose a novel partition-centric algorithm to find the Euler circuit, over large graphs partitioned across distributed machines and executed iteratively using a Bulk Synchronous Parallel (BSP) model. The algorithm finds partial paths and cycles within each partition, and refines these into longer paths by recursively merging the partitions. We describe the algorithm, analyze its complexity, validate it on Apache Spark for large graphs, and offer experimental results. We also identify memory bottlenecks in the algorithm and propose an enhanced design to address it.

Keywords

Cite

@article{arxiv.1903.06950,
  title  = {A Partition-centric Distributed Algorithm for Identifying Euler Circuits in Large Graphs},
  author = {Siddharth D Jaiswal and Yogesh Simmhan},
  journal= {arXiv preprint arXiv:1903.06950},
  year   = {2020}
}

Comments

To appear in Proceedings of 5th IEEE International Workshop on High-Performance Big Data, Deep Learning, and Cloud Computing, In conjunction with The 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2019), Rio de Janeiro, Brazil, May 20th, 2019