English

Accelerating XOR-based Erasure Coding using Program Optimization Techniques

Programming Languages 2021-08-06 v1 Distributed, Parallel, and Cluster Computing Performance

Abstract

Erasure coding (EC) affords data redundancy for large-scale systems. XOR-based EC is an easy-to-implement method for optimizing EC. This paper addresses a significant performance gap between the state-of-the-art XOR-based EC approach (with 4.9 GB/s coding throughput) and Intel's high-performance EC library based on another approach (with 6.7 GB/s). We propose a novel approach based on our observation that XOR-based EC virtually generates programs of a Domain Specific Language for XORing byte arrays. We formalize such programs as straight-line programs (SLPs) of compiler construction and optimize SLPs using various optimization techniques. Our optimization flow is three-fold: 1) reducing operations using grammar compression algorithms; 2) reducing memory accesses using deforestation, a functional program optimization method; and 3) reducing cache misses using the (red-blue) pebble game of program analysis. We provide an experimental library, which outperforms Intel's library with 8.92 GB/s throughput.

Keywords

Cite

@article{arxiv.2108.02692,
  title  = {Accelerating XOR-based Erasure Coding using Program Optimization Techniques},
  author = {Yuya Uezato},
  journal= {arXiv preprint arXiv:2108.02692},
  year   = {2021}
}

Comments

18 pages. Author's version of a paper accepted at SC'21 https://sc21.supercomputing.org/