English

Exoshuffle-CloudSort

Distributed, Parallel, and Cluster Computing 2023-01-11 v1 Operating Systems

Abstract

We present Exoshuffle-CloudSort, a sorting application running on top of Ray using the Exoshuffle architecture. Exoshuffle-CloudSort runs on Amazon EC2, with input and output data stored on Amazon S3. Using 40 i4i.4xlarge workers, Exoshuffle-CloudSort completes the 100 TB CloudSort Benchmark (Indy category) in 5378 seconds, with an average total cost of $97.

Cite

@article{arxiv.2301.03734,
  title  = {Exoshuffle-CloudSort},
  author = {Frank Sifei Luan and Stephanie Wang and Samyukta Yagati and Sean Kim and Kenneth Lien and Isaac Ong and Tony Hong and SangBin Cho and Eric Liang and Ion Stoica},
  journal= {arXiv preprint arXiv:2301.03734},
  year   = {2023}
}
R2 v1 2026-06-28T08:08:09.923Z