English

Asynchronous Execution of Python Code on Task Based Runtime Systems

Programming Languages 2019-03-08 v2

Abstract

Despite advancements in the areas of parallel and distributed computing, the complexity of programming on High Performance Computing (HPC) resources has deterred many domain experts, especially in the areas of machine learning and artificial intelligence (AI), from utilizing performance benefits of such systems. Researchers and scientists favor high-productivity languages to avoid the inconvenience of programming in low-level languages and costs of acquiring the necessary skills required for programming at this level. In recent years, Python, with the support of linear algebra libraries like NumPy, has gained popularity despite facing limitations which prevent this code from distributed runs. Here we present a solution which maintains both high level programming abstractions as well as parallel and distributed efficiency. Phylanx, is an asynchronous array processing toolkit which transforms Python and NumPy operations into code which can be executed in parallel on HPC resources by mapping Python and NumPy functions and variables into a dependency tree executed by HPX, a general purpose, parallel, task-based runtime system written in C++. Phylanx additionally provides introspection and visualization capabilities for debugging and performance analysis. We have tested the foundations of our approach by comparing our implementation of widely used machine learning algorithms to accepted NumPy standards.

Keywords

Cite

@article{arxiv.1810.07591,
  title  = {Asynchronous Execution of Python Code on Task Based Runtime Systems},
  author = {R. Tohid and Bibek Wagle and Shahrzad Shirzad and Patrick Diehl and Adrian Serio and Alireza Kheirkhahan and Parsa Amini and Katy Williams and Kate Isaacs and Kevin Huck and Steven Brandt and Hartmut Kaiser},
  journal= {arXiv preprint arXiv:1810.07591},
  year   = {2019}
}