English

Graphulo: Linear Algebra Graph Kernels for NoSQL Databases

Data Structures and Algorithms 2016-11-11 v2 Databases

Abstract

Big data and the Internet of Things era continue to challenge computational systems. Several technology solutions such as NoSQL databases have been developed to deal with this challenge. In order to generate meaningful results from large datasets, analysts often use a graph representation which provides an intuitive way to work with the data. Graph vertices can represent users and events, and edges can represent the relationship between vertices. Graph algorithms are used to extract meaningful information from these very large graphs. At MIT, the Graphulo initiative is an effort to perform graph algorithms directly in NoSQL databases such as Apache Accumulo or SciDB, which have an inherently sparse data storage scheme. Sparse matrix operations have a history of efficient implementations and the Graph Basic Linear Algebra Subprogram (GraphBLAS) community has developed a set of key kernels that can be used to develop efficient linear algebra operations. However, in order to use the GraphBLAS kernels, it is important that common graph algorithms be recast using the linear algebra building blocks. In this article, we look at common classes of graph algorithms and recast them into linear algebra operations using the GraphBLAS building blocks.

Keywords

Cite

@article{arxiv.1508.07372,
  title  = {Graphulo: Linear Algebra Graph Kernels for NoSQL Databases},
  author = {Vijay Gadepally and Jake Bolewski and Dan Hook and Dylan Hutchison and Ben Miller and Jeremy Kepner},
  journal= {arXiv preprint arXiv:1508.07372},
  year   = {2016}
}

Comments

10 pages

R2 v1 2026-06-22T10:44:07.917Z