Towards a Converged Relational-Graph Optimization Framework

Databases 2024-12-10 v3

Abstract

The recent ISO SQL:2023 standard adopts SQL/PGQ (Property Graph Queries), which enables graph-style querying within relational databases. This advancement, however, exposes a significant gap: it is unclear how to optimize SQL/PGQ queries effectively within relational database systems. To address this gap, we extend the foundational SPJ (Select-Project-Join) queries to SPJM queries, which add a matching operator that represents graph pattern matching in SQL/PGQ. Although SPJM queries can be converted to SPJ queries and optimized using existing relational query optimizers, our analysis shows that such a graph-agnostic method fails to benefit from the graph-specific optimization techniques found in the literature. We therefore develop RelGo, a converged relational-graph optimization framework for SPJM queries that combines relational and graph query optimization techniques. Using DuckDB as the underlying relational execution engine, our experiments show that RelGo can generate efficient execution plans for SPJM queries. On well-established benchmarks, these plans exhibit an average speedup of 21.90x compared to those produced by the graph-agnostic optimizer.
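
To make the graph-agnostic baseline concrete, the sketch below shows how a two-hop pattern such as (a)-[knows]->(b)-[knows]->(c) can be evaluated as a plain SPJ query, that is, as joins over a vertex table and an edge table. It is only an illustration of the conversion the abstract mentions, not RelGo's implementation: the `person` and `knows` tables, their columns, and both function names are hypothetical, and SQLite from the Python standard library stands in for DuckDB so that the example is self-contained.

```python
import sqlite3


def build_demo_db():
    """Create a tiny in-memory graph stored relationally (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE person (id INTEGER PRIMARY KEY, name TEXT);
        CREATE TABLE knows (src INTEGER, dst INTEGER);
        INSERT INTO person VALUES (1, 'Ann'), (2, 'Bob'), (3, 'Cid');
        INSERT INTO knows VALUES (1, 2), (2, 3), (3, 1);
        """
    )
    return conn


def match_two_hop_as_joins(conn):
    """Evaluate (a)-[knows]->(b)-[knows]->(c) using only relational joins.

    Each pattern edge becomes one scan of the edge table, and each pattern
    vertex becomes one join with the vertex table. The relational optimizer
    sees five joined relations and no graph structure.
    """
    sql = """
        SELECT pa.name, pb.name, pc.name
        FROM knows AS k1
        JOIN knows AS k2 ON k1.dst = k2.src
        JOIN person AS pa ON pa.id = k1.src
        JOIN person AS pb ON pb.id = k1.dst
        JOIN person AS pc ON pc.id = k2.dst
        ORDER BY pa.name, pb.name, pc.name
    """
    return conn.execute(sql).fetchall()


if __name__ == "__main__":
    for row in match_two_hop_as_joins(build_demo_db()):
        print(row)
```

Even this small pattern expands into five joined relations, which hints at why a join-order search that is unaware of the underlying pattern can miss the graph-specific optimizations the abstract refers to.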

Cite

@article{arxiv.2408.13480,
  title  = {Towards a Converged Relational-Graph Optimization Framework},
  author = {Yunkai Lou and Longbin Lai and Bingqing Lyu and Yufan Yang and Xiaoli Zhou and Wenyuan Yu and Ying Zhang and Jingren Zhou},
  journal= {arXiv preprint arXiv:2408.13480},
  year   = {2024}
}