English

Work Sharing and Offloading for Efficient Approximate Threshold-based Vector Join

Databases 2026-03-18 v1

Abstract

Vector joins - finding all vector pairs between a set of query and data vectors whose distances are below a given threshold - are fundamental to modern vector and vector-relational database systems that power multimodal retrieval and semantic analytics. Existing state-of-the-art approach exploits work sharing among similar queries but still suffers from redundant index traversals and excessive distance computations. We propose a unified framework for efficient approximate vector joins that (1) introduces soft work sharing to reuse traversal results beyond the join results of previous queries, (2) builds a merged index over both query and data vectors to further speedup graph explorations, and (3) improves robustness for out-of-distribution queries through an adaptive hybrid search strategy. Experiments on eight datasets demonstrate substantial improvements in efficiency-recall trade-off over the state of the art.

Keywords

Cite

@article{arxiv.2603.16360,
  title  = {Work Sharing and Offloading for Efficient Approximate Threshold-based Vector Join},
  author = {Kyoungmin Kim and Lennart Roth and Liang Liang and Anastasia Ailamaki},
  journal= {arXiv preprint arXiv:2603.16360},
  year   = {2026}
}