English

On Range Summary Queries

Computational Geometry 2023-05-08 v1

Abstract

We study the query version of the approximate heavy hitter and quantile problems. In the former problem, the input is a parameter ε\varepsilon and a set PP of nn points in Rd\mathbb{R}^d where each point is assigned a color from a set CC, and we want to build a structure s.t. given any geometric range γ\gamma, we can efficiently find a list of approximate heavy hitters in γP\gamma\cap P, i.e., colors that appear at least εγP\varepsilon |\gamma \cap P| times in γP\gamma \cap P, as well as their frequencies with an additive error of εγP\varepsilon |\gamma \cap P|. In the latter problem, each point is assigned a weight from a totally ordered universe and the query must output a sequence SS of 1+1/ε1+1/\varepsilon weights s.t. the ii-th weight in SS has approximate rank iεγPi\varepsilon|\gamma\cap P|, meaning, rank iεγPi\varepsilon|\gamma\cap P| up to an additive error of εγP\varepsilon|\gamma\cap P|. Previously, optimal results were only known in 1D [WY11] but a few sub-optimal methods were available in higher dimensions [AW17, ACH+12]. We study the problems for 3D halfspace and dominance queries. We consider the real RAM model with integer registers of size w=Θ(logn)w=\Theta(\log n) bits. For dominance queries, we show optimal solutions for both heavy hitter and quantile problems: using linear space, we can answer both queries in time O(logn+1/ε)O(\log n + 1/\varepsilon). Note that as the output size is 1ε\frac{1}{\varepsilon}, after investing the initial O(logn)O(\log n) searching time, our structure takes on average O(1)O(1) time to find a heavy hitter or a quantile! For more general halfspace heavy hitter queries, the same optimal query time can be achieved by increasing the space by an extra logw1ε\log_w\frac{1}{\varepsilon} (resp. loglogw1ε\log\log_w\frac{1}{\varepsilon}) factor in 3D (resp. 2D). By spending extra logO(1)1ε\log^{O(1)}\frac{1}{\varepsilon} factors in time and space, we can also support quantile queries.

Keywords

Cite

@article{arxiv.2305.03180,
  title  = {On Range Summary Queries},
  author = {Peyman Afshani and Pingan Cheng and Aniket Basu Roy and Zhewei Wei},
  journal= {arXiv preprint arXiv:2305.03180},
  year   = {2023}
}

Comments

To appear in ICALP'23