English

Hyperedge Estimation using Polylogarithmic Subset Queries

Data Structures and Algorithms 2020-09-08 v4

Abstract

In this work, we estimate the number of hyperedges in a hypergraph H(U(H),F(H)){\cal H}(U({\cal H}), {\cal F}({\cal H})), where U(H)U({\cal H}) denotes the set of vertices and F(H)){\cal F}({\cal H})) denotes the set of hyperedges. We assume a query oracle access to the hypergraph H{\cal H}. Estimating the number of edges, triangles or small subgraphs in a graph is a well studied problem. Beame \etal~and Bhattacharya \etal~gave algorithms to estimate the number of edges and triangles in a graph using queries to the {\sc Bipartite Independent Set} ({\sc BIS}) and the {\sc Tripartite Independent Set} ({\sc TIS}) oracles, respectively. We generalize the earlier works by estimating the number of hyperedges using a query oracle, known as the {\bf Generalized dd-partite independent set oracle ({\sc GPIS})}, that takes dd (non-empty) pairwise disjoint subsets of vertices A1,,AdU(H)A_1,\ldots,A_d \subseteq U({\cal H}) as input, and answers whether there exists a hyperedge in H{\cal H} having (exactly) one vertex in each Ai,i{1,2,,d}A_i, i \in \{1,2,\ldots,d\}. We give a randomized algorithm for the hyperedge estimation problem using the {\sc GPIS} query oracle to output m^\widehat{m} for m(H)m({\cal H}) satisfying (1ϵ)m(H)m^(1+ϵ)m(H)(1-\epsilon) \cdot m({\cal H}) \leq \widehat{m} \leq (1+\epsilon) \cdot m({\cal H}). The number of queries made by our algorithm, assuming dd to be a constant, is polylogarithmic in the number of vertices of the hypergraph.

Keywords

Cite

@article{arxiv.1908.04196,
  title  = {Hyperedge Estimation using Polylogarithmic Subset Queries},
  author = {Anup Bhattacharya and Arijit Bishnu and Arijit Ghosh and Gopinath Mishra},
  journal= {arXiv preprint arXiv:1908.04196},
  year   = {2020}
}

Comments

34 pages

R2 v1 2026-06-23T10:45:18.035Z