English

Approximating Sparse Graphs: The Random Overlapping Communities Model

Combinatorics 2022-02-07 v3 Discrete Mathematics

Abstract

How can we approximate sparse graphs and sequences of sparse graphs (with unbounded average degree)? We consider convergence in the first kk moments of the graph spectrum (equivalent to the numbers of closed kk-walks) appropriately normalized. We introduce a simple, easy to sample, random graph model that captures the limiting spectra of many sequences of interest, including the sequence of hypercube graphs. The Random Overlapping Communities (ROC) model is specified by a distribution on pairs (s,q)(s,q), sZ+,q(0,1]s \in \mathbb{Z}_+, q \in (0,1]. A graph on nn vertices with average degree dd is generated by repeatedly picking pairs (s,q)(s,q) from the distribution, adding an Erd\H{o}s-R\'{e}nyi random graph of edge density qq on a subset of vertices chosen by including each vertex with probability s/ns/n, and repeating this process so that the expected degree is dd. Our proof of convergence to a ROC random graph is based on the Stieltjes moment condition. We also show that the model is an effective approximation for individual graphs. For almost all possible triangle-to-edge and four-cycle-to-edge ratios, there exists a pair (s,q)(s,q) such that the ROC model with this single community type produces graphs with both desired ratios, a property that cannot be achieved by stochastic block models of bounded description size. Moreover, ROC graphs exhibit an inverse relationship between degree and clustering coefficient, a characteristic of many real-world networks.

Keywords

Cite

@article{arxiv.1802.03652,
  title  = {Approximating Sparse Graphs: The Random Overlapping Communities Model},
  author = {Samantha Petti and Santosh S. Vempala},
  journal= {arXiv preprint arXiv:1802.03652},
  year   = {2022}
}

Comments

This paper subsumes the paper "Random Overlapping Communities: Approximating Motif Densities of Large Graphs," arXiv:1709.09477