Maximum Coverage in the Data Stream Model: Parameterized and Generalized
Abstract
We present algorithms for the Max-Cover and Max-Unique-Cover problems in the data stream model. The input to both problems are subsets of a universe of size and a value . In Max-Cover, the problem is to find a collection of at most sets such that the number of elements covered by at least one set is maximized. In Max-Unique-Cover, the problem is to find a collection of at most sets such that the number of elements covered by exactly one set is maximized. Our goal is to design single-pass algorithms that use space that is sublinear in the input size. Our main algorithmic results are: If the sets have size at most , there exist single-pass algorithms using space that solve both problems exactly. This is optimal up to polylogarithmic factors for constant . If each element appears in at most sets, we present single pass algorithms using space that return a approximation in the case of Max-Cover. We also present a single-pass algorithm using slightly more memory, i.e., space, that approximates Max-Unique-Cover. In contrast to the above results, when and are arbitrary, any constant pass approximation algorithm for either problem requires space but a single pass space algorithm exists. In fact any constant-pass algorithm with an approximation better than and for Max-Cover and Max-Unique-Cover respectively requires space when and are unrestricted. En route, we also obtain an algorithm for a parameterized version of the streaming Set-Cover problem.
Cite
@article{arxiv.2102.08476,
title = {Maximum Coverage in the Data Stream Model: Parameterized and Generalized},
author = {Andrew McGregor and David Tench and Hoa T. Vu},
journal= {arXiv preprint arXiv:2102.08476},
year = {2021}
}
Comments
Conference version to appear at ICDT 2021