Better Streaming Algorithms for the Maximum Coverage Problem
Abstract
We study the classic NP-Hard problem of finding the maximum -set coverage in the data stream model: given a set system of sets that are subsets of a universe , find the sets that cover the most number of distinct elements. The problem can be approximated up to a factor in polynomial time. In the streaming-set model, the sets and their elements are revealed online. The main goal of our work is to design algorithms, with approximation guarantees as close as possible to , that use sublinear space . Our main results are: Two approximation algorithms: One uses passes and space whereas the other uses only a single pass but space. We show that any approximation factor better than in constant passes requires space for constant even if the algorithm is allowed unbounded processing time. We also demonstrate a single-pass, approximation algorithm using space. We also study the maximum -vertex coverage problem in the dynamic graph stream model. In this model, the stream consists of edge insertions and deletions of a graph on vertices. The goal is to find vertices that cover the most number of distinct edges. We show that any constant approximation in constant passes requires space for constant whereas space is sufficient for a approximation and arbitrary in a single pass. For regular graphs, we show that space is sufficient for a approximation in a single pass. We generalize this to a approximation when the ratio between the minimum and maximum degree is bounded below by .
Cite
@article{arxiv.1610.06199,
title = {Better Streaming Algorithms for the Maximum Coverage Problem},
author = {Andrew McGregor and Hoa T. Vu},
journal= {arXiv preprint arXiv:1610.06199},
year = {2018}
}
Comments
- A preliminary version appeared in ICDT 2017 - Fix typos