English

Fully-Dynamic Coresets

Data Structures and Algorithms 2020-09-29 v3

Abstract

With input sizes becoming massive, coresets -- small yet representative summary of the input -- are relevant more than ever. A weighted set CwC_w that is a subset of the input is an ε\varepsilon-coreset if the cost of any feasible solution SS with respect to CwC_w is within [1±ε][1 {\pm} \varepsilon] of the cost of SS with respect to the original input. We give a very general technique to compute coresets in the fully-dynamic setting where input points can be added or deleted. Given a static ε\varepsilon-coreset algorithm that runs in time t(n,ε,λ)t(n, \varepsilon, \lambda) and computes a coreset of size s(n,ε,λ)s(n, \varepsilon, \lambda), where nn is the number of input points and 1λ1 {-}\lambda is the success probability, we give a fully-dynamic algorithm that computes an ε\varepsilon-coreset with worst-case update time O((logn)t(s(n,ε/logn,λ/n),ε/logn,λ/n))O((\log n) \cdot t(s(n, \varepsilon/\log n, \lambda/n), \varepsilon/\log n, \lambda/n) ) (this bound is stated informally), where the success probability is 1λ1{-}\lambda. Our technique is a fully-dynamic analog of the merge-and-reduce technique that applies to insertion-only setting. Although our space usage is O(n)O(n), we work in the presence of an adaptive adversary, and we show that Ω(n)\Omega(n) space is required when adversary is adaptive. As a consequence, we get fully-dynamic ε\varepsilon-coreset algorithms for kk-median and kk-means with worst-case update time O(ε2k2log5nlog3k)O(\varepsilon^{-2}k^2\log^5 n \log^3 k) and coreset size O(ε2klognlog2k)O(\varepsilon^{-2}k\log n \log^2 k) ignoring loglogn\log \log n and log(1/ε)\log(1/\varepsilon) factors and assuming that ε,λ=Ω(1/\varepsilon, \lambda = \Omega(1/poly(n))(n)). These are the first fully-dynamic algorithms for kk-median and kk-means with worst-case update times O(O(poly(k,logn,ε1))(k, \log n, \varepsilon^{-1})). We also give conditional lower bound on update/query time for any fully-dynamic (4δ)(4 - \delta)-approximation algorithm for kk-means.

Keywords

Cite

@article{arxiv.2004.14891,
  title  = {Fully-Dynamic Coresets},
  author = {Monika Henzinger and Sagar Kale},
  journal= {arXiv preprint arXiv:2004.14891},
  year   = {2020}
}

Comments

Added missed important reference. Abstract is shortened