English

Exact Exponential Algorithms for Clustering Problems

Data Structures and Algorithms 2022-08-16 v1

Abstract

In this paper we initiate a systematic study of exact algorithms for well-known clustering problems, namely kk-Median and kk-Means. In kk-Median, the input consists of a set XX of nn points belonging to a metric space, and the task is to select a subset CXC \subseteq X of kk points as centers, such that the sum of the distances of every point to its nearest center is minimized. In kk-Means, the objective is to minimize the sum of squares of the distances instead. It is easy to design an algorithm running in time maxkn(nk)nO(1)=O(2n)\max_{k\leq n} {n \choose k} n^{O(1)} = O^*(2^n) (O()O^*(\cdot) notation hides polynomial factors in nn). We design first non-trivial exact algorithms for these problems. In particular, we obtain an O((1.89)n)O^*((1.89)^n) time exact algorithm for kk-Median that works for any value of kk. Our algorithm is quite general in that it does not use any properties of the underlying (metric) space -- it does not even require the distances to satisfy the triangle inequality. In particular, the same algorithm also works for kk-Means. We complement this result by showing that the running time of our algorithm is asymptotically optimal, up to the base of the exponent. That is, unless ETH fails, there is no algorithm for these problems running in time 2o(n)nO(1)2^{o(n)} \cdot n^{O(1)}. Finally, we consider the "supplier" versions of these clustering problems, where, in addition to the set XX we are additionally given a set of mm candidate centers FF, and objective is to find a subset of kk centers from FF. The goal is still to minimize the kk-Median/kk-Means/kk-Center objective. For these versions we give a O(2n(mn)O(1))O(2^n (mn)^{O(1)}) time algorithms using subset convolution. We complement this result by showing that, under the Set Cover Conjecture, the supplier versions of these problems do not admit an exact algorithm running in time 2(1ϵ)n(mn)O(1)2^{(1-\epsilon) n} (mn)^{O(1)}.

Keywords

Cite

@article{arxiv.2208.06847,
  title  = {Exact Exponential Algorithms for Clustering Problems},
  author = {Fedor V. Fomin and Petr A. Golovach and Tanmay Inamdar and Nidhi Purohit and Saket Saurabh},
  journal= {arXiv preprint arXiv:2208.06847},
  year   = {2022}
}

Comments

Accepted to IPEC 2022