Sliding Window Algorithms for k-Clustering Problems

Michele Borassi; Alessandro Epasto; Silvio Lattanzi; Sergei Vassilvitskii; Morteza Zadimoghaddam

Data Structures and Algorithms 2020-10-26 v2

Abstract

The sliding window model of computation captures scenarios in which data is arriving continuously, but only the latest $w$ elements should be used for analysis. The goal is to design algorithms that update the solution efficiently with each arrival rather than recomputing it from scratch. In this work, we focus on $k$-clustering problems such as $k$-means and $k$-median. In this setting, we provide simple and practical algorithms that offer stronger performance guarantees than previous results. Empirically, we show that our methods store only a small fraction of the data, are orders of magnitude faster, and find solutions with costs only slightly higher than those returned by algorithms with access to the full dataset.
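
To make the sliding window model concrete, the following is a minimal Python sketch of the naive baseline the abstract contrasts against: it stores all of the latest $w$ points and evaluates the $k$-means cost of a given set of centers over the whole window. It is not the paper's algorithm, which stores only a small fraction of the data. The names `NaiveSlidingWindow` and `kmeans_cost` are hypothetical and chosen only for illustration.

```python
from collections import deque


def kmeans_cost(points, centers):
    """Sum of squared Euclidean distances from each point to its nearest center."""
    return sum(
        min(sum((p - c) ** 2 for p, c in zip(point, center)) for center in centers)
        for point in points
    )


class NaiveSlidingWindow:
    """Illustrative baseline (an assumption, not the paper's method):
    keeps every one of the latest w points in memory."""

    def __init__(self, w):
        # A deque with maxlen=w discards the oldest point automatically
        # once more than w points have arrived.
        self.window = deque(maxlen=w)

    def insert(self, point):
        self.window.append(tuple(point))

    def cost(self, centers):
        # Evaluates the cost from scratch over the full window on every call.
        return kmeans_cost(self.window, centers)


# Hypothetical stream of 2-D points with a window of size w = 3.
stream = [(0.0, 0.0), (1.0, 0.0), (10.0, 0.0), (11.0, 0.0), (20.0, 0.0)]
sw = NaiveSlidingWindow(w=3)
for x in stream:
    sw.insert(x)

window_points = list(sw.window)
window_cost = sw.cost([(10.0, 0.0), (20.0, 0.0)])
```

After the five arrivals, only the three most recent points remain in the window, and the cost is computed over those alone. Both the memory use and the per-query time of this baseline grow linearly with $w$, which is the overhead that sliding window algorithms aim to avoid.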

Keywords

cluster analysis, optimization algorithm, randomized algorithm

Cite

@article{arxiv.2006.05850,
  title  = {Sliding Window Algorithms for k-Clustering Problems},
  author = {Michele Borassi and Alessandro Epasto and Silvio Lattanzi and Sergei Vassilvitskii and Morteza Zadimoghaddam},
  journal= {arXiv preprint arXiv:2006.05850},
  year   = {2020}
}

Comments

43 pages, 7 figures
