English

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

Machine Learning 2020-07-23 v1 Machine Learning

Abstract

Spectral clustering has shown a superior performance in analyzing the cluster structure. However, its computational complexity limits its application in analyzing large-scale data. To address this problem, many low-rank matrix approximating algorithms are proposed, including the Nystrom method - an approach with proven approximate error bounds. There are several algorithms that provide recipes to construct Nystrom approximations with variable accuracies and computing times. This paper proposes a scalable Nystrom-based clustering algorithm with a new sampling procedure, Centroid Minimum Sum of Squared Similarities (CMS3), and a heuristic on when to use it. Our heuristic depends on the eigen spectrum shape of the dataset, and yields competitive low-rank approximations in test datasets compared to the other state-of-the-art methods

Keywords

Cite

@article{arxiv.2007.11416,
  title  = {Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling},
  author = {Djallel Bouneffouf},
  journal= {arXiv preprint arXiv:2007.11416},
  year   = {2020}
}
R2 v1 2026-06-23T17:18:55.908Z