Mini-Batch Kernel $k$-means

Ben Jourdan; Gregory Schwartzman

Mini-Batch Kernel $k$-means

Machine Learning 2024-10-10 v1 Artificial Intelligence Data Structures and Algorithms

Authors: Ben Jourdan , Gregory Schwartzman

Abstract

We present the first mini-batch kernel $k$ -means algorithm, offering an order of magnitude improvement in running time compared to the full batch algorithm. A single iteration of our algorithm takes $\widetilde{O}(kb^2)$ time, significantly faster than the $O(n^2)$ time required by the full batch kernel $k$ -means, where $n$ is the dataset size and $b$ is the batch size. Extensive experiments demonstrate that our algorithm consistently achieves a 10-100x speedup with minimal loss in quality, addressing the slow runtime that has limited kernel $k$ -means adoption in practice. We further complement these results with a theoretical analysis under an early stopping condition, proving that with a batch size of $\widetilde{\Omega}(\max \{\gamma^{4}, \gamma^{2}\} \cdot \epsilon^{-2})$ , the algorithm terminates in $O(\gamma^2/\epsilon)$ iterations with high probability, where $\gamma$ bounds the norm of points in feature space and $\epsilon$ is a termination threshold. Our analysis holds for any reasonable center initialization, and when using $k$ -means++ initialization, the algorithm achieves an approximation ratio of $O(\log k)$ in expectation. For normalized kernels, such as Gaussian or Laplacian it holds that $\gamma=1$ . Taking $\epsilon = O(1)$ and $b=\Theta(\log n)$ , the algorithm terminates in $O(1)$ iterations, with each iteration running in $\widetilde{O}(k)$ time.

Keywords

graph algorithm

Cite

@article{arxiv.2410.05902,
  title  = {Mini-Batch Kernel $k$-means},
  author = {Ben Jourdan and Gregory Schwartzman},
  journal= {arXiv preprint arXiv:2410.05902},
  year   = {2024}
}

Comments

arXiv admin note: text overlap with arXiv:2304.00419

Mini-Batch Kernel $k$-means

Abstract

Keywords

Cite

Comments

Related papers