English

PyClustrPath: An efficient Python package for generating clustering paths with GPU acceleration

Optimization and Control 2025-01-28 v1

Abstract

Convex clustering is a popular clustering model without requiring the number of clusters as prior knowledge. It can generate a clustering path by continuously solving the model with a sequence of regularization parameter values. This paper introduces {\it PyClustrPath}, a highly efficient Python package for solving the convex clustering model with GPU acceleration. {\it PyClustrPath} implements popular first-order and second-order algorithms with a clean modular design. Such a design makes {\it PyClustrPath} more scalable to incorporate new algorithms for solving the convex clustering model in the future. We extensively test the numerical performance of {\it PyClustrPath} on popular clustering datasets, demonstrating its superior performance compared to the existing solvers for generating the clustering path based on the convex clustering model. The implementation of {\it PyClustrPath} can be found at: https://github.com/D3IntOpt/PyClustrPath.

Keywords

Cite

@article{arxiv.2501.15964,
  title  = {PyClustrPath: An efficient Python package for generating clustering paths with GPU acceleration},
  author = {Hongfei Wu and Yancheng Yuan},
  journal= {arXiv preprint arXiv:2501.15964},
  year   = {2025}
}

Comments

13 pages, 10 figures, 3 tables