Sparse clustering via the Deterministic Information Bottleneck algorithm

Efthymios Costa; Ioanna Papatsouma; Angelos Markos

Sparse clustering via the Deterministic Information Bottleneck algorithm

Machine Learning 2026-04-14 v3 Machine Learning

Authors: Efthymios Costa , Ioanna Papatsouma , Angelos Markos

Abstract

Cluster analysis relates to the task of assigning objects into groups which ideally present some desirable characteristics. When a cluster structure is confined to a subset of the feature space, traditional clustering techniques face unprecedented challenges. We present an information theoretic framework that overcomes the problems associated with sparse data, allowing for joint feature weighting and clustering. Our proposal constitutes a competitive alternative to existing clustering algorithms for sparse data, as demonstrated through simulations on synthetic data. The effectiveness of our method is established by an application on a real-world genomics data set.

Keywords

clustering model-based clustering sampling algorithms

Cite

@article{arxiv.2601.20628,
  title  = {Sparse clustering via the Deterministic Information Bottleneck algorithm},
  author = {Efthymios Costa and Ioanna Papatsouma and Angelos Markos},
  journal= {arXiv preprint arXiv:2601.20628},
  year   = {2026}
}

Comments

Submitted to IFCS 2026 (8 pages total)

Sparse clustering via the Deterministic Information Bottleneck algorithm

Abstract

Keywords

Cite

Comments

Related papers