English

Clustering via Self-Supervised Diffusion

Artificial Intelligence 2025-07-31 v2 Computer Vision and Pattern Recognition

Abstract

Diffusion models, widely recognized for their success in generative tasks, have not yet been applied to clustering. We introduce Clustering via Diffusion (CLUDI), a self-supervised framework that combines the generative power of diffusion models with pre-trained Vision Transformer features to achieve robust and accurate clustering. CLUDI is trained via a teacher-student paradigm: the teacher uses stochastic diffusion-based sampling to produce diverse cluster assignments, which the student refines into stable predictions. This stochasticity acts as a novel data augmentation strategy, enabling CLUDI to uncover intricate structures in high-dimensional data. Extensive evaluations on challenging datasets demonstrate that CLUDI achieves state-of-the-art performance in unsupervised classification, setting new benchmarks in clustering robustness and adaptability to complex data distributions. Our code is available at https://github.com/BGU-CS-VIL/CLUDI.

Keywords

Cite

@article{arxiv.2507.04283,
  title  = {Clustering via Self-Supervised Diffusion},
  author = {Roy Uziel and Irit Chelly and Oren Freifeld and Ari Pakman},
  journal= {arXiv preprint arXiv:2507.04283},
  year   = {2025}
}