Diversity-aware clustering: Computational Complexity and Approximation Algorithms

Suhas Thejaswi; Ameet Gadekar; Bruno Ordozgoiti; Aristides Gionis

Diversity-aware clustering: Computational Complexity and Approximation Algorithms

Data Structures and Algorithms 2025-05-21 v3 Artificial Intelligence Computational Complexity Machine Learning

Authors: Suhas Thejaswi , Ameet Gadekar , Bruno Ordozgoiti , Aristides Gionis

Abstract

In this work, we study diversity-aware clustering problems where the data points are associated with multiple attributes resulting in intersecting groups. A clustering solution needs to ensure that the number of chosen cluster centers from each group should be within the range defined by a lower and upper bound threshold for each group, while simultaneously minimizing the clustering objective, which can be either $k$ -median, $k$ -means or $k$ -supplier. We study the computational complexity of the proposed problems, offering insights into their NP-hardness, polynomial-time inapproximability, and fixed-parameter intractability. We present parameterized approximation algorithms with approximation ratios $1+ \frac{2}{e} + \epsilon \approx 1.736$ , $1+\frac{8}{e} + \epsilon \approx 3.943$ , and $5$ for diversity-aware $k$ -median, diversity-aware $k$ -means and diversity-aware $k$ -supplier, respectively. Assuming Gap-ETH, the approximation ratios are tight for the diversity-aware $k$ -median and diversity-aware $k$ -means problems. Our results imply the same approximation factors for their respective fair variants with disjoint groups -- fair $k$ -median, fair $k$ -means, and fair $k$ -supplier -- with lower bound requirements.

Keywords

cluster analysis graph clustering approximation algorithm

Cite

@article{arxiv.2401.05502,
  title  = {Diversity-aware clustering: Computational Complexity and Approximation Algorithms},
  author = {Suhas Thejaswi and Ameet Gadekar and Bruno Ordozgoiti and Aristides Gionis},
  journal= {arXiv preprint arXiv:2401.05502},
  year   = {2025}
}

Comments

Algorithmic Fairness, Fair Clustering, Diversity-aware Clustering, Intersectionaly, Subgroup fairness

Diversity-aware clustering: Computational Complexity and Approximation Algorithms

Abstract

Keywords

Cite

Comments

Related papers