A Faster Algorithm for Constrained Correlation Clustering

Data Structures and Algorithms 2025-01-07 v1

Abstract

In the Correlation Clustering problem we are given $n$ nodes, and a preference for each pair of nodes indicating whether we prefer the two endpoints to be in the same cluster or not. The output is a clustering inducing the minimum number of violated preferences. In certain cases, however, the preference between some pairs may be too important to be violated. The constrained version of this problem specifies pairs of nodes that must be in the same cluster as well as pairs that must not be in the same cluster (hard constraints). The output clustering has to satisfy all hard constraints while minimizing the number of violated preferences. Constrained Correlation Clustering is APX-Hard and has been approximated within a factor 3 by van Zuylen et al. [SODA '07] using $\Omega(n^{3\omega})$ time. In this work, using a more combinatorial approach, we show how to approximate this problem significantly faster at the cost of a slightly weaker approximation factor. In particular, our algorithm runs in $\widetilde{O}(n^3)$ time and approximates Constrained Correlation Clustering within a factor 16. To achieve our result we need properties guaranteed by a particular influential algorithm for (unconstrained) Correlation Clustering, the CC-PIVOT algorithm. This algorithm chooses a pivot node $u$, creates a cluster containing $u$ and all its preferred nodes, and recursively solves the rest of the problem. As a byproduct of our work, we provide a derandomization of the CC-PIVOT algorithm that still achieves the 3-approximation; furthermore, we show that there exist instances where no ordering of the pivots can give a $(3-\varepsilon)$-approximation, for any constant $\varepsilon$. Finally, we introduce a node-weighted version of Correlation Clustering, which can be approximated within factor 3 using our insights on Constrained Correlation Clustering.
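The pivot step described in the abstract can be illustrated with a minimal Python sketch. It covers only the unconstrained, randomized CC-PIVOT idea (pick a pivot, cluster it with the nodes it prefers, repeat on the rest); it is not the paper's constrained algorithm or its derandomization. The function names, the `prefer_together` set representation, and the iterative loop in place of recursion are assumptions made for illustration.

```python
import random
from itertools import combinations


def cc_pivot(nodes, prefer_together, rng=None):
    """Randomized pivot clustering (illustrative sketch, unconstrained case).

    nodes: list of hashable node ids.
    prefer_together: set of frozenset({a, b}) pairs that prefer the same
        cluster; every other pair is assumed to prefer different clusters.
    """
    rng = rng or random.Random()
    remaining = list(nodes)
    clusters = []
    while remaining:
        # Choose a pivot uniformly at random among the unclustered nodes.
        pivot = rng.choice(remaining)
        # The cluster is the pivot plus every remaining node it prefers.
        cluster = [pivot] + [
            v for v in remaining
            if v != pivot and frozenset((pivot, v)) in prefer_together
        ]
        clusters.append(cluster)
        members = set(cluster)
        # "Recurse" on the rest: drop the clustered nodes and loop again.
        remaining = [v for v in remaining if v not in members]
    return clusters


def violated_preferences(nodes, prefer_together, clusters):
    """Count pairs whose preference disagrees with the clustering."""
    label = {v: i for i, cluster in enumerate(clusters) for v in cluster}
    cost = 0
    for a, b in combinations(nodes, 2):
        same_cluster = label[a] == label[b]
        wants_same = frozenset((a, b)) in prefer_together
        cost += same_cluster != wants_same
    return cost


# Toy instance (hypothetical): a path 0-1-2 of "same cluster" preferences
# plus an isolated node 3.
example_nodes = [0, 1, 2, 3]
example_prefs = {frozenset((0, 1)), frozenset((1, 2))}
example_clusters = cc_pivot(example_nodes, example_prefs, random.Random(0))
example_cost = violated_preferences(example_nodes, example_prefs, example_clusters)
```

On this toy instance every pivot order violates exactly one preference, which also happens to be optimal; in general the pivot order affects the cost, which is why the choice of pivots matters for the approximation factor.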

Cite

@article{arxiv.2501.03154,
  title  = {A Faster Algorithm for Constrained Correlation Clustering},
  author = {Nick Fischer and Evangelos Kipouridis and Jonas Klausen and Mikkel Thorup},
  journal= {arXiv preprint arXiv:2501.03154},
  year   = {2025}
}

Comments

To appear at STACS '25