Related papers: Combinatorial Correlation Clustering

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clustering problem, where every pair of two different entities is labeled with similar or dissimilar. The goal is to partition the entities into clusters to minimize the number…

Data Structures and Algorithms · Computer Science 2023-07-14 Nairen Cao , Shang-En Huang , Hsin-Hao Su

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among $n$ objects is labeled as either "similar" or "dissimilar". The goal is to partition the objects into arbitrarily many clusters while minimizing the number of…

Data Structures and Algorithms · Computer Science 2022-05-10 Soheil Behnezhad , Moses Charikar , Weiyun Ma , Li-Yang Tan

Simultaneously Approximating All Norms for Massively Parallel Correlation Clustering

We revisit the simultaneous approximation model for the correlation clustering problem introduced by Davies, Moseley, and Newman[DMN24]. The objective is to find a clustering that minimizes given norms of the disagreement vector over all…

Data Structures and Algorithms · Computer Science 2024-10-23 Nairen Cao , Shi Li , Jia Ye

An Improved Algorithm for Bipartite Correlation Clustering

Bipartite Correlation clustering is the problem of generating a set of disjoint bi-cliques on a set of nodes while minimizing the symmetric difference to a bipartite input graph. The number or size of the output clusters is not constrained…

Data Structures and Algorithms · Computer Science 2010-12-15 Nir Ailon , Noa Avigdor-Elgrabli , Edo Liberty

Static to Dynamic Correlation Clustering

Correlation clustering is a well-studied problem, first proposed by Bansal, Blum, and Chawla [Mach. Learn. '04]. The input is an unweighted, undirected graph. The problem is to cluster the vertices so as to minimize the number of edges…

Data Structures and Algorithms · Computer Science 2026-05-12 Nairen Cao , Vincent Cohen-Addad , Euiwoong Lee , Shi Li , David Rasmussen Lolck , Alantha Newman , Mikkel Thorup , Lukas Vogl , Shuyi Yan , Hanwen Zhang

Improved Combinatorial Approximations for Weighted Correlation Clustering

We present combinatorial approximation algorithms for the weighted correlation clustering problem. In this problem, we have a set of vertices and two weight values for each pair of vertices, denoting their difference and similarity. The…

Data Structures and Algorithms · Computer Science 2025-07-16 Mojtaba Ostovari , Alireza Zarei

Correlation Clustering in Constant Many Parallel Rounds

Correlation clustering is a central topic in unsupervised learning, with many applications in ML and data mining. In correlation clustering, one receives as input a signed graph and the goal is to partition it to minimize the number of…

Data Structures and Algorithms · Computer Science 2021-06-17 Vincent Cohen-Addad , Silvio Lattanzi , Slobodan Mitrović , Ashkan Norouzi-Fard , Nikos Parotsidis , Jakub Tarnawski

Solving the Correlation Cluster LP in Sublinear Time

Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of…

Data Structures and Algorithms · Computer Science 2025-11-05 Nairen Cao , Vincent Cohen-Addad , Shi Li , Euiwoong Lee , David Rasmussen Lolck , Alantha Newman , Mikkel Thorup , Lukas Vogl , Shuyi Yan , Hanwen Zhang

A Simple and Fast $(3+\varepsilon)$-approximation for Constrained Correlation Clustering

In Constrained Correlation Clustering, the goal is to cluster a complete signed graph in a way that minimizes the number of negative edges inside clusters plus the number of positive edges between clusters, while respecting hard constraints…

Data Structures and Algorithms · Computer Science 2025-11-05 Nate Veldt

A Faster Algorithm for Constrained Correlation Clustering

In the Correlation Clustering problem we are given $n$ nodes, and a preference for each pair of nodes indicating whether we prefer the two endpoints to be in the same cluster or not. The output is a clustering inducing the minimum number of…

Data Structures and Algorithms · Computer Science 2025-01-07 Nick Fischer , Evangelos Kipouridis , Jonas Klausen , Mikkel Thorup

Fast Combinatorial Algorithms for Min Max Correlation Clustering

We introduce fast algorithms for correlation clustering with respect to the Min Max objective that provide constant factor approximations on complete graphs. Our algorithms are the first purely combinatorial approximation algorithms for…

Data Structures and Algorithms · Computer Science 2023-01-31 Sami Davies , Benjamin Moseley , Heather Newman

Four Algorithms for Correlation Clustering: A Survey

In the Correlation Clustering problem, we are given a set of objects with pairwise similarity information. Our aim is to partition these objects into clusters that match this information as closely as possible. More specifically, the…

Data Structures and Algorithms · Computer Science 2022-08-29 Jafar Jafarov

Understanding the Cluster LP for Correlation Clustering

In the classic Correlation Clustering problem introduced by Bansal, Blum, and Chawla (FOCS 2002), the input is a complete graph where edges are labeled either $+$ or $-$, and the goal is to find a partition of the vertices that minimizes…

Data Structures and Algorithms · Computer Science 2025-11-03 Nairen Cao , Vincent Cohen-Addad , Euiwoong Lee , Shi Li , Alantha Newman , Lukas Vogl

Local correlation clustering

Correlation clustering is perhaps the most natural formulation of clustering. Given $n$ objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same…

Data Structures and Algorithms · Computer Science 2013-12-19 Francesco Bonchi , David García-Soriano , Konstantin Kutzkov

Near Optimal LP Rounding Algorithm for Correlation Clustering on Complete and Complete k-partite Graphs

We give new rounding schemes for the standard linear programming relaxation of the correlation clustering problem, achieving approximation factors almost matching the integrality gaps: - For complete graphs our appoximation is $2.06 -…

Data Structures and Algorithms · Computer Science 2015-06-25 Shuchi Chawla , Konstantin Makarychev , Tselil Schramm , Grigory Yaroslavtsev

Approximate Correlation Clustering Using Same-Cluster Queries

Ashtiani et al. (NIPS 2016) introduced a semi-supervised framework for clustering (SSAC) where a learner is allowed to make same-cluster queries. More specifically, in their model, there is a query oracle that answers queries of the form…

Data Structures and Algorithms · Computer Science 2017-12-20 Nir Ailon , Anup Bhattacharya , Ragesh Jaiswal

An FPT Constant-Factor Approximation Algorithm for Correlation Clustering

The Correlation Clustering problem is one of the most extensively studied clustering formulations due to its wide applications in machine learning, data mining, computational biology and other areas. We consider the Correlation Clustering…

Data Structures and Algorithms · Computer Science 2025-03-04 Jianqi Zhou , Zhongyi Zhang , Jiong Guo

Massively Parallel Correlation Clustering in Bounded Arboricity Graphs

Identifying clusters of similar elements in a set is a common task in data analysis. With the immense growth of data and physical limitations on single processor speed, it is necessary to find efficient parallel algorithms for clustering…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-08-09 Mélanie Cambus , Davin Choo , Havu Miikonen , Jara Uitto

We consider the classic Correlation Clustering problem: Given a complete graph where edges are labelled either $+$ or $-$, the goal is to find a partition of the vertices that minimizes the number of the \pedges across parts plus the number…

Data Structures and Algorithms · Computer Science 2023-10-02 Vincent Cohen-Addad , Euiwoong Lee , Shi Li , Alantha Newman

Query-Efficient Correlation Clustering

Correlation clustering is arguably the most natural formulation of clustering. Given n objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same…

Data Structures and Algorithms · Computer Science 2020-02-27 David García-Soriano , Konstantin Kutzkov , Francesco Bonchi , Charalampos Tsourakakis