Related papers: A Faster Algorithm for Constrained Correlation Clu…

A Simple and Fast $(3+\varepsilon)$-approximation for Constrained Correlation Clustering

In Constrained Correlation Clustering, the goal is to cluster a complete signed graph in a way that minimizes the number of negative edges inside clusters plus the number of positive edges between clusters, while respecting hard constraints…

Data Structures and Algorithms · Computer Science 2025-11-05 Nate Veldt

An Improved Algorithm for Bipartite Correlation Clustering

Bipartite Correlation Clustering is the problem of generating a set of disjoint bi-cliques on a set of nodes while minimizing the symmetric difference to a bipartite input graph. The number or size of the output clusters is not constrained…

Data Structures and Algorithms · Computer Science 2010-12-15 Nir Ailon, Noa Avigdor-Elgrabli, Edo Liberty

Combinatorial Correlation Clustering

Correlation Clustering is a classic clustering objective arising in numerous machine learning and data mining applications. Given a graph $G=(V,E)$, the goal is to partition the vertex set into clusters so as to minimize the number of edges…

Data Structures and Algorithms · Computer Science 2024-07-17 Vincent Cohen-Addad, David Rasmussen Lolck, Marcin Pilipczuk, Mikkel Thorup, Shuyi Yan, Hanwen Zhang

Near-Optimal Correlation Clustering with Privacy

Correlation clustering is a central problem in unsupervised learning, with applications spanning community detection, duplicate detection, automated labelling and many more. In the correlation clustering problem one receives as input a set…

Machine Learning · Computer Science 2022-03-04 Vincent Cohen-Addad, Chenglin Fan, Silvio Lattanzi, Slobodan Mitrović, Ashkan Norouzi-Fard, Nikos Parotsidis, Jakub Tarnawski

Improved Combinatorial Approximations for Weighted Correlation Clustering

We present combinatorial approximation algorithms for the weighted correlation clustering problem. In this problem, we have a set of vertices and two weight values for each pair of vertices, denoting their difference and similarity. The…

Data Structures and Algorithms · Computer Science 2025-07-16 Mojtaba Ostovari, Alireza Zarei

Query-Efficient Correlation Clustering

Correlation clustering is arguably the most natural formulation of clustering. Given n objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same…

Data Structures and Algorithms · Computer Science 2020-02-27 David García-Soriano, Konstantin Kutzkov, Francesco Bonchi, Charalampos Tsourakakis

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among $n$ objects is labeled as either "similar" or "dissimilar". The goal is to partition the objects into arbitrarily many clusters while minimizing the number of…

Data Structures and Algorithms · Computer Science 2022-05-10 Soheil Behnezhad, Moses Charikar, Weiyun Ma, Li-Yang Tan

Correlation Clustering with Same-Cluster Queries Bounded by Optimal Cost

Several clustering frameworks with interactive (semi-supervised) queries have been studied in the past. Recently, clustering with same-cluster queries has become popular. An algorithm in this setting has access to an oracle with full…

Data Structures and Algorithms · Computer Science 2019-08-15 Barna Saha, Sanjay Subramanian

Solving the Correlation Cluster LP in Sublinear Time

Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of…

Data Structures and Algorithms · Computer Science 2025-11-05 Nairen Cao, Vincent Cohen-Addad, Shi Li, Euiwoong Lee, David Rasmussen Lolck, Alantha Newman, Mikkel Thorup, Lukas Vogl, Shuyi Yan, Hanwen Zhang

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clustering problem, where every pair of distinct entities is labeled as similar or dissimilar. The goal is to partition the entities into clusters to minimize the number…

Data Structures and Algorithms · Computer Science 2023-07-14 Nairen Cao, Shang-En Huang, Hsin-Hao Su

Pruned Pivot: Correlation Clustering Algorithm for Dynamic, Parallel, and Local Computation Models

Given a graph with positive and negative edge labels, the correlation clustering problem aims to cluster the nodes so as to minimize the total number of between-cluster positive and within-cluster negative edges. This problem has many…

Data Structures and Algorithms · Computer Science 2024-06-17 Mina Dalirrooyfard, Konstantin Makarychev, Slobodan Mitrović

An FPT Constant-Factor Approximation Algorithm for Correlation Clustering

The Correlation Clustering problem is one of the most extensively studied clustering formulations due to its wide applications in machine learning, data mining, computational biology and other areas. We consider the Correlation Clustering…

Data Structures and Algorithms · Computer Science 2025-03-04 Jianqi Zhou, Zhongyi Zhang, Jiong Guo

Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds

Correlation clustering is a widely studied framework for clustering based on pairwise similarity and dissimilarity scores, but its best approximation algorithms rely on impractical linear programming relaxations. We present faster…

Data Structures and Algorithms · Computer Science 2022-06-27 Nate Veldt

Correlation Clustering Beyond the Pivot Algorithm

We study the classic correlation clustering in the dynamic setting. Given $n$ objects and a complete labeling of the object-pairs as either similar or dissimilar, the goal is to partition the objects into arbitrarily many clusters while…

Data Structures and Algorithms · Computer Science 2025-07-15 Soheil Behnezhad, Moses Charikar, Vincent Cohen-Addad, Alma Ghafari, Weiyun Ma

Fast Combinatorial Algorithms for Min Max Correlation Clustering

We introduce fast algorithms for correlation clustering with respect to the Min Max objective that provide constant factor approximations on complete graphs. Our algorithms are the first purely combinatorial approximation algorithms for…

Data Structures and Algorithms · Computer Science 2023-01-31 Sami Davies, Benjamin Moseley, Heather Newman

Fair Correlation Clustering

In this paper we study the problem of correlation clustering under fairness constraints. In the classic correlation clustering problem, we are given a complete graph where each edge is labeled positive or negative. The goal is to obtain a…

Data Structures and Algorithms · Computer Science 2020-02-11 Saba Ahmadi, Sainyam Galhotra, Barna Saha, Roy Schwartz

Correlation Clustering with a Fixed Number of Clusters

We continue the investigation of problems concerning correlation clustering or clustering with qualitative information, which is a clustering formulation that has been studied recently. The basic setup here is that we are given as input a…

Data Structures and Algorithms · Computer Science 2007-05-23 Ioannis Giotis, Venkatesan Guruswami

A $(3+\varepsilon)$-Approximate Correlation Clustering Algorithm in Dynamic Streams

Grouping together similar elements in datasets is a common task in data mining and machine learning. In this paper, we study streaming algorithms for correlation clustering, where each pair of elements is labeled either similar or…

Data Structures and Algorithms · Computer Science 2025-03-05 Mélanie Cambus, Fabian Kuhn, Etna Lindy, Shreyas Pai, Jara Uitto

Correlation Clustering with Low-Rank Matrices

Correlation clustering is a technique for aggregating data based on qualitative information about which pairs of objects are labeled 'similar' or 'dissimilar.' Because the optimization problem is NP-hard, much of the previous literature…

Machine Learning · Computer Science 2017-03-20 Nate Veldt, Anthony Wirth, David F. Gleich

Efficient Correlation Clustering Methods for Large Consensus Clustering Instances

Consensus clustering (or clustering aggregation) inputs $k$ partitions of a given ground set $V$, and seeks to create a single partition that minimizes disagreement with all input partitions. State-of-the-art algorithms for consensus…

Data Structures and Algorithms · Computer Science 2023-07-11 Nathan Cordner, George Kollios