Related papers: Correlation Clustering Beyond the Pivot Algorithm

Fully Dynamic Adversarially Robust Correlation Clustering in Polylogarithmic Update Time

We study the dynamic correlation clustering problem with $\textit{adaptive}$ edge label flips. In correlation clustering, we are given a $n$-vertex complete graph whose edges are labeled either $(+)$ or $(-)$, and the goal is to minimize…

Data Structures and Algorithms · Computer Science 2026-02-10 Vladimir Braverman , Prathamesh Dharangutte , Shreyas Pai , Vihan Shah , Chen Wang

Pruned Pivot: Correlation Clustering Algorithm for Dynamic, Parallel, and Local Computation Models

Given a graph with positive and negative edge labels, the correlation clustering problem aims to cluster the nodes so to minimize the total number of between-cluster positive and within-cluster negative edges. This problem has many…

Data Structures and Algorithms · Computer Science 2024-06-17 Mina Dalirrooyfard , Konstantin Makarychev , Slobodan Mitrović

Static to Dynamic Correlation Clustering

Correlation clustering is a well-studied problem, first proposed by Bansal, Blum, and Chawla [Mach. Learn. '04]. The input is an unweighted, undirected graph. The problem is to cluster the vertices so as to minimize the number of edges…

Data Structures and Algorithms · Computer Science 2026-05-12 Nairen Cao , Vincent Cohen-Addad , Euiwoong Lee , Shi Li , David Rasmussen Lolck , Alantha Newman , Mikkel Thorup , Lukas Vogl , Shuyi Yan , Hanwen Zhang

Almost 3-Approximate Correlation Clustering in Constant Rounds

We study parallel algorithms for correlation clustering. Each pair among $n$ objects is labeled as either "similar" or "dissimilar". The goal is to partition the objects into arbitrarily many clusters while minimizing the number of…

Data Structures and Algorithms · Computer Science 2022-05-10 Soheil Behnezhad , Moses Charikar , Weiyun Ma , Li-Yang Tan

Breaking 3-Factor Approximation for Correlation Clustering in Polylogarithmic Rounds

In this paper, we study parallel algorithms for the correlation clustering problem, where every pair of two different entities is labeled with similar or dissimilar. The goal is to partition the entities into clusters to minimize the number…

Data Structures and Algorithms · Computer Science 2023-07-14 Nairen Cao , Shang-En Huang , Hsin-Hao Su

Correlation Clustering in Data Streams

Clustering is a fundamental tool for analyzing large data sets. A rich body of work has been devoted to designing data-stream algorithms for the relevant optimization problems such as $k$-center, $k$-median, and $k$-means. Such algorithms…

Data Structures and Algorithms · Computer Science 2018-12-06 Kook Jin Ahn , Graham Cormode , Sudipto Guha , Andrew McGregor , Anthony Wirth

Correlation Clustering via Strong Triadic Closure Labeling: Fast Approximation Algorithms and Practical Lower Bounds

Correlation clustering is a widely studied framework for clustering based on pairwise similarity and dissimilarity scores, but its best approximation algorithms rely on impractical linear programming relaxations. We present faster…

Data Structures and Algorithms · Computer Science 2022-06-27 Nate Veldt

Combinatorial Correlation Clustering

Correlation Clustering is a classic clustering objective arising in numerous machine learning and data mining applications. Given a graph $G=(V,E)$, the goal is to partition the vertex set into clusters so as to minimize the number of edges…

Data Structures and Algorithms · Computer Science 2024-07-17 Vincent Cohen-Addad , David Rasmussen Lolck , Marcin Pilipczuk , Mikkel Thorup , Shuyi Yan , Hanwen Zhang

A Simple and Fast $(3+\varepsilon)$-approximation for Constrained Correlation Clustering

In Constrained Correlation Clustering, the goal is to cluster a complete signed graph in a way that minimizes the number of negative edges inside clusters plus the number of positive edges between clusters, while respecting hard constraints…

Data Structures and Algorithms · Computer Science 2025-11-05 Nate Veldt

Correlation Clustering with Adaptive Similarity Queries

In correlation clustering, we are given $n$ objects together with a binary similarity score between each pair of them. The goal is to partition the objects into clusters so to minimise the disagreements with the scores. In this work we…

Machine Learning · Computer Science 2020-01-15 Marco Bressan , Nicolò Cesa-Bianchi , Andrea Paudice , Fabio Vitale

Dynamic Correlation Clustering in Sublinear Update Time

We study the classic problem of correlation clustering in dynamic node streams. In this setting, nodes are either added or randomly deleted over time, and each node pair is connected by a positive or negative edge. The objective is to…

Data Structures and Algorithms · Computer Science 2024-06-14 Vincent Cohen-Addad , Silvio Lattanzi , Andreas Maggiori , Nikos Parotsidis

A Faster Algorithm for Constrained Correlation Clustering

In the Correlation Clustering problem we are given $n$ nodes, and a preference for each pair of nodes indicating whether we prefer the two endpoints to be in the same cluster or not. The output is a clustering inducing the minimum number of…

Data Structures and Algorithms · Computer Science 2025-01-07 Nick Fischer , Evangelos Kipouridis , Jonas Klausen , Mikkel Thorup

Correlation Clustering and Biclustering with Locally Bounded Errors

We consider a generalized version of the correlation clustering problem, defined as follows. Given a complete graph $G$ whose edges are labeled with $+$ or $-$, we wish to partition the graph into clusters while trying to avoid errors: $+$…

Data Structures and Algorithms · Computer Science 2016-05-25 Gregory J. Puleo , Olgica Milenkovic

SPARSE-PIVOT: Dynamic correlation clustering for node insertions

We present a new Correlation Clustering algorithm for a dynamic setting where nodes are added one at a time. In this model, proposed by Cohen-Addad, Lattanzi, Maggiori, and Parotsidis (ICML 2024), the algorithm uses database queries to…

Data Structures and Algorithms · Computer Science 2025-07-03 Mina Dalirrooyfard , Konstantin Makarychev , Slobodan Mitrović

Local correlation clustering

Correlation clustering is perhaps the most natural formulation of clustering. Given $n$ objects and a pairwise similarity measure, the goal is to cluster the objects so that, to the best possible extent, similar objects are put in the same…

Data Structures and Algorithms · Computer Science 2013-12-19 Francesco Bonchi , David García-Soriano , Konstantin Kutzkov

Online Correlation Clustering for Dynamic Complete Signed Graphs

In the correlation clustering problem for complete signed graphs, the input is a complete signed graph with edges weighted as $+1$ (denote recommendation to put this pair in the same cluster) or $-1$ (recommending to put this pair of…

Data Structures and Algorithms · Computer Science 2022-11-15 Ali Shakiba

Solving the Correlation Cluster LP in Sublinear Time

Correlation Clustering is a fundamental and widely-studied problem in unsupervised learning and data mining. The input is a graph and the goal is to construct a clustering minimizing the number of inter-cluster edges plus the number of…

Data Structures and Algorithms · Computer Science 2025-11-05 Nairen Cao , Vincent Cohen-Addad , Shi Li , Euiwoong Lee , David Rasmussen Lolck , Alantha Newman , Mikkel Thorup , Lukas Vogl , Shuyi Yan , Hanwen Zhang

Near-Optimal Correlation Clustering with Privacy

Correlation clustering is a central problem in unsupervised learning, with applications spanning community detection, duplicate detection, automated labelling and many more. In the correlation clustering problem one receives as input a set…

Machine Learning · Computer Science 2022-03-04 Vincent Cohen-Addad , Chenglin Fan , Silvio Lattanzi , Slobodan Mitrović , Ashkan Norouzi-Fard , Nikos Parotsidis , Jakub Tarnawski

Correlation Clustering with Low-Rank Matrices

Correlation clustering is a technique for aggregating data based on qualitative information about which pairs of objects are labeled 'similar' or 'dissimilar.' Because the optimization problem is NP-hard, much of the previous literature…

Machine Learning · Computer Science 2017-03-20 Nate Veldt , Anthony Wirth , David F. Gleich

Efficient Correlation Clustering Methods for Large Consensus Clustering Instances

Consensus clustering (or clustering aggregation) inputs $k$ partitions of a given ground set $V$, and seeks to create a single partition that minimizes disagreement with all input partitions. State-of-the-art algorithms for consensus…

Data Structures and Algorithms · Computer Science 2023-07-11 Nathan Cordner , George Kollios