Correlation Clustering with Asymmetric Classification Errors
Abstract
In the Correlation Clustering problem, we are given a weighted graph with its edges labeled as "similar" or "dissimilar" by a binary classifier. The goal is to produce a clustering that minimizes the weight of "disagreements": the sum of the weights of "similar" edges across clusters and "dissimilar" edges within clusters. We study the correlation clustering problem under the following assumption: Every "similar" edge has weight and every "dissimilar" edge has weight (where and is a scaling parameter). We give a approximation algorithm for this problem. This assumption captures well the scenario when classification errors are asymmetric. Additionally, we show an asymptotically matching Linear Programming integrality gap of .
Cite
@article{arxiv.2108.05696,
title = {Correlation Clustering with Asymmetric Classification Errors},
author = {Jafar Jafarov and Sanchit Kalhan and Konstantin Makarychev and Yury Makarychev},
journal= {arXiv preprint arXiv:2108.05696},
year = {2021}
}
Comments
24 pages, 2 figures. The conference version of this paper appeared in the proceedings of ICML 2020