English
Related papers

Related papers: Fair Algorithms for Clustering

200 papers

Clustering is a fundamental tool in data mining. It partitions points into groups (clusters) and may be used to make decisions for each point based on its group. However, this process may harm protected (minority) classes if the clustering…

Data Structures and Algorithms · Computer Science 2018-11-27 Ioana O. Bercea , Martin Groß , Samir Khuller , Aounon Kumar , Clemens Rösner , Daniel R. Schmidt , Melanie Schmidt

Numerous algorithms have been produced for the fundamental problem of clustering under many different notions of fairness. Perhaps the most common family of notions currently studied is group fairness, in which proportional group…

Machine Learning · Computer Science 2023-06-06 Seyed A. Esmaeili , Sharmila Duppala , John P. Dickerson , Brian Brubach

Given the widespread popularity of spectral clustering (SC) for partitioning graph data, we study a version of constrained SC in which we try to incorporate the fairness notion proposed by Chierichetti et al. (2017). According to this…

Machine Learning · Statistics 2019-05-14 Matthäus Kleindessner , Samira Samadi , Pranjal Awasthi , Jamie Morgenstern

Correlation clustering is a ubiquitous paradigm in unsupervised machine learning where addressing unfairness is a major challenge. Motivated by this, we study Fair Correlation Clustering where the data points may belong to different…

Machine Learning · Computer Science 2022-06-13 Sara Ahmadian , Maryam Negahbani

We study the question of fair clustering under the {\em disparate impact} doctrine, where each protected class must have approximately equal representation in every cluster. We formulate the fair clustering problem under both the $k$-center…

Machine Learning · Computer Science 2018-02-19 Flavio Chierichetti , Ravi Kumar , Silvio Lattanzi , Sergei Vassilvitskii

Consensus clustering, a fundamental task in machine learning and data analysis, aims to aggregate multiple input clusterings of a dataset, potentially based on different non-sensitive attributes, into a single clustering that best…

Machine Learning · Computer Science 2025-06-18 Diptarka Chakraborty , Kushagra Chatterjee , Debarati Das , Tien Long Nguyen , Romina Nobahari

As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates…

Data Structures and Algorithms · Computer Science 2020-06-22 Sara Ahmadian , Alessandro Epasto , Marina Knittel , Ravi Kumar , Mohammad Mahdian , Benjamin Moseley , Philip Pham , Sergei Vassilvitskii , Yuyan Wang

In this paper, we study the problem of fair clustering on the $k-$center objective. In fair clustering, the input is $N$ points, each belonging to at least one of $l$ protected groups, e.g. male, female, Asian, Hispanic. The objective is to…

Machine Learning · Computer Science 2020-11-10 Elfarouk Harb , Ho Shan Lam

The advent of ML-driven decision-making and policy formation has led to an increasing focus on algorithmic fairness. As clustering is one of the most commonly used unsupervised machine learning approaches, there has naturally been a…

Machine Learning · Statistics 2023-05-30 Abhisek Chakraborty , Anirban Bhattacharya , Debdeep Pati

We extend the fair machine learning literature by considering the problem of proportional centroid clustering in a metric context. For clustering $n$ points with $k$ centers, we define fairness as proportionality to mean that any $n/k$…

Machine Learning · Computer Science 2020-10-13 Xingyu Chen , Brandon Fain , Liang Lyu , Kamesh Munagala

We revisit the problem of fair clustering, first introduced by Chierichetti et al., that requires each protected attribute to have approximately equal representation in every cluster; i.e., a balance property. Existing solutions to fair…

Machine Learning · Computer Science 2023-03-22 Shivam Gupta , Ganesh Ghalme , Narayanan C. Krishnan , Shweta Jain

Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its…

Machine Learning · Computer Science 2023-01-10 Seyed A. Esmaeili , Brian Brubach , Aravind Srinivasan , John P. Dickerson

Clustering is a well-studied unsupervised learning task that aims to partition data points into a number of clusters. In many applications, these clusters correspond to real-world constructs (e.g., electoral districts, playlists, TV…

Optimization and Control · Mathematics 2025-09-25 Connor Lawless , Oktay Gunluk

There has been much interest recently in developing fair clustering algorithms that seek to do justice to the representation of groups defined along sensitive attributes such as race and gender. We observe that clustering algorithms could…

Machine Learning · Computer Science 2023-01-02 Stanley Simoes , Deepak P , Muiris MacCarthaigh

Clustering is an unsupervised learning task that aims to partition data into a set of clusters. In many applications, these clusters correspond to real-world constructs (e.g. electoral districts) whose benefit can only be attained by groups…

Machine Learning · Computer Science 2023-02-09 Connor Lawless , Oktay Gunluk

Clustering algorithms are widely utilized for many modern data science applications. This motivates the need to make outputs of clustering algorithms fair. Traditionally, new fair algorithmic variants to clustering algorithms are developed…

Machine Learning · Computer Science 2021-10-26 Anshuman Chhabra , Adish Singla , Prasant Mohapatra

Center-based clustering (e.g., $k$-means, $k$-medians) and clustering using linear subspaces are two most popular techniques to partition real-world data into smaller clusters. However, when the data consists of sensitive demographic…

Machine Learning · Computer Science 2022-08-23 Sruthi Gorantla , Kishen N. Gowda , Amit Deshpande , Anand Louis

What does it mean for a clustering to be fair? One popular approach seeks to ensure that each cluster contains groups in (roughly) the same proportion in which they exist in the population. The normative principle at play is balance: any…

Machine Learning · Computer Science 2021-01-29 Mohsen Abbasi , Aditya Bhaskara , Suresh Venkatasubramanian

Clustering is an unsupervised machine learning task that consists of identifying groups of similar objects. It has numerous applications and is increasingly used in fairness-sensitive domains where objects represent individuals, such as…

Machine Learning · Computer Science 2026-05-14 Claudio Mantuano , Manuel Kammermann , Philipp Baumann

This paper studies the fair range clustering problem in which the data points are from different demographic groups and the goal is to pick $k$ centers with the minimum clustering cost such that each group is at least minimally represented…

Machine Learning · Computer Science 2023-06-23 Sèdjro S. Hotegni , Sepideh Mahabadi , Ali Vakilian
‹ Prev 1 2 3 10 Next ›