English
Related papers

Related papers: Fair Minimum Representation Clustering

200 papers

Clustering is a well-studied unsupervised learning task that aims to partition data points into a number of clusters. In many applications, these clusters correspond to real-world constructs (e.g., electoral districts, playlists, TV…

Optimization and Control · Mathematics 2025-09-25 Connor Lawless , Oktay Gunluk

There has been much interest recently in developing fair clustering algorithms that seek to do justice to the representation of groups defined along sensitive attributes such as race and gender. We observe that clustering algorithms could…

Machine Learning · Computer Science 2023-01-02 Stanley Simoes , Deepak P , Muiris MacCarthaigh

Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its…

Machine Learning · Computer Science 2023-01-10 Seyed A. Esmaeili , Brian Brubach , Aravind Srinivasan , John P. Dickerson

Clustering is a fundamental tool in data mining. It partitions points into groups (clusters) and may be used to make decisions for each point based on its group. However, this process may harm protected (minority) classes if the clustering…

Data Structures and Algorithms · Computer Science 2018-11-27 Ioana O. Bercea , Martin Groß , Samir Khuller , Aounon Kumar , Clemens Rösner , Daniel R. Schmidt , Melanie Schmidt

Incorporating fairness constructs into machine learning algorithms is a topic of much societal importance and recent interest. Clustering, a fundamental task in unsupervised learning that manifests across a number of web data scenarios, has…

Computers and Society · Computer Science 2020-10-15 Deepak P , Savitha Sam Abraham

The popular K-means clustering algorithm potentially suffers from a major weakness for further analysis or interpretation. Some cluster may have disproportionately more (or fewer) points from one of the subpopulations in terms of some…

Machine Learning · Computer Science 2026-02-10 Guancheng Zhou , Haiping Xu , Hongkang Xu , Chenyu Li , Donghui Yan

Consensus clustering, a fundamental task in machine learning and data analysis, aims to aggregate multiple input clusterings of a dataset, potentially based on different non-sensitive attributes, into a single clustering that best…

Machine Learning · Computer Science 2025-06-18 Diptarka Chakraborty , Kushagra Chatterjee , Debarati Das , Tien Long Nguyen , Romina Nobahari

Numerous algorithms have been produced for the fundamental problem of clustering under many different notions of fairness. Perhaps the most common family of notions currently studied is group fairness, in which proportional group…

Machine Learning · Computer Science 2023-06-06 Seyed A. Esmaeili , Sharmila Duppala , John P. Dickerson , Brian Brubach

What does it mean for a clustering to be fair? One popular approach seeks to ensure that each cluster contains groups in (roughly) the same proportion in which they exist in the population. The normative principle at play is balance: any…

Machine Learning · Computer Science 2021-01-29 Mohsen Abbasi , Aditya Bhaskara , Suresh Venkatasubramanian

We study the question of fair clustering under the {\em disparate impact} doctrine, where each protected class must have approximately equal representation in every cluster. We formulate the fair clustering problem under both the $k$-center…

Machine Learning · Computer Science 2018-02-19 Flavio Chierichetti , Ravi Kumar , Silvio Lattanzi , Sergei Vassilvitskii

Clustering algorithms may unintentionally propagate or intensify existing disparities, leading to unfair representations or biased decision-making. Current fair clustering methods rely on notions of fairness that do not capture any…

Machine Learning · Statistics 2023-12-15 Fritz Bayer , Drago Plecko , Niko Beerenwinkel , Jack Kuipers

We study the problem of finding low-cost Fair Clusterings in data where each data point may belong to many protected groups. Our work significantly generalizes the seminal work of Chierichetti et.al. (NIPS 2017) as follows. - We allow the…

Data Structures and Algorithms · Computer Science 2019-06-18 Suman K. Bera , Deeparnab Chakrabarty , Nicolas J. Flores , Maryam Negahbani

Clustering is an unsupervised machine learning task that consists of identifying groups of similar objects. It has numerous applications and is increasingly used in fairness-sensitive domains where objects represent individuals, such as…

Machine Learning · Computer Science 2026-05-14 Claudio Mantuano , Manuel Kammermann , Philipp Baumann

We show that the popular k-means clustering algorithm (Lloyd's heuristic), used for a variety of scientific data, can result in outcomes that are unfavorable to subgroups of data (e.g., demographic groups). Such biased clusterings can have…

Machine Learning · Computer Science 2020-10-30 Mehrdad Ghadiri , Samira Samadi , Santosh Vempala

The goal of fair clustering is to find clusters such that the proportion of sensitive attributes (e.g., gender, race, etc.) in each cluster is similar to that of the entire dataset. Various fair clustering algorithms have been proposed that…

Machine Learning · Statistics 2026-02-26 Jinwon Park , Kunwoong Kim , Jihu Lee , Yongdai Kim

Ensuring fairness in machine learning algorithms is a challenging and essential task. We consider the problem of clustering a set of points while satisfying fairness constraints. While there have been several attempts to capture group…

Machine Learning · Computer Science 2023-02-07 Debajyoti Kar , Mert Kosan , Debmalya Mandal , Sourav Medya , Arlei Silva , Palash Dey , Swagato Sanyal

Clustering is a popular form of unsupervised learning for geometric data. Unfortunately, many clustering algorithms lead to cluster assignments that are hard to explain, partially because they depend on all the features of the data in a…

Machine Learning · Computer Science 2020-09-23 Sanjoy Dasgupta , Nave Frost , Michal Moshkovitz , Cyrus Rashtchian

Clustering is an effective technique in data mining to generate groups that are the matter of interest. Among various clustering approaches, the family of k-means algorithms and min-cut algorithms gain most popularity due to their…

Machine Learning · Computer Science 2014-11-25 Xiaojun Chang , Feiping Nie , Zhigang Ma , Yi Yang

Fair clustering is the process of grouping similar entities together, while satisfying a mathematically well-defined fairness metric as a constraint. Due to the practical challenges in precise model specification, the prescribed fairness…

Machine Learning · Statistics 2021-02-09 Sainyam Galhotra , Sandhya Saisubramanian , Shlomo Zilberstein

Center-based clustering (e.g., $k$-means, $k$-medians) and clustering using linear subspaces are two most popular techniques to partition real-world data into smaller clusters. However, when the data consists of sensitive demographic…

Machine Learning · Computer Science 2022-08-23 Sruthi Gorantla , Kishen N. Gowda , Amit Deshpande , Anand Louis
‹ Prev 1 2 3 10 Next ›