
Related papers: XClusters: Explainability-first Clustering

200 papers

Clustering is a popular form of unsupervised learning for geometric data. Unfortunately, many clustering algorithms lead to cluster assignments that are hard to explain, partially because they depend on all the features of the data in a…

Machine Learning · Computer Science 2020-09-23 Sanjoy Dasgupta, Nave Frost, Michal Moshkovitz, Cyrus Rashtchian

Clustering ensemble has emerged as an important research topic in the field of machine learning. Although numerous methods have been proposed to improve clustering quality, most existing approaches overlook the need for interpretability in…

Machine Learning · Computer Science 2025-06-09 Hang Lv, Lianyu Hu, Mudi Jiang, Xinying Liu, Zengyou He

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search…

Artificial Intelligence · Computer Science 2014-11-17 D. Fisher

The domain of explainable AI is of interest in all Machine Learning fields, and it is all the more important in clustering, an unsupervised task whose result must be validated by a domain expert. We aim at finding a clustering that has high…

Artificial Intelligence · Computer Science 2024-03-28 Mathieu Guilbert, Christel Vrain, Thi-Bich-Hanh Dao

$k$-means and $k$-median clustering are powerful unsupervised machine learning techniques. However, due to complicated dependences on all the features, it is challenging to interpret the resulting cluster assignments. Moshkovitz, Dasgupta,…

Data Structures and Algorithms · Computer Science 2021-12-17 Sayan Bandyapadhyay, Fedor V. Fomin, Petr A. Golovach, William Lochet, Nidhi Purohit, Kirill Simonov

State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability. In healthcare applications, the latter poses a…

Machine Learning · Statistics 2018-12-04 Dimitris Bertsimas, Agni Orfanoudaki, Holly Wiberg

We introduce a fast and explainable clustering method called CLASSIX. It consists of two phases, namely a greedy aggregation phase of the sorted data into groups of nearby data points, followed by the merging of groups into clusters. The…

Machine Learning · Computer Science 2024-02-16 Xinye Chen, Stefan Güttel

The price of explainability for a clustering task can be defined as the unavoidable loss, in terms of the objective function, if we force the final partition to be explainable. Here, we study this price for the following clustering problems:…

Machine Learning · Computer Science 2021-02-16 Eduardo Laber, Lucas Murtinho

We study the explainable clustering problem first posed by Moshkovitz, Dasgupta, Rashtchian, and Frost (ICML 2020). The goal of explainable clustering is to fit an axis-aligned decision tree with $K$ leaves and minimal clustering cost…

Machine Learning · Computer Science 2025-08-08 Maximilian Fleissner, Maedeh Zarvandi, Debarghya Ghoshdastidar

In recent years, much of the research on clustering algorithms has primarily focused on enhancing their accuracy and efficiency, frequently at the expense of interpretability. However, as these methods are increasingly being applied in…

Machine Learning · Computer Science 2026-01-21 Lianyu Hu, Mudi Jiang, Junjie Dong, Xinying Liu, Zengyou He

Fair clustering has gained increasing attention in recent years, especially in applications involving socially sensitive attributes. However, existing fair clustering methods often lack interpretability, limiting their applicability in…

Machine Learning · Computer Science 2025-11-27 Mudi Jiang, Jiahui Zhou, Xinying Liu, Zengyou He, Zhikui Chen

In this paper, we address an issue of finding explainable clusters of class-uniform data in labelled datasets. The issue falls into the domain of interpretable supervised clustering. Unlike traditional clustering, supervised clustering aims…

Machine Learning · Computer Science 2023-07-18 Natallia Kokash, Leonid Makhnist

Graph clustering groups entities -- the vertices of a graph -- based on their similarity, typically using a complex distance function over a large number of features. Successful integration of clustering approaches in automated…

Machine Learning · Statistics 2020-02-03 Sandhya Saisubramanian, Sainyam Galhotra, Shlomo Zilberstein

Clustering serves as a vital tool for uncovering latent data structures, and achieving both high accuracy and interpretability is essential. To this end, existing methods typically construct binary decision trees by solving mixed-integer…

Machine Learning · Computer Science 2026-02-17 Hayato Suzuki, Shunnosuke Ikeda, Yuichi Takano

Despite the growing popularity of explainable and interpretable machine learning, there is still surprisingly limited work on inherently interpretable clustering methods. Recently, there has been a surge of interest in explaining the…

Machine Learning · Computer Science 2024-11-26 Maximilian Fleissner, Leena Chennuru Vankadara, Debarghya Ghoshdastidar

We study supervised learning problems using clustering constraints to impose structure on either features or samples, seeking to help both prediction and interpretation. The problem of clustering features arises naturally in text…

Machine Learning · Computer Science 2016-09-20 Vincent Roulet, Fajwel Fogel, Alexandre d'Aspremont, Francis Bach

Explainable AI (XAI) is an important developing area but remains relatively understudied for clustering. We propose an explainable-by-design clustering approach that not only finds clusters but also exemplars to explain each cluster. The…

Artificial Intelligence · Computer Science 2022-09-21 Ian Davidson, Michael Livanos, Antoine Gourru, Peter Walker, Julien Velcin, S. S. Ravi

Clustering is an unsupervised learning problem that aims to partition unlabelled data points into groups with similar features. Traditional clustering algorithms provide limited insight into the groups they find as their main focus is…

Machine Learning · Computer Science 2022-10-18 Connor Lawless, Oktay Gunluk

One basic requirement of many studies is the necessity of classifying data. Clustering is a proposed method for summarizing networks. Clustering methods can be divided into two categories named model-based approaches and algorithmic…

Machine Learning · Computer Science 2013-02-19 Raheleh Namayandeh, Farzad Didehvar, Zahra Shojaei

Constrained clustering is a semi-supervised task that employs a limited amount of labelled data, formulated as constraints, to incorporate domain-specific knowledge and to significantly improve clustering accuracy. Previous work has…

Machine Learning · Computer Science 2023-05-17 Pouya Shati, Eldan Cohen, Sheila McIlraith