Related papers: Explaining Kernel Clustering via Decision Trees

Interpretable Clustering via Optimal Trees

State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability. In healthcare applications, the latter poses a…

Machine Learning · Statistics 2018-12-04 Dimitris Bertsimas , Agni Orfanoudaki , Holly Wiberg

Interpretable clustering via optimal multiway-split decision trees

Clustering serves as a vital tool for uncovering latent data structures, and achieving both high accuracy and interpretability is essential. To this end, existing methods typically construct binary decision trees by solving mixed-integer…

Machine Learning · Computer Science 2026-02-17 Hayato Suzuki , Shunnosuke Ikeda , Yuichi Takano

Explainable $k$-Means and $k$-Medians Clustering

Clustering is a popular form of unsupervised learning for geometric data. Unfortunately, many clustering algorithms lead to cluster assignments that are hard to explain, partially because they depend on all the features of the data in a…

Machine Learning · Computer Science 2020-09-23 Sanjoy Dasgupta , Nave Frost , Michal Moshkovitz , Cyrus Rashtchian

Shallow decision trees for explainable $k$-means clustering

A number of recent works have employed decision trees for the construction of explainable partitions that aim to minimize the $k$-means cost function. These works, however, largely ignore metrics related to the depths of the leaves in the…

Machine Learning · Computer Science 2022-08-29 Eduardo Laber , Lucas Murtinho , Felipe Oliveira

Scalable Kernel Clustering: Approximate Kernel k-means

Kernel-based clustering algorithms have the ability to capture the non-linear structure in real world data. Among various kernel-based clustering algorithms, kernel k-means has gained popularity due to its simple iterative nature and ease…

Computer Vision and Pattern Recognition · Computer Science 2014-02-18 Radha Chitta , Rong Jin , Timothy C. Havens , Anil K. Jain

Interpretable Fair Clustering

Fair clustering has gained increasing attention in recent years, especially in applications involving socially sensitive attributes. However, existing fair clustering methods often lack interpretability, limiting their applicability in…

Machine Learning · Computer Science 2025-11-27 Mudi Jiang , Jiahui Zhou , Xinying Liu , Zengyou He , Zhikui Chen

Interpretable Clustering Ensemble

Clustering ensemble has emerged as an important research topic in the field of machine learning. Although numerous methods have been proposed to improve clustering quality, most existing approaches overlook the need for interpretability in…

Machine Learning · Computer Science 2025-06-09 Hang Lv , Lianyu Hu , Mudi Jiang , Xinying Liu , Zengyou He

How to Find a Good Explanation for Clustering?

$k$-means and $k$-median clustering are powerful unsupervised machine learning techniques. However, due to complicated dependences on all the features, it is challenging to interpret the resulting cluster assignments. Moshkovitz, Dasgupta,…

Data Structures and Algorithms · Computer Science 2021-12-17 Sayan Bandyapadhyay , Fedor V. Fomin , Petr A. Golovach , William Lochet , Nidhi Purohit , Kirill Simonov

Learning Interpretable Characteristic Kernels via Decision Forests

Decision forests are widely used for classification and regression tasks. A lesser known property of tree-based methods is that one can construct a proximity matrix from the tree(s), and these proximity matrices are induced kernels. While…

Machine Learning · Statistics 2024-10-14 Sambit Panda , Cencheng Shen , Joshua T. Vogelstein

Accelerating k-Means Clustering with Cover Trees

The k-means clustering algorithm is a popular algorithm that partitions data into k clusters. There are many improvements to accelerate the standard algorithm. Most current research employs upper and lower bounds on point-to-cluster…

Machine Learning · Computer Science 2024-10-22 Andreas Lang , Erich Schubert

Interpretable Clustering: A Survey

In recent years, much of the research on clustering algorithms has primarily focused on enhancing their accuracy and efficiency, frequently at the expense of interpretability. However, as these methods are increasingly being applied in…

Machine Learning · Computer Science 2026-01-21 Lianyu Hu , Mudi Jiang , Junjie Dong , Xinying Liu , Zengyou He

ExKMC: Expanding Explainable $k$-Means Clustering

Despite the popularity of explainable AI, there is limited work on effective methods for unsupervised learning. We study algorithms for $k$-means clustering, focusing on a trade-off between explainability and accuracy. Following prior work,…

Machine Learning · Computer Science 2020-07-03 Nave Frost , Michal Moshkovitz , Cyrus Rashtchian

NeurCAM: Interpretable Neural Clustering via Additive Models

Interpretable clustering algorithms aim to group similar data points while explaining the obtained groups to support knowledge discovery and pattern recognition tasks. While most approaches to interpretable clustering construct clusters…

Machine Learning · Computer Science 2024-08-27 Nakul Upadhya , Eldan Cohen

Using Decision Trees for Interpretable Supervised Clustering

In this paper, we address an issue of finding explainable clusters of class-uniform data in labelled datasets. The issue falls into the domain of interpretable supervised clustering. Unlike traditional clustering, supervised clustering aims…

Machine Learning · Computer Science 2023-07-18 Natallia Kokash , Leonid Makhnist

Interpretable Sequence Clustering

Categorical sequence clustering plays a crucial role in various fields, but the lack of interpretability in cluster assignments poses significant challenges. Sequences inherently lack explicit features, and existing sequence clustering…

Machine Learning · Computer Science 2023-09-06 Junjie Dong , Xinyi Yang , Mudi Jiang , Lianyu Hu , Zengyou He

Interpretable Time Series Clustering Using Local Explanations

This study focuses on exploring the use of local interpretability methods for explaining time series clustering models. Many of the state-of-the-art clustering models are not directly explainable. To provide explanations for these…

Machine Learning · Computer Science 2022-08-03 Ozan Ozyegen , Nicholas Prayogo , Mucahit Cevik , Ayse Basar

Efficient Algorithms for Generating Provably Near-Optimal Cluster Descriptors for Explainability

Improving the explainability of the results from machine learning methods has become an important research goal. Here, we study the problem of making clusters more interpretable by extending a recent approach of [Davidson et al., NeurIPS…

Data Structures and Algorithms · Computer Science 2020-02-10 Prathyush Sambaturu , Aparna Gupta , Ian Davidson , S. S. Ravi , Anil Vullikanti , Andrew Warren

A Randomized Approach to Efficient Kernel Clustering

Kernel-based K-means clustering has gained popularity due to its simplicity and the power of its implicit non-linear representation of the data. A dominant concern is the memory requirement since memory scales as the square of the number of…

Machine Learning · Statistics 2016-12-05 Farhad Pourkamali-Anaraki , Stephen Becker

Towards Explainable Clustering: A Constrained Declarative based Approach

The domain of explainable AI is of interest in all Machine Learning fields, and it is all the more important in clustering, an unsupervised task whose result must be validated by a domain expert. We aim at finding a clustering that has high…

Artificial Intelligence · Computer Science 2024-03-28 Mathieu Guilbert , Christel Vrain , Thi-Bich-Hanh Dao

Making Tree Ensembles Interpretable

Tree ensembles, such as random forest and boosted trees, are renowned for their high prediction performance, whereas their interpretability is critically limited. In this paper, we propose a post processing method that improves the model…

Machine Learning · Statistics 2016-06-20 Satoshi Hara , Kohei Hayashi