Related papers: Algorithm-Agnostic Explainability for Unsupervised…

Algorithm-Agnostic Interpretations for Clustering

A clustering outcome for high-dimensional data is typically interpreted via post-processing, involving dimension reduction and subsequent visualization. This destroys the meaning of the data and obfuscates interpretations. We propose…

Machine Learning · Computer Science 2022-09-23 Christian A. Scholbeck , Henri Funk , Giuseppe Casalicchio

Counterfactual Explanations for Clustering Models

Clustering algorithms rely on complex optimisation processes that may be difficult to comprehend, especially for individuals who lack technical expertise. While many explainable artificial intelligence techniques exist for supervised…

Machine Learning · Computer Science 2024-09-20 Aurora Spagnol , Kacper Sokol , Pietro Barbiero , Marc Langheinrich , Martin Gjoreski

Explaining AutoClustering: Uncovering Meta-Feature Contribution in AutoML for Clustering

AutoClustering methods aim to automate unsupervised learning tasks, including algorithm selection (AS), hyperparameter optimization (HPO), and pipeline synthesis (PS), by often leveraging meta-learning over dataset meta-features. While…

Machine Learning · Computer Science 2026-02-23 Matheus Camilo da Silva , Leonardo Arrighi , Ana Carolina Lorena , Sylvio Barbon Junior

Towards Explainable Clustering: A Constrained Declarative based Approach

The domain of explainable AI is of interest in all Machine Learning fields, and it is all the more important in clustering, an unsupervised task whose result must be validated by a domain expert. We aim at finding a clustering that has high…

Artificial Intelligence · Computer Science 2024-03-28 Mathieu Guilbert , Christel Vrain , Thi-Bich-Hanh Dao

Semi-Supervised Constrained Clustering: An In-Depth Overview, Ranked Taxonomy and Future Research Directions

Clustering is a well-known unsupervised machine learning approach capable of automatically grouping discrete sets of instances with similar characteristics. Constrained clustering is a semi-supervised extension to this process that can be…

Machine Learning · Computer Science 2023-03-02 Germán González-Almagro , Daniel Peralta , Eli De Poorter , José-Ramón Cano , Salvador García

Active Clustering with Model-Based Uncertainty Reduction

Semi-supervised clustering seeks to augment traditional clustering methods by incorporating side information provided via human expertise in order to increase the semantic meaningfulness of the resulting clusters. However, most current…

Machine Learning · Computer Science 2014-02-17 Caiming Xiong , David Johnson , Jason J. Corso

Clustering and Unsupervised Anomaly Detection with L2 Normalized Deep Auto-Encoder Representations

Clustering is essential to many tasks in pattern recognition and computer vision. With the advent of deep learning, there is an increasing interest in learning deep unsupervised representations for clustering analysis. Many works on this…

Machine Learning · Computer Science 2018-02-02 Caglar Aytekin , Xingyang Ni , Francesco Cricri , Emre Aksu

Semi-supervised clustering methods

Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning…

Methodology · Statistics 2014-07-11 Eric Bair

Supervised Convex Clustering

Clustering has long been a popular unsupervised learning approach to identify groups of similar objects and discover patterns from unlabeled data in many applications. Yet, coming up with meaningful interpretations of the estimated clusters…

Methodology · Statistics 2020-05-26 Minjie Wang , Tianyi Yao , Genevera I. Allen

Studying Cross-cluster Modularity in Neural Networks

An approach to improve neural network interpretability is via clusterability, i.e., splitting a model into disjoint clusters that can be studied independently. We define a measure for clusterability and show that pre-trained models form…

Machine Learning · Computer Science 2025-07-28 Satvik Golechha , Maheep Chaudhary , Joan Velja , Alessandro Abate , Nandi Schoots

Explainable Clustering via Exemplars: Complexity and Efficient Approximation Algorithms

Explainable AI (XAI) is an important developing area but remains relatively understudied for clustering. We propose an explainable-by-design clustering approach that not only finds clusters but also exemplars to explain each cluster. The…

Artificial Intelligence · Computer Science 2022-09-21 Ian Davidson , Michael Livanos , Antoine Gourru , Peter Walker , Julien Velcin , S. S. Ravi

EXPLAIN-IT: Towards Explainable AI for Unsupervised Network Traffic Analysis

The application of unsupervised learning approaches, and in particular of clustering techniques, represents a powerful exploration means for the analysis of network measurements. Discovering underlying data characteristics, grouping similar…

Artificial Intelligence · Computer Science 2020-03-11 Andrea Morichetta , Pedro Casas , Marco Mellia

Explaining Digital Pathology Models via Clustering Activations

We present a clustering-based explainability technique for digital pathology models based on convolutional neural networks. Unlike commonly used methods based on saliency maps, such as occlusion, GradCAM, or relevance propagation, which…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Adam Bajger , Jan Obdržálek , Vojtěch Kůr , Rudolf Nenutil , Petr Holub , Vít Musil , Tomáš Brázdil

PCM and APCM Revisited: An Uncertainty Perspective

In this paper, we take a new look at the possibilistic c-means (PCM) and adaptive PCM (APCM) clustering algorithms from the perspective of uncertainty. This new perspective offers us insights into the clustering process, and also provides…

Computer Vision and Pattern Recognition · Computer Science 2016-10-28 Peixin Hou , Hao Deng , Jiguang Yue , Shuguang Liu

Clustering Data with Nonignorable Missingness using Semi-Parametric Mixture Models

We are concerned in clustering continuous data sets subject to non-ignorable missingness. We perform clustering with a specific semi-parametric mixture, under the assumption of conditional independence given the component. The mixture model…

Methodology · Statistics 2021-07-20 Marie Du Roy de Chaumaray , Matthieu Marbac

Clustering Uncertain Data via Representative Possible Worlds with Consistency Learning

Clustering uncertain data is an essential task in data mining for the internet of things. Possible world based algorithms seem promising for clustering uncertain data. However, there are two issues in existing possible world based…

Machine Learning · Computer Science 2019-09-30 Han Liu , Xianchao Zhang , Xiaotong Zhang , Qimai Li , Xiao-Ming Wu

XClusters: Explainability-first Clustering

We study the problem of explainability-first clustering where explainability becomes a first-class citizen for clustering. Previous clustering approaches use decision trees for explanation, but only after the clustering is completed. In…

Machine Learning · Computer Science 2022-12-13 Hyunseung Hwang , Steven Euijong Whang

Inference for Cluster Randomized Experiments with Non-ignorable Cluster Sizes

This paper considers the problem of inference in cluster randomized experiments when cluster sizes are non-ignorable. Here, by a cluster randomized experiment, we mean one in which treatment is assigned at the cluster level. By…

Econometrics · Economics 2024-04-11 Federico Bugni , Ivan Canay , Azeem Shaikh , Max Tabord-Meehan

Interpretable Clustering via Optimal Trees

State-of-the-art clustering algorithms use heuristics to partition the feature space and provide little insight into the rationale for cluster membership, limiting their interpretability. In healthcare applications, the latter poses a…

Machine Learning · Statistics 2018-12-04 Dimitris Bertsimas , Agni Orfanoudaki , Holly Wiberg

Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis

Cognitive diagnosis models have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting cognitive diagnosis models, nonparametric…

Statistics Theory · Mathematics 2025-10-01 Chengyu Cui , Yanlong Liu , Gongjun Xu