Related papers: Hierarchical Maximum-Margin Clustering

Scalable Hierarchical Agglomerative Clustering

The applicability of agglomerative clustering, for inferring both hierarchical and flat clustering, is limited by its scalability. Existing scalable hierarchical clustering methods sacrifice quality for speed and often lead to over-merging…

Machine Learning · Computer Science 2021-10-01 Nicholas Monath , Avinava Dubey , Guru Guruganesh , Manzil Zaheer , Amr Ahmed , Andrew McCallum , Gokhan Mergen , Marc Najork , Mert Terzihan , Bryon Tjanaka , Yuan Wang , Yuchen Wu

Hierarchical Clustering with Structural Constraints

Hierarchical clustering is a popular unsupervised data analysis method. For many real-world applications, we would like to exploit prior information about the data that imposes constraints on the clustering hierarchy, and is not captured by…

Data Structures and Algorithms · Computer Science 2018-07-17 Vaggos Chatziafratis , Rad Niazadeh , Moses Charikar

Improved Hierarchical Clustering on Massive Datasets with Broad Guarantees

Hierarchical clustering is a stronger extension of one of today's most influential unsupervised learning methods: clustering. The goal of this method is to create a hierarchy of clusters, thus constructing cluster evolutionary history and…

Data Structures and Algorithms · Computer Science 2021-01-14 MohammadTaghi Hajiaghayi , Marina Knittel

Hierarchical clustering with discrete latent variable models and the integrated classification likelihood

Finding a set of nested partitions of a dataset is useful to uncover relevant structure at different scales, and is often dealt with a data-dependent methodology. In this paper, we introduce a general two-step methodology for model-based…

Computation · Statistics 2021-04-22 Etienne Côme , Nicolas Jouvin , Pierre Latouche , Charles Bouveyron

Hierarchical clustering of mixed-type data based on barycentric coding

Clustering of mixed-type datasets can be a particularly challenging task as it requires taking into account the associations between variables with different level of measurement, i.e., nominal, ordinal and/or interval. In some cases,…

Methodology · Statistics 2022-04-22 Odysseas Moschidis , Angelos Markos , Theodore Chadjipadelis

Divisive Hierarchical Clustering of Variables Identified by Singular Vectors

In this work, we introduce a novel methodology for divisive hierarchical clustering. Our divisive (``top-down'') approach is motivated by the fact that agglomerative hierarchical clustering (``bottom-up''), which is commonly used for…

Methodology · Statistics 2025-10-07 Jan O. Bauer

Matrix Completion with Hierarchical Graph Side Information

We consider a matrix completion problem that exploits social or item similarity graphs as side information. We develop a universal, parameter-free, and computationally efficient algorithm that starts with hierarchical graph clustering and…

Machine Learning · Statistics 2022-01-06 Adel Elmahdy , Junhyung Ahn , Changho Suh , Soheil Mohajer

Algorithms of maximum likelihood data clustering with applications

We address the problem of data clustering by introducing an unsupervised, parameter free approach based on maximum likelihood principle. Starting from the observation that data sets belonging to the same cluster share a common information,…

Statistical Mechanics · Physics 2009-11-07 Lorenzo Giada , Matteo Marsili

Unsupervised Deep Discriminant Analysis Based Clustering

This work presents an unsupervised deep discriminant analysis for clustering. The method is based on deep neural networks and aims to minimize the intra-cluster discrepancy and maximize the inter-cluster discrepancy in an unsupervised…

Machine Learning · Computer Science 2022-06-13 Jinyu Cai , Wenzhong Guo , Jicong Fan

Iterative Optimization and Simplification of Hierarchical Clusterings

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search…

Artificial Intelligence · Computer Science 2014-11-17 D. Fisher

Experiments on Density-Constrained Graph Clustering

Clustering a graph means identifying internally dense subgraphs which are only sparsely interconnected. Formalizations of this notion lead to measures that quantify the quality of a clustering and to algorithms that actually find…

Data Structures and Algorithms · Computer Science 2011-12-12 Robert Görke , Andrea Schumm , Dorothea Wagner

Hierarchical Clustering With Confidence

Agglomerative hierarchical clustering is one of the most widely used approaches for exploring how observations in a dataset relate to each other. However, its greedy nature makes it highly sensitive to small perturbations in the data, often…

Methodology · Statistics 2026-03-17 Di Wu , Jacob Bien , Snigdha Panigrahi

Functional data clustering via information maximization

A new method for clustering functional data is proposed via information maximization. The proposed method learns a probabilistic classifier in an unsupervised manner so that mutual information (or squared loss mutual information) between…

Applications · Statistics 2023-06-08 Xinyu Li , Jianjun Xu , Haoyang Cheng

Ultrametric Cluster Hierarchies: I Want 'em All!

Hierarchical clustering is a powerful tool for exploratory data analysis, organizing data into a tree of clusterings from which a partition can be chosen. This paper generalizes these ideas by proving that, for any reasonable hierarchy, one…

Machine Learning · Computer Science 2025-11-13 Andrew Draganov , Pascal Weber , Rasmus Skibdahl Melanchton Jørgensen , Anna Beer , Claudia Plant , Ira Assent

A parallelizable model-based approach for marginal and multivariate clustering

This paper develops a clustering method that takes advantage of the sturdiness of model-based clustering, while attempting to mitigate some of its pitfalls. First, we note that standard model-based clustering likely leads to the same number…

Machine Learning · Statistics 2022-12-09 Miguel de Carvalho , Gabriel Martos Venturini , Andrej Svetlošák

Hierarchical topological clustering

Topological methods have the potential of exploring data clouds without making assumptions on their the structure. Here we propose a hierarchical topological clustering algorithm that can be implemented with any distance choice. The…

Machine Learning · Computer Science 2026-02-10 Ana Carpio , Gema Duro

Fair Hierarchical Clustering

As machine learning has become more prevalent, researchers have begun to recognize the necessity of ensuring machine learning systems are fair. Recently, there has been an interest in defining a notion of fairness that mitigates…

Data Structures and Algorithms · Computer Science 2020-06-22 Sara Ahmadian , Alessandro Epasto , Marina Knittel , Ravi Kumar , Mohammad Mahdian , Benjamin Moseley , Philip Pham , Sergei Vassilvitskii , Yuyan Wang

HSC: A Novel Method for Clustering Hierarchies of Networked Data

Hierarchical clustering is one of the most powerful solutions to the problem of clustering, on the grounds that it performs a multi scale organization of the data. In recent years, research on hierarchical clustering methods has attracted…

Machine Learning · Computer Science 2019-08-02 Antonia Korba

Deep Transductive Semi-supervised Maximum Margin Clustering

Semi-supervised clustering is an very important topic in machine learning and computer vision. The key challenge of this problem is how to learn a metric, such that the instances sharing the same label are more likely close to each other on…

Machine Learning · Computer Science 2015-01-27 Gang Chen

Model-based clustering of multiple networks with a hierarchical algorithm

The paper tackles the problem of clustering multiple networks, directed or not, that do not share the same set of vertices, into groups of networks with similar topology. A statistical model-based approach based on a finite mixture of…

Statistics Theory · Mathematics 2023-11-07 Tabea Rebafka