Related papers: Layers and stability

A Hybrid Approach To Hierarchical Density-based Cluster Selection

HDBSCAN is a density-based clustering algorithm that constructs a cluster hierarchy tree and then uses a specific stability measure to extract flat clusters from the tree. We show how the application of an additional threshold value can…

Databases · Computer Science 2021-01-22 Claudia Malzer , Marcus Baum

Accelerated Hierarchical Density Clustering

We present an accelerated algorithm for hierarchical density based clustering. Our new algorithm improves upon HDBSCAN*, which itself provided a significant qualitative improvement over the popular DBSCAN algorithm. The accelerated HDBSCAN*…

Machine Learning · Statistics 2018-12-20 Leland McInnes , John Healy

An Algorithmic Introduction to Clustering

This paper tries to present a more unified view of clustering, by identifying the relationships between five different clustering algorithms. Some of the results are not new, but they are presented in a cleaner, simpler and more concise…

Machine Learning · Computer Science 2020-06-11 Bernardo A. Gonzalez-Torres

Hierarchical Single-Linkage Clustering for Community Detection with Overlaps and Outliers

Most community detection approaches make very strong assumptions about communities in the data, such as every vertex must belong to exactly one community (the communities form a partition). For vector data, Hierarchical Density Based…

Social and Information Networks · Computer Science 2025-09-03 Ryan DeWolfe

Hierarchical Clustering in Astronomy

Hierarchical clustering is a common algorithm in data analysis. It is unique among many clustering algorithms in that it draws dendrograms based on the distance of data under a certain metric, and group them. It is widely used in all areas…

Instrumentation and Methods for Astrophysics · Physics 2022-11-14 Heng Yu , Xiaolan Hou

Stability for layer points

In the first half this paper, we generalize the theory of layer points for Lesnick- (or degree-Rips-) complexes to the more general context of $\vec{v}$-hierarchical clusterings. Layer points provide a compressed description of a…

Statistics Theory · Mathematics 2021-09-07 Katharine L. M. Adamyk

Time Series Clustering Using DBSCAN

Economic policy and research rely on the correct evaluation of the billions of high-frequency data points that we collect every day. Consistent clustering algorithms, like DBSCAN, allow us to make sense of the data in a useful way. However,…

Statistics Theory · Mathematics 2024-03-25 Nicholas Waltz

Persistent Multiscale Density-based Clustering

Clustering is a cornerstone of modern data analysis. Detecting clusters in exploratory data analyses (EDA) requires algorithms that make few assumptions about the data. Density-based clustering algorithms are particularly well-suited for…

Machine Learning · Computer Science 2026-02-03 Daniël Bot , Leland McInnes , Jan Aerts

FLASC: A Flare-Sensitive Clustering Algorithm

Clustering algorithms are often used to find subpopulations in exploratory data analysis workflows. Not only the clusters themselves, but also their shape can represent meaningful subpopulations. In this paper, we present FLASC, an…

Machine Learning · Computer Science 2025-04-23 D. M. Bot , J. Peeters , J. Liesenborgs , J. Aerts

A Stable Cardinality Distance for Topological Classification

This work incorporates topological features via persistence diagrams to classify point cloud data arising from materials science. Persistence diagrams are multisets summarizing the connectedness and holes of given data. A new distance on…

Machine Learning · Statistics 2019-11-11 Vasileios Maroulas , Cassie Putman Micucci , Adam Spannaus

Hierarchy Structure of Graphs and Weighted Condensations

By natural way the hierarchy structure is introduced on directed graphs with weighted adjacencies. Embedded system of algebras of subsets of the set of vertices of such digraph and it's consolidations, which vertices are the elementary sets…

Combinatorics · Mathematics 2007-05-23 V. A. Buslov

Clustering Stability: An Overview

A popular method for selecting the number of clusters is based on stability arguments: one chooses the number of clusters such that the corresponding clustering results are "most stable". In recent years, a series of papers has analyzed the…

Machine Learning · Statistics 2010-07-08 Ulrike von Luxburg

Efficient Computation of Multiple Density-Based Clustering Hierarchies

HDBSCAN*, a state-of-the-art density-based hierarchical clustering method, produces a hierarchical organization of clusters in a dataset w.r.t. a parameter mpts. While the performance of HDBSCAN* is robust w.r.t. mpts in the sense that a…

Databases · Computer Science 2018-06-11 Antonio Cavalcante Araujo Neto , Joerg Sander , Ricardo J. G. B. Campello , Mario A. Nascimento

ADBSCAN: Adaptive Density-Based Spatial Clustering of Applications with Noise for Identifying Clusters with Varying Densities

Density-based spatial clustering of applications with noise (DBSCAN) is a data clustering algorithm which has the high-performance rate for dataset where clusters have the constant density of data points. One of the significant attributes…

Machine Learning · Computer Science 2019-02-06 Mohammad Mahmudur Rahman Khan , Md. Abu Bakr Siddique , Rezoana Bente Arif , Mahjabin Rahman Oishe

LINSCAN -- A Linearity Based Clustering Algorithm

DBSCAN and OPTICS are powerful algorithms for identifying clusters of points in domains where few assumptions can be made about the structure of the data. In this paper, we leverage these strengths and introduce a new algorithm, LINSCAN,…

Machine Learning · Computer Science 2026-04-15 Andrew Dennehy , Xiaoyu Zou , Shabnam J. Semnani , Yuri Fialko , Alexander Cloninger

Cluster Strong Lensing with Hierarchical Inference

Lensing by galaxy clusters is a versatile probe of cosmology and extragalactic astrophysics, but the accuracy of some of its predictions is limited by the simplified models adopted to reduce the (otherwise untractable) number of degrees of…

Cosmology and Nongalactic Astrophysics · Physics 2021-04-28 Pietro Bergamini , Adriano Agnello , Gabriel Bartosch Caminha

Detecting and analysing the topology of the cosmic web with spatial clustering algorithms I: Methods

In this paper we explore the use of spatial clustering algorithms as a new computational approach for modeling the cosmic web. We demonstrate that such algorithms are efficient in terms of computing time needed. We explore three distinct…

Instrumentation and Methods for Astrophysics · Physics 2022-09-14 Dimitrios Kelesis , Spyros Basilakos , Vicky Papadopoulou Lesta , Dimitris Fotakis , Andreas Efstathiou

ExDBSCAN: Explaining DBSCAN with Counterfactual Reasoning -- Additional Material

Clustering is an unsupervised technique for grouping data points by similarity. While explainability methods exist for supervised machine learning, they are not directly applicable to clustering, making it challenging to understand cluster…

Machine Learning · Computer Science 2026-05-29 Pernille Matthews , Lena Krieger , Tommaso Amico , Artur Zimek , Thomas Seidl , Ira Assent

Complexity hierarchies in Euclidean stars

We establish a hierarchy of Euclidean stars according to their degree of complexity, as measured by the complexity factor and the complexity of the pattern of evolution. We consider both, nondissipative and dissipative systems. Solutions…

General Relativity and Quantum Cosmology · Physics 2025-10-01 L. Herrera , A. Di Prisco , J. Ospino

Geometric reconstructions of density based clusterings

DBSCAN* and HDBSCAN* are well established density based clustering algorithms. However, obtaining the clusters of very large datasets is infeasible, limiting their use in real world applications. By exploiting the geometry of Euclidean…

Machine Learning · Computer Science 2022-03-16 A. L. Garcia-Pulido , K. P. Samardzhiev