Related papers: Directional density-based clustering

Hyperoctant Search Clustering: A Method for Clustering Data in High-Dimensional Hyperspheres

Clustering of high-dimensional data sets is a growing need in artificial intelligence, machine learning and pattern recognition. In this paper, we propose a new clustering method based on a combinatorial-topological approach applied to…

Machine Learning · Computer Science 2025-03-12 Mauricio Toledo-Acosta , Luis Ángel Ramos-García , Jorge Hermosillo-Valadez

Depth-based clustering analysis of directional data

A new depth-based clustering procedure for directional data is proposed. Such method is fully non-parametric and has the advantages to be flexible and applicable even in high dimensions when a suitable notion of depth is adopted. The…

Methodology · Statistics 2022-06-22 Giuseppe Pandolfo , Antonio D'ambrosio

Poisson Kernel-Based Clustering on the Sphere: Convergence Properties, Identifiability, and a Method of Sampling

Many applications of interest involve data that can be analyzed as unit vectors on a d-dimensional sphere. Specific examples include text mining, in particular clustering of documents, biology, astronomy and medicine among others. Previous…

Methodology · Statistics 2018-03-14 Mojgan Golzy , Marianthi Markatou

Kernel Smoothing, Mean Shift, and Their Learning Theory with Directional Data

Directional data consist of observations distributed on a (hyper)sphere, and appear in many applied fields, such as astronomy, ecology, and environmental science. This paper studies both statistical and computational problems of kernel…

Machine Learning · Statistics 2021-10-18 Yikun Zhang , Yen-Chi Chen

Density-Based Clustering with Kernel Diffusion

Finding a suitable density function is essential for density-based clustering algorithms such as DBSCAN and DPC. A naive density corresponding to the indicator function of a unit $d$-dimensional Euclidean ball is commonly used in these…

Machine Learning · Computer Science 2021-10-15 Chao Zheng , Yingjie Chen , Chong Chen , Jianqiang Huang , Xian-Sheng Hua

A Survey of Some Density Based Clustering Techniques

Density Based Clustering are a type of Clustering methods using in data mining for extracting previously unknown patterns from data sets. There are a number of density based clustering methods such as DBSCAN, OPTICS, DENCLUE, VDBSCAN,…

Machine Learning · Computer Science 2023-07-25 Rupanka Bhuyan , Samarjeet Borah

U-statistical inference for hierarchical clustering

Clustering methods are a valuable tool for the identification of patterns in high dimensional data with applications in many scientific problems. However, quantifying uncertainty in clustering is a challenging problem, particularly when…

Methodology · Statistics 2018-06-01 Marcio Valk , Gabriela Bettella Cybis

A Probabilistic $\ell_1$ Method for Clustering High Dimensional Data

In general, the clustering problem is NP-hard, and global optimality cannot be established for non-trivial instances. For high-dimensional data, distance-based methods for clustering or classification face an additional difficulty, the…

Statistics Theory · Mathematics 2016-04-26 Tsvetan Asamov , Adi Ben-Israel

Skeleton Clustering: Dimension-Free Density-based Clustering

We introduce a density-based clustering method called skeleton clustering that can detect clusters in multivariate and even high-dimensional data with irregular shapes. To bypass the curse of dimensionality, we propose surrogate density…

Machine Learning · Statistics 2023-03-09 Zeyu Wei , Yen-Chi Chen

Model--based clustering for spherical and hyper--spherical data using elliptically symmetric distributions

Model--based clustering for directional data data has attracted a lot of interest, but most methods utilize rotationally symmetric distributions. This paper suggests the use of elliptically symmetric distributions, namely the elliptically…

Methodology · Statistics 2026-05-28 Theodoros Perdikis , Nader Alharbi , Michail Tsagris

Pointwise density estimation on metric spaces and applications in seismology

We are studying the problem of estimating density in a wide range of metric spaces, including the Euclidean space, the sphere, the ball, and various Riemannian manifolds. Our framework involves a metric space with a doubling measure and a…

Statistics Theory · Mathematics 2023-04-04 Galatia Cleanthous , Athanasios G. Georgiadis , Philip A. White

Identifying the number of clusters for K-Means: A hypersphere density based approach

Application of K-Means algorithm is restricted by the fact that the number of clusters should be known beforehand. Previously suggested methods to solve this problem are either ad hoc or require parametric assumptions and complicated…

Machine Learning · Computer Science 2019-12-05 Sukavanan Nanjundan , Shreeviknesh Sankaran , C. R. Arjun , G. Paavai Anand

A new measure for assessment of clustering based on kernel density estimation

A new clustering accuracy measure is proposed to determine the unknown number of clusters and to assess the quality of clustering of a data set given in any dimensional space. Our validity index applies the classical nonparametric…

Methodology · Statistics 2022-02-15 Soumita Modak

Hyperspectral Image Clustering with Spatially-Regularized Ultrametrics

We propose a method for the unsupervised clustering of hyperspectral images based on spatially regularized spectral clustering with ultrametric path distances. The proposed method efficiently combines data density and geometry to…

Computer Vision and Pattern Recognition · Computer Science 2020-04-13 Shukun Zhang , James M. Murphy

Unimodal Strategies in Density-Based Clustering

Density-based clustering methods often surpass centroid-based counterparts, when addressing data with noise or arbitrary data distributions common in real-world problems. In this study, we reveal a key property intrinsic to density-based…

Machine Learning · Computer Science 2025-06-30 Oron Nir , Jay Tenenbaum , Ariel Shamir

Density based Spatial Clustering of Lines via Probabilistic Generation of Neighbourhood

Density based spatial clustering of points in $\mathbb{R}^n$ has a myriad of applications in a variety of industries. We generalise this problem to the density based clustering of lines in high-dimensional spaces, keeping in mind there…

Machine Learning · Computer Science 2024-10-04 Akanksha Das , Malay Bhattacharyya

Rate-optimal and computationally efficient nonparametric estimation on the circle and the sphere

We investigate the problem of density estimation on the unit circle and the unit sphere from a computational perspective. Our primary goal is to develop new density estimators that are both rate-optimal and computationally efficient for…

Statistics Theory · Mathematics 2026-05-08 Athanasios G. Georgiadis , Andrew P. Percival

Modal clustering of matrix-variate data

The nonparametric formulation of density-based clustering, known as modal clustering, draws a correspondence between groups and the attraction domains of the modes of the density function underlying the data. Its probabilistic foundation…

Methodology · Statistics 2020-10-27 Federico Ferraccioli , Giovanna Menardi

On a Distributed Approach for Density-based Clustering

Efficient extraction of useful knowledge from these data is still a challenge, mainly when the data is distributed, heterogeneous and of different quality depending on its corresponding local infrastructure. To reduce the overhead cost,…

Databases · Computer Science 2017-04-17 Nhien-An Le-Khac , M-Tahar Kechadi

Density-based clustering of social networks

The idea underlying the modal formulation of density-based clustering is to associate groups with the regions around the modes of the probability density function underlying the data. This correspondence between clusters and dense regions…

Social and Information Networks · Computer Science 2021-01-22 Giovanna Menardi , Domenico De Stefano