English
Related papers

Related papers: Theoretically-Efficient and Practical Parallel DBS…

200 papers

DBSCAN is a popular density-based clustering algorithm that has many different applications in practice. However, the running time of DBSCAN in high-dimensional space or general metric space ({\em e.g.,} clustering a set of texts by using…

Data Structures and Algorithms · Computer Science 2025-01-07 Guanlin Mo , Shihong Song , Hu Ding

DBSCAN is a fundamental spatial clustering algorithm with numerous practical applications. However, a bottleneck of the algorithm is in the worst case, the run time complexity is $O(n^2)$. To address this limitation, we propose a new…

Databases · Computer Science 2022-11-08 Xiaogang Huang , Tiefeng Ma , Conan Liu , Shuangzhe Liu

DBSCAN is a very classic algorithm for data clus- tering, which is widely used in many fields. However, with the data scale growing much more bigger than before, the traditional serial algorithm can not meet the performance requirement.…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-06-09 Bingchen Wang , Chenglong Zhang , Lei Song , Lianhe Zhao , Yu Dou , Zihao Yu

Clustering is a fundamental task in machine learning. One of the most successful and broadly used algorithms is DBSCAN, a density-based clustering algorithm. DBSCAN requires $\epsilon$-nearest neighbor graphs of the input dataset, which are…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-12 Youguang Chen , William Ruys , George Biros

This paper presents new parallel algorithms for generating Euclidean minimum spanning trees and spatial clustering hierarchies (known as HDBSCAN$^*$). Our approach is based on generating a well-separated pair decomposition followed by using…

Data Structures and Algorithms · Computer Science 2021-04-05 Yiqiu Wang , Shangdi Yu , Yan Gu , Julian Shun

DBSCAN is a classical density-based clustering procedure with tremendous practical relevance. However, DBSCAN implicitly needs to compute the empirical density for each sample point, leading to a quadratic worst-case time complexity, which…

Machine Learning · Computer Science 2019-05-21 Jennifer Jang , Heinrich Jiang

The density based clustering method {\em Density-Based Spatial Clustering of Applications with Noise (DBSCAN)} is a popular method for outlier recognition and has received tremendous attention from many different areas. A major issue of the…

Computational Geometry · Computer Science 2020-02-28 Hu Ding , Fan Yang

Clustering multidimensional points is a fundamental data mining task, with applications in many fields, such as astronomy, neuroscience, bioinformatics, and computer vision. The goal of clustering algorithms is to group similar objects…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-22 Yihao Huang , Shangdi Yu , Julian Shun

We present a new algorithm for the widely used density-based clustering method DBscan. Our algorithm computes the DBscan-clustering in $O(n\log n)$ time in $\mathbb{R}^2$, irrespective of the scale parameter $\varepsilon$ (and assuming the…

Computational Geometry · Computer Science 2017-03-01 Mark de Berg , Ade Gunawan , Marcel Roeloffzen

DBSCAN is one of the most important non-parametric unsupervised data analysis tools. By applying DBSCAN to a dataset, two key analytical results can be obtained: (1) clustering data points based on density distribution and (2) identifying…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 Yongyu Wang

DBSCAN is a well-known density-based clustering algorithm to discover arbitrary shape clusters. While conceptually simple in serial, the algorithm is challenging to efficiently parallelize on manycore GPU architectures. Common pitfalls,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-30 Andrey Prokopenko , Damien Lebrun-Grandie , Daniel Arndt

DBSCAN is a popular density-based clustering algorithm. It computes the $\epsilon$-neighborhood graph of a dataset and uses the connected components of the high-degree nodes to decide the clusters. However, the full neighborhood graph may…

Machine Learning · Computer Science 2020-10-23 Heinrich Jiang , Jennifer Jang , Jakub Łącki

DBSCAN is a typically used clustering algorithm due to its clustering ability for arbitrarily-shaped clusters and its robustness to outliers. Generally, the complexity of DBSCAN is O(n^2) in the worst case, and it practically becomes more…

Databases · Computer Science 2018-01-23 Thapana Boonchoo , Xiang Ao , Qing He

Density-based clustering has found numerous applications across various domains. The Density-Based Spatial Clustering of Applications with Noise (DBSCAN) algorithm is capable of finding clusters of varied shapes that are not linearly…

Databases · Computer Science 2019-12-03 Vinayak Mathur , Jinesh Mehta , Sanjay Singh

We propose a fast and dynamic algorithm for Density-Based Spatial Clustering of Applications with Noise (DBSCAN) that efficiently supports online updates. Traditional DBSCAN algorithms, designed for batch processing, become computationally…

Machine Learning · Computer Science 2025-03-12 Seiyun Shin , Ilan Shomorony , Peter Macgregor

Clustering multi-dimensional points is a fundamental task in many fields, and density-based clustering supports many applications as it can discover clusters of arbitrary shapes. This paper addresses the problem of Density-Peaks Clustering…

Databases · Computer Science 2022-12-01 Daichi Amagata , Takahiro Hara

DBSCAN is an algorithm that performs clustering in the presence of noise. In this paper, we provide two constructions that allow DBSCAN to be implemented neuromorphically, using spiking neural networks. The first construction is termed…

Neural and Evolutionary Computing · Computer Science 2024-09-24 Charles P. Rizzo , James S. Plank

This paper studies the nucleus decomposition problem, which has been shown to be useful in finding dense substructures in graphs. We present a novel parallel algorithm that is efficient both in theory and in practice. Our algorithm achieves…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-12 Jessica Shi , Laxman Dhulipala , Julian Shun

SCAN (Structural Clustering Algorithm for Networks) is a well-studied, widely used graph clustering algorithm. For large graphs, however, sequential SCAN variants are prohibitively slow, and parallel SCAN variants do not effectively share…

Databases · Computer Science 2021-04-01 Tom Tseng , Laxman Dhulipala , Julian Shun

We present PS-DBSCAN, a communication efficient parallel DBSCAN algorithm that combines the disjoint-set data structure and Parameter Server framework in Platform of AI (PAI). Since data points within the same cluster may be distributed…

Databases · Computer Science 2017-11-06 Xu Hu , Jun Huang , Minghui Qiu , Cen Chen , Wei Chu
‹ Prev 1 2 3 10 Next ›