English
Related papers

Related papers: PANDORA: A Parallel Dendrogram Construction Algori…

200 papers

Computing a Single-Linkage Dendrogram (SLD) is a key step in the classic single-linkage hierarchical clustering algorithm. Given an input edge-weighted tree $T$, the SLD of $T$ is a binary dendrogram that summarizes the $n-1$ clusterings…

Data Structures and Algorithms · Computer Science 2024-05-14 Laxman Dhulipala , Xiaojun Dong , Kishen N Gowda , Yan Gu

This paper presents new parallel algorithms for generating Euclidean minimum spanning trees and spatial clustering hierarchies (known as HDBSCAN$^*$). Our approach is based on generating a well-separated pair decomposition followed by using…

Data Structures and Algorithms · Computer Science 2021-04-05 Yiqiu Wang , Shangdi Yu , Yan Gu , Julian Shun

Single-linkage clustering is a popular form of hierarchical agglomerative clustering (HAC) where the distance between two clusters is defined as the minimum distance between any pair of points across the two clusters. In single-linkage HAC,…

Data Structures and Algorithms · Computer Science 2025-06-24 Quinten De Man , Laxman Dhulipala , Kishen N Gowda

We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criteria. We also provide a comparative performance analysis of our…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-02-17 Ramin Javadi , Saleh Ashkboos

This paper studies the hierarchical clustering problem, where the goal is to produce a dendrogram that represents clusters at varying scales of a data set. We propose the ParChain framework for designing parallel hierarchical agglomerative…

Data Structures and Algorithms · Computer Science 2022-02-15 Shangdi Yu , Yiqiu Wang , Yan Gu , Laxman Dhulipala , Julian Shun

We present a new way to summarize and select mixture models via the hierarchical clustering tree (dendrogram) constructed from an overfitted latent mixing measure. Our proposed method bridges agglomerative hierarchical clustering and…

Methodology · Statistics 2024-03-11 Dat Do , Linh Do , Scott A. McKinley , Jonathan Terhorst , XuanLong Nguyen

"mdendro" is an R package that provides a comprehensive collection of linkage methods for agglomerative hierarchical clustering on a matrix of proximity data (distances or similarities), returning a multifurcated dendrogram or…

Information Retrieval · Computer Science 2025-09-16 Alberto Fernández , Sergio Gómez

We present an algorithm that allows for building left-balanced and complete k-d trees over k-dimensional points in a trivially parallel and GPU friendly way. Our algorithm requires exactly one int per data point as temporary storage, and…

Data Structures and Algorithms · Computer Science 2023-04-06 Ingo Wald

Dynamic programming (DP) algorithms, such as All-Pairs Shortest Path (APSP) and genomic sequence alignment, are fundamental to many scientific domains but are severely bottlenecked by data movement on conventional architectures. While…

Hardware Architecture · Computer Science 2026-03-02 Tsung-Han Lu , Weihong Xu , Tajana Rosing

As deep neural networks (DNNs) become deeper, the training time increases. In this perspective, multi-GPU parallel computing has become a key tool in accelerating the training of DNNs. In this paper, we introduce a novel methodology to…

Numerical Analysis · Mathematics 2024-07-08 Chang-Ock Lee , Youngkyu Lee , Jongho Park

We introduce a decomposition method for the distributed calculation of exact Euclidean Minimum Spanning Trees in high dimensions (where sub-quadratic algorithms are not effective), or more generalized geometric-minimum spanning trees of…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-06-05 Richard Lettich

Minimum Spanning Tree (MST) is an important graph algorithm that has wide ranging applications in the areas of computer networks, VLSI routing, wireless communications among others. Today virtually every computer is built out of multi-core…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-15 Suryanarayana Murthy Durbhakula

In this paper we present and evaluate a parallel algorithm for solving a minimum spanning tree (MST) problem for supercomputers with distributed memory. The algorithm relies on the relaxation of the message processing order requirement for…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-10-18 Artem Mazeev , Alexander Semenov , Alexey Simonov

We compare different methods for sampling from discrete probability distributions and introduce a new algorithm which is especially efficient on massively parallel processors, such as GPUs. The scheme preserves the distribution properties…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-02 Nikolaus Binder , Alexander Keller

In this paper, we present a novel massively parallel algorithm for accelerating the decision tree building procedure on GPUs (Graphics Processing Units), which is a crucial step in Gradient Boosted Decision Tree (GBDT) and random forests…

Machine Learning · Statistics 2017-06-27 Huan Zhang , Si Si , Cho-Jui Hsieh

Dynamic Parallelism (DP) is a runtime feature of the GPU programming model that allows GPU threads to execute additional GPU kernels, recursively. Apart from making the programming of parallel hierarchical patterns easier, DP can also…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-06-07 Felipe A. Quezada , Cristóbal A. Navarro , Miguel Romero , Cristhian Aguilera

DBSCAN is a well-known density-based clustering algorithm to discover arbitrary shape clusters. While conceptually simple in serial, the algorithm is challenging to efficiently parallelize on manycore GPU architectures. Common pitfalls,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-30 Andrey Prokopenko , Damien Lebrun-Grandie , Daniel Arndt

The minimum spanning tree clustering algorithm is capable of detecting clusters with irregular boundaries. In this paper we propose two minimum spanning trees based clustering algorithm. The first algorithm produces k clusters with center…

Other Computer Science · Computer Science 2010-05-26 S. John Peter , S. P. Victor

Our formulation reveals that the reduction across the sequence axis can be efficiently computed in parallel through a tree reduction. Our algorithm, called Tree Attention, for parallelizing exact attention computation across multiple GPUs…

Machine Learning · Computer Science 2025-02-11 Vasudev Shyam , Jonathan Pilault , Emily Shepperd , Quentin Anthony , Beren Millidge

Among the many possible approaches for the parallelization of self-organizing networks, and in particular of growing self-organizing networks, perhaps the most common one is producing an optimized, parallel implementation of the standard…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-31 Giacomo Parigi , Angelo Stramieri , Danilo Pau , Marco Piastra
‹ Prev 1 2 3 10 Next ›