Related papers: Implementation of a Parallel Tree Method on a GPU

200 papers

The kd-tree is a fundamental tool in computer science. Among others, an application of the kd-tree search (oct-tree method) to fast evaluation of particle interactions and neighbor search is highly important since computational complexity…

Instrumentation and Methods for Astrophysics · Physics 2009-09-04 N. Nakasato

We present an algorithm that allows for building left-balanced and complete k-d trees over k-dimensional points in a trivially parallel and GPU friendly way. Our algorithm requires exactly one int per data point as temporary storage, and…

Data Structures and Algorithms · Computer Science 2023-04-06 Ingo Wald

We present a new very fast tree-code which runs on massively parallel Graphical Processing Units (GPU) with NVIDIA CUDA architecture. The tree-construction and calculation of multipole moments is carried out on the host CPU, while the force…

Instrumentation and Methods for Astrophysics · Physics 2010-10-15 Evghenii Gaburov, Jeroen Bédorf, Simon Portegies Zwart

We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criteria. We also provide a comparative performance analysis of our…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-02-17 Ramin Javadi, Saleh Ashkboos

We present parallel algorithms for constructing and traversing sparse octrees on graphics processing units (GPUs). The algorithms are based on parallel-scan and sort methods. To test the performance and feasibility, we implemented them in…

Instrumentation and Methods for Astrophysics · Physics 2012-04-11 Jeroen Bédorf, Evghenii Gaburov, Simon Portegies Zwart

We present a parallel algorithm for computing the treewidth of a graph on a GPU. We implement this algorithm in OpenCL, and experimentally evaluate its performance. Our algorithm is based on an $O^*(2^{n})$-time algorithm that explores the…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-09-29 Tom C. van der Zanden, Hans L. Bodlaender

We present a GPU solution for exact maximal clique enumeration (MCE) that performs a search tree traversal following the Bron-Kerbosch algorithm. Prior works on parallelizing MCE on GPUs perform a breadth-first traversal of the tree, which…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-25 Mohammad Almasri, Yen-Hsiang Chang, Izzat El Hajj, Rakesh Nagi, Jinjun Xiong, Wen-mei Hwu

Finding small vertex covers in a graph has applications in numerous domains. Two common formulations of the problem include: Minimum Vertex Cover, which finds the smallest vertex cover in a graph, and Parameterized Vertex Cover, which finds…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-04-25 Peter Yamout, Karim Barada, Adnan Jaljuli, Amer E. Mouawad, Izzat El Hajj

Processing moving object trajectories arises in many application domains and has been addressed by practitioners in the spatiotemporal database and Geographical Information System communities. In this work, we focus on a trajectory…

Databases · Computer Science 2014-09-16 Michael G. Gowanlock, Henri Casanova

The $k$d-tree is one of the most widely used data structures to manage multi-dimensional data. Due to the ever-growing data volume, it is imperative to consider parallelism in $k$d-trees. However, we observed challenges in existing parallel…

Data Structures and Algorithms · Computer Science 2025-01-08 Ziyang Men, Zheqi Shen, Yan Gu, Yihan Sun

We propose a hybrid tree algorithm for reducing calculation and communication cost of collision-less N-body simulations. The concept of our algorithm is that we split interaction force into two parts: hard-force from neighbor particles and…

Instrumentation and Methods for Astrophysics · Physics 2014-06-25 Tsuyoshi Watanabe, Naohito Nakasato

I present a new GPU implementation of the wavelet tree data structure. It includes binary rank and select support structures that provide at least 10 times higher throughput of binary rank and select queries than the best publicly available…

Data Structures and Algorithms · Computer Science 2025-05-07 Marco Franzreb, Martin Burtscher, Stephan Rudolph

High-performance implementations of $k$-Nearest Neighbor Search ($k$NN) in low dimensions use tree-based data structures. Tree algorithms are hard to parallelize on GPUs due to their irregularity. However, newer Nvidia GPUs offer hardware…

Graphics · Computer Science 2024-04-23 Durga Mandarapu, Vani Nagarajan, Artem Pelenitsyn, Milind Kulkarni

Computing $k$-Nearest Neighbors (KNN) is one of the core kernels used in many machine learning, data mining and scientific computing applications. Although kd-tree based $O(\log n)$ algorithms have been proposed for computing KNN, due to…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-11-17 Md. Mostofa Ali Patwary, Nadathur Rajagopalan Satish, Narayanan Sundaram, Jialin Liu, Peter Sadowski, Evan Racah, Suren Byna, Craig Tull, Wahid Bhimji, Prabhat, Pradeep Dubey

We examine the problem of optimizing classification tree evaluation for on-line and real-time applications by using GPUs. Looking at trees with continuous attributes often used in image segmentation, we first put the existing algorithms for…

Distributed, Parallel, and Cluster Computing · Computer Science 2011-11-08 Jason Spencer

The rapid advancement of GPU technology has unlocked powerful parallel processing capabilities, creating new opportunities to enhance classic search algorithms. This hardware has been exploited in best-first search algorithms with neural…

Artificial Intelligence · Computer Science 2025-11-18 Ehsan Futuhi, Nathan R. Sturtevant

Similarity search, the task of identifying objects most similar to a given query object under a specific metric, has gathered significant attention due to its practical applications. However, the absence of coordinate information to…

Databases · Computer Science 2024-05-14 Yifan Zhu, Ruiyao Ma, Baihua Zheng, Xiangyu Ke, Lu Chen, Yunjun Gao

A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-12-10 Fabian Gieseke, Cosmin Eugen Oancea, Ashish Mahabal, Christian Igel, Tom Heskes

An optimal data partitioning in parallel & distributed implementation of clustering algorithms is a necessary computation as it ensures independent task completion, fair distribution, less number of affected points and better & faster…

Artificial Intelligence · Computer Science 2016-09-21 Saraswati Mishra, Avnish Chandra Suman

Algorithms based on spatial tree traversal are widely regarded as among the most efficient and flexible approaches for many problems in CPU-based high-performance computing (HPC). However, directly transferring these algorithms to GPU…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-08 Jens Stücker, Oliver Hahn, Lukas Winkler, Adrian Gutierrez Adame, Thomas Flöss