Related papers: Weighted Random Sampling on GPUs

200 papers

Data structures for efficient sampling from a set of weighted items are an important building block of many applications. However, few parallel solutions are known. We close many of these gaps both for shared-memory and distributed-memory…

Data Structures and Algorithms · Computer Science 2021-07-20 Lorenz Hübschle-Schneider, Peter Sanders

Principal component analysis (PCA) is a statistical technique commonly used in multivariate data analysis. However, PCA can be difficult to interpret and explain since the principal components (PCs) are linear combinations of the original…

Mathematical Software · Computer Science 2013-12-24 W. Liu, H. Zhang, D. Tao, Y. Wang, K. Lu

In this work we propose a highly optimized version of a simulated annealing (SA) algorithm adapted to the more recently developed Graphic Processor Units (GPUs). The programming has been carried out with CUDA toolkit, specially designed for…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-02 A. M. Ferreiro, J. A. García, J. G. López-Salas, C. Vázquez

Hash tables are used in a plethora of applications, including database operations, DNA sequencing, string searching, and many more. As such, there are many parallelized hash tables targeting multicore, distributed, and accelerator-based…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-04-05 Alok Tripathy, Oded Green

Correlation Power Analysis (CPA) is a type of power analysis based side channel attack that can be used to derive the secret key of encryption algorithms including DES (Data Encryption Standard) and AES (Advanced Encryption Standard). A…

Performance · Computer Science 2014-12-25 Hasindu Gamaarachchi, Roshan Ragel, Darshana Jayasinghe

Selected inversion is essential for applications such as Bayesian inference, electronic structure calculations, and inverse covariance estimation, where computing only specific elements of large sparse matrix inverses significantly reduces…

Performance · Computer Science 2025-09-03 Esmail Abdul Fattah, Hatem Ltaief, Havard Rue, David Keyes

The quadratic assignment problem (QAP) is one of the most difficult combinatorial optimization problems. An effective heuristic for obtaining approximate solutions to the QAP is simulated annealing (SA). Here we describe an SA…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-03-20 Gerald Paul

Approximate Nearest Neighbor Search (ANNS) plays a critical role in various disciplines spanning data mining and artificial intelligence, from information retrieval and computer vision to natural language processing and recommender systems.…

Data Structures and Algorithms · Computer Science 2024-07-10 Hiroyuki Ootomo, Akira Naruse, Corey Nolet, Ray Wang, Tamas Feher, Yong Wang

In the context of the genome-wide association studies (GWAS), one has to solve long sequences of generalized least-squares problems; such a task has two limiting factors: execution time --often in the range of days or weeks-- and data…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-05-02 Lucas Beyer, Paolo Bientinesi

The Random Phase Approximation (RPA) for correlation energy in the grid-based projector augmented wave (gpaw) code is accelerated by porting to the Graphics Processing Unit (GPU) architecture. The acceleration is achieved by grouping…

Computational Physics · Physics 2013-07-31 Jun Yan, Lin Li, Christopher O'Grady

Adaptive simulated annealing (ASA) is a global optimization algorithm based on an associated proof that the parameter space can be sampled much more efficiently than by using other previous simulated annealing algorithms. The author's ASA…

Mathematical Software · Computer Science 2007-05-23 Lester Ingber

Population annealing is a promising recent approach for Monte Carlo simulations in statistical physics, in particular for the simulation of systems with complex free-energy landscapes. It is a hybrid method, combining importance sampling…

Computational Physics · Physics 2017-09-14 Lev Yu. Barash, Martin Weigel, Michal Borovský, Wolfhard Janke, Lev N. Shchur

We describe an adaptation of the simulated annealing algorithm to nonparametric clustering and related probabilistic models. This new algorithm learns nonparametric latent structure over a growing and constantly churning subsample of…

Machine Learning · Statistics 2019-10-25 Fritz Obermeyer, Jonathan Glidden, Eric Jonas

Scheduling a task graph representing an application over a heterogeneous network of computers is a fundamental problem in distributed computing. It is known to be not only NP-hard but also not polynomial-time approximable within a constant…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-04-22 Jared Coleman, Bhaskar Krishnamachari

Principal component analysis (PCA) is a key statistical technique for multivariate data analysis. For large data sets the common approach to PCA computation is based on the standard NIPALS-PCA algorithm, which unfortunately suffers from…

Quantitative Methods · Quantitative Biology 2008-11-10 M. Andrecut

We present a scalable dissipative particle dynamics simulation code, fully implemented on the Graphics Processing Units (GPUs) using a hybrid CUDA/MPI programming model, which achieves 10-30 times speedup on a single GPU over 16 CPU cores…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-12-01 Yu-Hang Tang, George Em Karniadakis

It has been widely accepted that the Graphics Processing Unit (GPU) is one of the promising schemes for encryption acceleration; in particular, the support of complex mathematical calculations such as integer and logical operations makes the…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-15 Canhui Wang, Xiaowen Chu

Modern parallel computing devices, such as the graphics processing unit (GPU), have gained significant traction in scientific and statistical computing. They are particularly well-suited to data-parallel algorithms such as the particle…

Computation · Statistics 2015-06-12 Lawrence M. Murray, Anthony Lee, Pierre E. Jacob

Acceleration of cryptographic applications on massively parallel computing platforms, such as Graphics Processing Units (GPUs), becomes a real challenge as their decreasing cost and mass production makes practical implementations…

Cryptography and Security · Computer Science 2013-05-17 Jean-Marie Chauvet, Eric Mahé

Stochastic simulation techniques employed for the analysis of portfolios of insurance/reinsurance risk, often referred to as 'Aggregate Risk Analysis', can benefit from exploiting state-of-the-art high-performance computing platforms. In…

Distributed, Parallel, and Cluster Computing · Computer Science 2013-08-19 A. K. Bahl, O. Baltzer, A. Rau-Chaplin, B. Varghese, A. Whiteway