English
Related papers

Related papers: Performance Acceleration of Kernel Polynomial Meth…

200 papers

The Kernel Polynomial Method (KPM) is a well-established scheme in quantum physics and quantum chemistry to determine the eigenvalue density and spectral properties of large sparse matrices. In this work we demonstrate the high optimization…

Computational Engineering, Finance, and Science · Computer Science 2015-07-30 Moritz Kreutzer , Georg Hager , Gerhard Wellein , Andreas Pieper , Andreas Alvermann , Holger Fehske

Parallel algorithms on CPU and GPU are implemented for the Unified Gas-Kinetic Scheme and their performances are investigated and compared by a two dimensional channel flow case. The parallel CPU algorithm has a one dimensional block…

Computational Physics · Physics 2018-11-02 Jizhou Liu , Fang Q. Hu , Xiaodong Li

In atomistic spin dynamics simulations, the time cost of constructing the space- and time-displaced pair correlation function in real space increases quadratically as the number of spins $N$, leading to significant computational effort. The…

Computational Physics · Physics 2023-08-16 Hongwei Chen , Shiyang Chen , Joshua J. Turner , Adrian Feiguin

We present a GPU-accelerated cosmological simulation code, PhotoNs-GPU, based on algorithm of Particle Mesh Fast Multipole Method (PM-FMM), and focus on the GPU utilization and optimization. A proper interpolated method for truncated…

Instrumentation and Methods for Astrophysics · Physics 2021-12-28 Qiao Wang , Chen Meng

The conductor-like polarization model (C-PCM) with switching/Gaussian smooth discretization is a widely used implicit solvation model in chemical simulations. However, its application in quantum mechanical calculations of large-scale…

Chemical Physics · Physics 2015-07-03 Fang Liu , Nathan Luehr , Heather J. Kulik , Todd J. Martínez

We accelerated an ab-initio molecular QMC calculation by using GPGPU. Only the bottle-neck part of the calculation is replaced by CUDA subroutine and performed on GPU. The performance on a (single core CPU + GPU) is compared with that on a…

Computational Physics · Physics 2012-04-06 Yutaka Uejima , Tomoharu Terashima , Ryo Maezono

We provide a preliminary study on utilizing GPU (Graphics Processing Unit) to accelerate computation for three simulation optimization tasks with either first-order or second-order algorithms. Compared to the implementation using only CPU…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-04-19 Jinghai He , Haoyu Liu , Yuhang Wu , Zeyu Zheng , Tingyu Zhu

Quantum computers are becoming practical for computing numerous applications. However, simulating quantum computing on classical computers is still demanding yet useful because current quantum computers are limited because of computer…

Quantum Physics · Physics 2023-08-08 Jun Doi , Hiroshi Horii , Christopher Wood

In this paper we solve on GPUs massive problems with large amount of data, which are not appropriate for solution with the SIMD technology. For the given problem we consider a three-level parallelization. The multithreading of CPU is used…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-02-18 Natalya Litvinenko

The article introduces a new method for applying Quantum Clustering to graph structures. Quantum Clustering (QC) is a novel density-based unsupervised learning method that determines cluster centers by constructing a potential function. In…

Machine Learning · Computer Science 2025-01-17 Zhe Wang , ZhiJie He , Ding Liu

The paper considers the problem of implementation on graphics processors of numerical integration routines for higher order finite element approximations. The design of suitable GPU kernels is investigated in the context of general purpose…

Mathematical Software · Computer Science 2014-03-03 Krzysztof Banaś , Przemysław Płaszewski , Paweł Macioł

This paper presents a Graphics Processing Units (GPUs) acceleration method of an iterative scheme for gas-kinetic model equations. Unlike the previous GPU parallelization of explicit kinetic schemes, this work features a fast converging…

Computational Physics · Physics 2020-01-08 Lianhua Zhu , Peng Wang , Songze Chen , Zhaoli Guo , Yonghao Zhang

Kernels are executable code segments and kernel fusion is a technique for combing the segments in a coherent manner to improve execution time. For the first time, we have developed a technique to fuse image processing kernels to be executed…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-09-16 Asif M Adnan , Sridhar Radhakrishnan , Suleyman Karabuk

Parallel computing is a standard approach to achieving high-performance computing (HPC). Three commonly used methods to implement parallel computing include: 1) applying multithreading technology on single-core or multi-core CPUs; 2)…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-09-18 Xinyao Yi

Kernel matrix-vector multiplication (KMVM) is a foundational operation in machine learning and scientific computing. However, as KMVM tends to scale quadratically in both memory and time, applications are often limited by these…

Numerical Analysis · Mathematics 2025-02-25 Robert Hu , Siu Lun Chau , Dino Sejdinovic , Joan Alexis Glaunès

Clustering is an important tool in data analysis, with K-means being popular for its simplicity and versatility. However, it cannot handle non-linearly separable clusters. Kernel K-means addresses this limitation but requires a large kernel…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-01-29 Julian Bellavita , Matthew Rubino , Nakul Iyer , Andrew Chang , Aditya Devarakonda , Flavio Vella , Giulia Guidi

The use of graphics processing units for scientific computations is an emerging strategy that can significantly speed up various different algorithms. In this review, we discuss advances made in the field of computational physics, focusing…

Computational Physics · Physics 2013-03-07 Ari Harju , Topi Siro , Filippo Federici-Canova , Samuli Hakala , Teemu Rantalaiho

Matrix multiplication is a foundational operation in scientific computing and machine learning, yet its computational complexity makes it a significant bottleneck for large-scale applications. The shift to parallel architectures, primarily…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-30 Mufakir Qamar Ansari , Mudabir Qamar Ansari

The Cellular Potts Model (CPM) is a widely used simulation paradigm for systems of interacting cells that has been used to study scenarios ranging from plant development to morphogenesis, tumour growth and cell migration. Despite their wide…

Tissues and Organs · Quantitative Biology 2023-12-18 Shabaz Sultan , Sapna Devi , Scott N. Mueller , Johannes Textor

Classical optimization algorithms in machine learning often take a long time to compute when applied to a multi-dimensional problem and require a huge amount of CPU and GPU resource. Quantum parallelism has a potential to speed up machine…

Quantum Physics · Physics 2019-11-21 Venkat R. Dasari , Mee Seong Im , Lubjana Beshaj
‹ Prev 1 2 3 10 Next ›