English
Related papers

Related papers: Heterogeneous computing in a strongly-connected CP…

200 papers

CPU-GPU heterogeneous architectures are now commonly used in a wide variety of computing systems from mobile devices to supercomputers. Maximizing the throughput for multi-programmed workloads on such systems is indispensable as one single…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-08 Issa Saba , Eishi Arima , Dai Liu , Martin Schulz

Applications that exploit the architectural details of high-performance computing (HPC) systems have become increasingly invaluable in academia and industry over the past two decades. The most important hardware development of the last…

Mathematical Software · Computer Science 2020-05-15 Daniel J. Magee , Anthony S. Walker , Kyle E. Niemeyer

The widely-adopted practice is to train deep learning models with specialized hardware accelerators, e.g., GPUs or TPUs, due to their superior performance on linear algebra operations. However, this strategy does not employ effectively the…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-04-21 Yujing Ma , Florin Rusu

This work deals with the CPU-GPU heterogeneous code acceleration of a finite-volume CFD solver utilizing multiple CPUs and GPUs at the same time. First, a high-level description of the CFD solver called SENSEI, the discretization of SENSEI,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-05-30 Weicheng Xue , Hongyu Wang , Christopher J. Roy

In recent years, as the demand for low energy and high performance computing has steadily increased, heterogeneous computing has emerged as an important and promising solution. Because most workloads can typically run most efficiently on…

Performance · Computer Science 2017-12-11 Zhuo Chen , Diana Marculescu

A high fidelity flow simulation for complex geometries for high Reynolds number ($Re$) flow is still very challenging, which requires more powerful computational capability of HPC system. However, the development of HPC with traditional CPU…

Computational Physics · Physics 2022-03-03 Chuangchao Ye , Pengjunyi Zhang , Rui Yan , Dejun Sun , Zhenhua Wan

This paper consists of three parts. The first part provides a unified programming model for heterogeneous computing with CPU and accelerator (like GPU, FPGA, Google TPU, Atos QPU, and more) technologies. To some extent, this new programming…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-31 Yuqing Xiong

This paper presents a methodology for simultaneous heterogeneous computing, named ENEAC, where a quad core ARM Cortex-A53 CPU works in tandem with a preprogrammed on-board FPGA accelerator. A heterogeneous scheduler distributes the tasks…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-11-16 Kris Nikov , Mohammad Hosseinabady , Rafael Asenjo , Andrés Rodríguezz , Angeles Navarro , Jose Nunez-Yanez

The Preconditioned Conjugate Gradient (PCG) method is widely used for solving linear systems of equations with sparse matrices. A recent version of PCG, Pipelined PCG, eliminates the dependencies in the computations of the PCG algorithm so…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-05-14 Manasi Tiwari , Sathish Vadhiyar

The geometric multigrid method (GMG) is one of the most efficient solving techniques for discrete algebraic systems arising from elliptic partial differential equations. GMG utilizes a hierarchy of grids or discretizations and reduces the…

Numerical Analysis · Mathematics 2013-01-14 Chunsheng Feng , Shi Shu , Jinchao Xu , Chen-Song Zhang

Nonlinear time-history evolution problems employing high-fidelity physical models are essential in numerous scientific domains. However, these problems face a critical dual bottleneck: the immense computational cost of time-stepping and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-06 Tsuyoshi Ichimura , Kohei Fujita , Hideaki Ito , Muneo Hori , Lalith Maddegedara

Many important real-world applications, such as System Identification with Gaussian Processes, involve solving linear systems with symmetric positive-definite matrices. The iterative CG method and direct solvers based on the Cholesky…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-14 Tim Thüring , Alexander Strack , Dirk Pflüger

The paradigm shift towards multi-core and heterogeneous computing, driven by the fundamental power and thermal limits of single-core processors, has established energy efficiency as a first-class design constraint in high-performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-30 Mufakir Qamar Ansari , Mudabir Qamar Ansari

Evolutionary computing (EC) has proven to be effective in solving complex optimization and robotics problems. Unfortunately, typical Evolutionary Algorithms (EAs) are constrained by the computational capacity available to researchers. More…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-19 Rustam Eynaliyev , Houcen Liu

Heterogeneous computing systems provide high performance and energy efficiency. However, to optimally utilize such systems, solutions that distribute the work across host CPUs and accelerating devices are needed. In this paper, we present a…

Software Engineering · Computer Science 2021-06-04 Suejb Memeti , Sabri Pllana

We advocate a domain specific software development methodology for heterogeneous computing platforms such as Multicore CPUs, GPUs and FPGAs. We argue that three specific benefits are realised from adopting such an approach: portable,…

Computational Engineering, Finance, and Science · Computer Science 2014-08-22 Gordon Inggs , David Thomas , Wayne Luk

The edge computing paradigm has emerged to handle cloud computing issues such as scalability, security and low response time among others. This new computing trend heavily relies on ubiquitous embedded systems on the edge. Performance and…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-28 Mohammad Hosseinabady , Mohd Amiruddin Bin Zainol , Jose Nunez-Yanez

For computational fluid dynamics (CFD) applications with a large number of grid points/cells, parallel computing is a common efficient strategy to reduce the computational time. How to achieve the best performance in the modern…

Performance · Computer Science 2018-03-12 Yong-Xian Wang , Li-Lun Zhang , Wei Liu , Xing-Hua Cheng , Yu Zhuang , Anthony T. Chronopoulos

Current supercomputers often have a heterogeneous architecture using both CPUs and GPUs. At the same time, numerical simulation tasks frequently involve multiphysics scenarios whose components run on different hardware due to multiple…

Computational Engineering, Finance, and Science · Computer Science 2024-12-10 Samuel Kemmler , Christoph Rettinger , Ulrich Rüde , Pablo Cuéllar , Harald Köstler

Heterogeneous supercomputers have become the standard in HPC. GPUs in particular have dominated the accelerator landscape, offering unprecedented performance in parallel workloads and unlocking new possibilities in fields like AI and…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-08-27 Luigi Fusco , Mikhail Khalilov , Marcin Chrapek , Giridhar Chukkapalli , Thomas Schulthess , Torsten Hoefler
‹ Prev 1 2 3 10 Next ›