Related papers: Efficient channelization on a Graphics Processing …

200 papers

The Graphics Processing Unit (GPU) has become an integral part of astronomical instrumentation, enabling high-performance online data reduction and accelerated online signal processing. In this paper, we describe a wide-band reconfigurable…

Instrumentation and Methods for Astrophysics · Physics 2015-06-23 Jayanth Chennamangalam, Simon Scott, Glenn Jones, Hong Chen, John Ford, Amanda Kepley, D. R. Lorimer, Jun Nie, Richard Prestage, D. Anish Roshi, Mark Wagner, Dan Werthimer

Graphics Processing Units (GPUs) have been successfully adopted both for graphics computation in 3D applications and for general-purpose applications (GP-GPUs), thanks to their tremendous performance-per-watt. Recently, there is a big…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-03 Paolo Burgio

In this memo we investigate the applicability of NVIDIA Graphics Processing Units (GPUs) for SKA1-Low station and Central Signal Processing (CSP)-level processing. Station-level processing primarily involves generating a single station beam…

Instrumentation and Methods for Astrophysics · Physics 2014-08-14 Alessio Magro, Kristian Zarb Adami, Steve Ord

We present a highly parallel implementation of the cross-correlation of time-series data using graphics processing units (GPUs), which is scalable to hundreds of independent inputs and suitable for the processing of signals from "Large-N"…

Instrumentation and Methods for Astrophysics · Physics 2011-08-02 M. A. Clark, P. C. La Plante, L. J. Greenhill

This paper discusses the potential of graphics processing units (GPUs) in high-dimensional optimization problems. A single GPU card with hundreds of arithmetic cores can be inserted in a personal computer and dramatically accelerates many…

Computation · Statistics 2015-03-13 Hua Zhou, Kenneth Lange, Marc A. Suchard

GPU-based beamforming is a relatively unexplored area in radio astronomy, possibly due to the assumption that any such system will be severely limited by the PCIe bandwidth required to transfer data to the GPU. We have developed a…

Instrumentation and Methods for Astrophysics · Physics 2014-12-17 Alessio Magro, Kristian Zarb Adami, Jack Hickish

We present a single-node, multi-GPU programmable graph processing library that allows programmers to easily extend single-GPU graph algorithms to achieve scalable performance on large graphs with billions of edges. Directly using the…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-03-02 Yuechao Pan, Yangzihao Wang, Yuduo Wu, Carl Yang, John D. Owens

Highly parallel frameworks have proved to be very suitable for graph processing. There are various works that optimize the implementation on FPGAs, a pipeline-parallel device. The key to making use of the parallel performance of FPGAs is to…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-02 Chengbo Yang

Video and image streaming on edge devices requires low latency. To address this, Neural Networks (NNs) are widely used, and prior work mainly focuses on accelerating them with single hardware units such as Graphics Processing Units (GPUs),…

Hardware Architecture · Computer Science 2026-05-04 Ali Emre Oztas, Mahir Demir, James Garside, Mikel Luján

This paper presents a computationally efficient implementation of a Hamming code decoder on a graphics processing unit (GPU) to support real-time software-defined radio (SDR), which is a software alternative for realizing wireless…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-12-23 Shohidul Islam, Cheol-Hong Kim, Jong-Myon Kim

Graphics Processing Units (GPUs) support dynamic voltage and frequency scaling (DVFS) in order to balance computational performance and energy consumption. However, there still lacks simple and accurate performance estimation of a given GPU…

Performance · Computer Science 2018-06-14 Qiang Wang, Xiaowen Chu

In this article we discuss our implementation of a polyphase filter for real-time data processing in radio astronomy. We describe in detail our implementation of the polyphase filter algorithm and its behaviour on three generations of…

Instrumentation and Methods for Astrophysics · Physics 2016-04-22 Karel Adámek, Jan Novotný, Wes Armour

In this paper, a fast and practical GPU-based implementation of the Fuzzy C-Means (FCM) clustering algorithm for image segmentation is proposed. First, an extensive analysis is conducted to study the dependency among the image pixels in the…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-03-29 Mishal Almazrooie, Mogana Vadiveloo, Rosni Abdullah

The attention layer, a core component of Transformer-based LLMs, brings out inefficiencies in current GPU systems due to its low operational intensity and the substantial memory requirements of KV caches. We propose a High-bandwidth…

Hardware Architecture · Computer Science 2025-12-19 Myunghyun Rhee, Joonseop Sim, Taeyoung Ahn, Seungyong Lee, Daegun Yoon, Euiseok Kim, Kyoung Park, Youngpyo Joo, Hoshik Kim

In this paper we present an optimized parallel implementation of a flexible MAP decoder for synchronization error correcting codes, supporting a very wide range of code sizes and channel conditions. On mid-range GPUs we demonstrate decoding…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-02-26 Johann A. Briffa

The signal processing firmware that has been developed for the Low Frequency Aperture Array component of the Square Kilometre Array is described. The firmware is implemented on a dual FPGA board, that is capable of processing the streams…

A micromagnetic simulator running on graphics processing unit (GPU) is presented. It achieves significant performance boost as compared to previous central processing unit (CPU) simulators, up to two orders of magnitude for large input…

Computational Engineering, Finance, and Science · Computer Science 2014-11-11 Ru Zhu

With the rapid advances in mobile technology many mobile devices are capable of capturing high quality images and video with their embedded camera. This paper investigates techniques for real-time processing of the resulting images,…

Graphics · Computer Science 2011-12-15 Andrew Ensor, Seth Hall

Convolutional Neural Networks (CNNs) have become a more popular choice for various tasks such as computer vision, speech recognition and natural language processing. Thanks to their large computational capability and throughput, GPUs, which are…

Machine Learning · Computer Science 2018-11-28 Natan Liss, Chaim Baskin, Avi Mendelson, Alex M. Bronstein, Raja Giryes

Study of general purpose computation by GPU (Graphics Processing Unit) can improve the image processing capability of micro-computer system. This paper studies the parallelism of the different stages of decimation in time radix 2 FFT…

Mathematical Software · Computer Science 2015-06-01 Feifei Shen, Zhenjian Song, Congrui Wu, Jiaqi Geng, Qingyun Wang