Distributed, Parallel, and Cluster Computing · Computer Science
High-Throughput and Memory-Efficient Parallel Viterbi Decoder for Convolutional Codes on GPU
Alireza Mohammadidoost, Matin Hashemi
2020-11-19
Distributed, Parallel, and Cluster Computing · Computer Science
Analyzing GPU Tensor Core Potential for Fast Reductions
Roberto Carrasco, Raimundo Vega, Cristóbal A. Navarro
2019-03-12
Distributed, Parallel, and Cluster Computing · Computer Science
A Gb/s Parallel Block-based Viterbi Decoder for Convolutional Codes on GPU
Hao Peng, Rongke Liu, Yi Hou, Ling Zhao
2016-08-02
Computation and Language · Computer Science
GPU-Accelerated Viterbi Exact Lattice Decoder for Batched Online and Offline Speech Recognition
Hugo Braun, Justin Luitjens, Ryan Leary, Tim Kaldewey +1
2020-02-17
Distributed, Parallel, and Cluster Computing · Computer Science
The Tensor-Core Beamformer: A High-Speed Signal-Processing Library for Multidisciplinary Use
Leon Oostrum, Bram Veenboer, Ronald Rook, Michael Brown +2
2025-05-07
Strongly Correlated Electrons · Physics
Reducing the Computational Cost Scaling of Tensor Network Algorithms via Field-Programmable Gate Array Parallelism
Songtai Lv, Yang Liang, Rui Zhu, Qibin Zheng +1
2026-02-06
Distributed, Parallel, and Cluster Computing · Computer Science
PAGANI: A Parallel Adaptive GPU Algorithm for Numerical
Ioannis Sakiotis, Kamesh Arumugam, Marc Paterno, Desh Ranjan +2
2021-06-24
Hardware Architecture · Computer Science
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors
Wei Sun, Ang Li, Tong Geng, Sander Stuijk +1
2022-11-29
Distributed, Parallel, and Cluster Computing · Computer Science
A Variant of Concurrent Constraint Programming on GPU
Pierre Talbot, Frédéric Pinel, Pascal Bouvry
2022-07-26
Distributed, Parallel, and Cluster Computing · Computer Science
On the performance of various parallel GMRES implementations on CPU and GPU clusters
E. I. Ioannidis, N. Cheimarios, A. N. Spyropoulos, A. G. Boudouvis
2019-06-11
Distributed, Parallel, and Cluster Computing · Computer Science
Revisiting Huffman Coding: Toward Extreme Performance on Modern GPU Architectures
Jiannan Tian, Cody Rivera, Sheng Di, Jieyang Chen +3
2021-03-02
Distributed, Parallel, and Cluster Computing · Computer Science
GPU Tensor Cores for fast Arithmetic Reductions
Cristóbal A. Navarro, Roberto Carrasco, Ricardo J. Barrientos, Javier A. Riquelme +1
2020-01-17
Distributed, Parallel, and Cluster Computing · Computer Science
VDCores: Resource Decoupled Programming and Execution for Asynchronous GPU
Zijian He, Adrian Sampson, Yiying Zhang, Zhiyuan Guo
2026-05-06
Distributed, Parallel, and Cluster Computing · Computer Science
Can Tensor Cores Benefit Memory-Bound Kernels? (No!)
Lingqi Zhang, Jiajun Huang, Sheng Di, Satoshi Matsuoka +1
2025-03-04
Distributed, Parallel, and Cluster Computing · Computer Science
Efficient and High-quality Sparse Graph Coloring on the GPU
Xuhao Chen, Pingfan Li, Jianbin Fang, Tao Tang +2
2020-01-22