Distributed, Parallel, and Cluster Computing · Computer Science
A Tool for Automatically Suggesting Source-Code Optimizations for Complex GPU Kernels
Saeed Taheri, Apan Qasem, Martin Burtscher
2019-10-18
Distributed, Parallel, and Cluster Computing · Computer Science
Accelerating Matrix Multiplication: A Performance Comparison Between Multi-Core CPU and GPU
Mufakir Qamar Ansari, Mudabir Qamar Ansari
2025-07-30
Distributed, Parallel, and Cluster Computing · Computer Science
A Preliminary Study on Accelerating Simulation Optimization with GPU Implementation
Jinghai He, Haoyu Liu, Yuhang Wu, Zeyu Zheng +1
2024-04-19
Distributed, Parallel, and Cluster Computing · Computer Science
Improving the performance of the linear systems solvers using CUDA
Bogdan Oancea, Tudorel Andrei, Raluca Mariana Dragoescu
2015-11-24
Hardware Architecture · Computer Science
Understanding Training Efficiency of Deep Learning Recommendation Models at Scale
Bilge Acun, Matthew Murphy, Xiaodong Wang, Jade Nie +2
2020-11-12
Optimization and Control · Mathematics
GPU-Accelerated Primal Heuristics for Mixed Integer Programming
Akif Çördük, Piotr Sielski, Alice Boucher, Kumar Aatish
2025-10-31
Distributed, Parallel, and Cluster Computing · Computer Science
Analysis of GPU Parallel Computing based on Matlab
Mingzhe Wang, Bo Wang, Qiu He, Xiuxiu Liu +1
2015-05-26
Instrumentation and Methods for Astrophysics · Physics
Application of GPUs for the Calculation of Two Point Correlation Functions in Cosmology
Rafael Ponce, Miguel Cardenas-Montes, Juan Jose Rodriguez-Vazquez, Eusebio Sanchez +1
2012-05-01
Performance · Computer Science
GPA: A GPU Performance Advisor Based on Instruction Sampling
Keren Zhou, Xiaozhu Meng, Ryuichi Sai, John Mellor-Crummey
2020-11-25
Distributed, Parallel, and Cluster Computing · Computer Science
A Comparison of Support Vector Machines Training GPU-Accelerated Open Source Implementations
Jan Vanek, Josef Michalek, Josef Psutka
2017-07-21
Distributed, Parallel, and Cluster Computing · Computer Science
GPU-Accelerated Algorithms for Process Mapping
Petr Samoldekin, Christian Schulz, Henning Woydt
2026-03-16
Computational Physics · Physics
GPU coprocessors as a service for deep learning inference in high energy physics
Jeffrey Krupa, Kelvin Lin, Maria Acosta Flechas, Jack Dinsmore +12
2021-04-26
Computational Physics · Physics
Enabling Multireference Calculations on Multi-Metallic Systems with Graphic Processing Units
Valay Agarawal, Rishu Khurana, Cong Liu, Matthew R. Hermes +2
2025-05-08
Distributed, Parallel, and Cluster Computing · Computer Science
GPU accelerated maximum cardinality matching algorithms for bipartite graphs
Mehmet Deveci, Kamer Kaya, Bora Ucar, Umit V. Catalyurek
2013-03-07
Distributed, Parallel, and Cluster Computing · Computer Science
Large Scale Artificial Neural Network Training Using Multi-GPUs
Linnan Wang, Wei Wu, Jianxiong Xiao, Yang Yi
2015-11-16