English
Related papers

Related papers: GraphBLAST: A High-Performance Linear Algebra-base…

200 papers

The analysis of graphs has become increasingly important to a wide range of applications. Graph analysis presents a number of unique challenges in the areas of (1) software complexity, (2) data complexity, (3) security, (4) mathematical…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-09-19 Jeremy Kepner , David Bader , Aydın Buluc , John Gilbert , Timothy Mattson , Henning Meyerhenke

Recent advances in graph processing on FPGAs promise to alleviate performance bottlenecks with irregular memory access patterns. Such bottlenecks challenge performance for a growing number of important application areas like machine…

Hardware Architecture · Computer Science 2022-06-20 Jonas Dann , Daniel Ritter , Holger Fröning

Applications in High-Performance Computing (HPC) environments face challenges due to increasing complexity. Among them, the increasing usage of sparse data pushes the limits of data structures and programming models and hampers the…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-08-26 Alberto Scolari , Albert-Jan Yzelman

Acceleration of graph applications on GPUs has found large interest due to the ubiquitous use of graph processing in various domains. The inherent \textit{irregularity} in graph applications leads to several challenges for parallelization.…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-11-02 Ananya Raval , Rupesh Nasre , Vivek Kumar , Vasudevan R , Sathish Vadhiyar , Keshav Pingali

Basic Linear Algebra Subprograms (BLAS) are a set of low level linear algebra kernels widely adopted by applications involved with the deep learning and scientific computing. The massive and economic computing power brought forth by the…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-10-20 Linnan Wang , Wei Wu , Jianxiong Xiao , Yi Yang

The HPEC Graph Challenge is a collection of benchmarks representing complex workloads that test the hardware and software components of HPC systems, which traditional benchmarks, such as LINPACK, do not. The first benchmark, Subgraph…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-09-05 Siddharth Samsi , Dan Campbell , Emanuel Scoullos , Oded Green

The GraphBLAS high performance library standard has yielded capabilities beyond enabling graph algorithms to be readily expressed in the language of linear algebra. These GraphBLAS capabilities enable new performant ways of thinking about…

Data Structures and Algorithms · Computer Science 2025-09-24 Hayden Jananthan , Jeremy Kepner , Michael Jones , Vijay Gadepally , Michael Houle , Peter Michaleas , Chasen Milner , Alex Pentland

Designing flexible graph kernels that can run well on various platforms is a crucial research problem due to the frequent usage of graphs for modeling data and recent architectural advances and variety. In this work, we propose a novel…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-09-13 Abdurrahman Yasar , Sivasankaran Rajamanickam , Jonathan W. Berry , Umit V. Catalyurek

Graph algorithms can be expressed in terms of linear algebra. GraphBLAS is a library of low-level building blocks for such algorithms that targets algorithm developers. LAGraph builds on top of the GraphBLAS to target users of graph…

Mathematical Software · Computer Science 2021-04-06 Gábor Szárnyas , David A. Bader , Timothy A. Davis , James Kitchen , Timothy G. Mattson , Scott McMillan , Erik Welch

Load-balancing among the threads of a GPU for graph analytics workloads is difficult because of the irregular nature of graph applications and the high variability in vertex degrees, particularly in power-law graphs. We describe a novel…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-28 Vishwesh Jatala , Loc Hoang , Roshan Dathathri , Gurbinder Gill , V Krishna Nandivada , Keshav Pingali

Graph processing systems are essential for analyzing large-scale data with complex relationships, yet most existing frameworks rely on statically provisioned clusters, resulting in poor elasticity and inefficient resource utilization under…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-13 Chen Zhao , Parsa Poorsistani , Mohammad Goudarzi , Tawfiq Islam , Adel N. Toosi

The GraphBLAS standard (GraphBlas.org) is being developed to bring the potential of matrix based graph algorithms to the broadest possible audience. Mathematically the Graph- BLAS defines a core set of matrix-based graph operations that can…

For large-scale graph analytics on the GPU, the irregularity of data access and control flow, and the complexity of programming GPUs, have presented two significant challenges to developing a programmable high-performance graph library.…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-01-06 Yangzihao Wang , Yuechao Pan , Andrew Davidson , Yuduo Wu , Carl Yang , Leyuan Wang , Muhammad Osama , Chenshan Yuan , Weitang Liu , Andy T. Riffel , John D. Owens

In a general graph data structure like an adjacency matrix, when edges are homogeneous, the connectivity of two nodes can be sufficiently represented using a single bit. This insight has, however, not yet been adequately exploited by the…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-23 Jou-An Chen , Hsin-Hsuan Sung , Xipeng Shen , Nathan Tallent , Kevin Barker , Ang Li

Graph processing at scale presents many challenges, including the irregular structure of graphs, the latency-bound nature of graph algorithms, and the overhead associated with distributed execution. While existing frameworks such as Spark…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-03-06 Karame Mohammadiporshokooh , Panagiotis Syskakis , Andrew Lumsdaine , Hartmut Kaiser

Efficient Graph processing is challenging because of the irregularity of graph algorithms. Using GPUs to accelerate irregular graph algorithms is even more difficult to be efficient, since GPU's highly structured SIMT architecture is not a…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-04-05 Xuhao Chen

In this paper, we explore the limits of graphics processors (GPUs) for general purpose parallel computing by studying problems that require highly irregular data access patterns: parallel graph algorithms for list ranking and connected…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-02-25 Frank Dehne , Kumanan Yogaratnam

Given the growing importance of large-scale graph analytics, there is a need to improve the performance of graph analysis frameworks without compromising on productivity. GraphMat is our solution to bridge this gap between a user-friendly…

Subgraph counting aims to count the occurrences of a subgraph template T in a given network G. The basic problem of computing structural properties such as counting triangles and other subgraphs has found applications in diverse domains.…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-12 Langshi Chen , Jiayu Li , Ariful Azad , Lei Jiang , Madhav Marathe , Anil Vullikanti , Andrey Nikolaev , Egor Smirnov , Ruslan Israfilov , Judy Qiu

Graph analytics are vital in fields such as social networks, biomedical research, and graph neural networks (GNNs). However, traditional CPUs and GPUs struggle with the memory bottlenecks caused by large graph datasets and their…

Hardware Architecture · Computer Science 2024-11-25 Oluwole Jaiyeoba , Abdullah T. Mughrabi , Morteza Baradaran , Beenish Gul , Kevin Skadron
‹ Prev 1 2 3 10 Next ›