Related papers: GraphMend: Code Transformations for Fixing Graph B…

200 papers

Machine learning (ML) workloads launch hundreds to thousands of short-running GPU kernels per iteration. With GPU compute throughput growing rapidly, CPU-side launch latency of kernels is emerging as a bottleneck. CUDA Graphs promise to…

Machine Learning · Computer Science 2025-12-24 Abhishek Ghosh, Ajay Nayak, Ashish Panwar, Arkaprava Basu

Static and dynamic computational graphs represent two distinct approaches to constructing deep learning frameworks. The former prioritizes compiler-based optimizations, while the latter focuses on programmability and user-friendliness. The…

Software Engineering · Computer Science 2023-11-01 Qidong Su, Chuqin Geng, Gennady Pekhimenko, Xujie Si

PyG (PyTorch Geometric) has evolved significantly since its initial release, establishing itself as a leading framework for Graph Neural Networks. In this paper, we present PyG 2.0 (and its subsequent minor versions), a comprehensive update…

Graph Neural Networks (GNNs) present a fundamental hardware challenge by fusing irregular, memory-bound graph traversals with regular, compute-intensive dense matrix operations. While frameworks such as PyTorch Geometric (PyG) and Deep…

Machine Learning · Computer Science 2026-05-27 Anubhab, Rupesh Nasre

Graph Transformer is a new architecture that surpasses GNNs in graph learning. While there emerge inspiring algorithm advancements, their practical adoption is still limited, particularly on real-world graphs involving up to millions of…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-07-22 Meng Zhang, Jie Sun, Qinghao Hu, Peng Sun, Zeke Wang, Yonggang Wen, Tianwei Zhang

We present GraphTensor, a comprehensive open-source framework that supports efficient parallel neural network processing on large graphs. GraphTensor offers a set of easy-to-use programming primitives that appreciate both graph and neural…

Hardware Architecture · Computer Science 2023-05-30 Junhyeok Jang, Miryeong Kwon, Donghyun Gouk, Hanyeoreum Bae, Myoungsoo Jung

We present GRAFX, an open-source library designed for handling audio processing graphs in PyTorch. Along with various library functionalities, we describe technical details on the efficient parallel computation of input graphs, signals, and…

Graph mining for structural patterns is a fundamental task in many applications. Compilation-based graph mining systems, represented by AutoMine, generate specialized algorithms for the provided patterns and substantially outperform other…

Performance · Computer Science 2019-12-02 Daniel Mawhirter, Sam Reinehr, Connor Holmes, Tongping Liu, Bo Wu

We introduce PyTorch Geometric, a library for deep learning on irregularly structured input data such as graphs, point clouds and manifolds, built upon PyTorch. In addition to general graph data structures and processing methods, it…

Machine Learning · Computer Science 2019-04-26 Matthias Fey, Jan Eric Lenssen

Given the growing importance of large-scale graph analytics, there is a need to improve the performance of graph analysis frameworks without compromising on productivity. GraphMat is our solution to bridge this gap between a user-friendly…

Complex operational workflows coordinating personnel, tools, and information are central to system operations, yet end-to-end automation remains challenging due to extensive human input requirements and limited ability to adapt over time.…

Graph transformers have gained popularity in various graph-based tasks by addressing challenges faced by traditional Graph Neural Networks. However, the quadratic complexity of self-attention operations and the extensive layering in graph…

Machine Learning · Computer Science 2023-09-20 Reza Shirkavand, Heng Huang

Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, including multi-step reasoning such as mathematical proving. However, existing approaches often lack an explicit and…

Computation and Language · Computer Science 2026-05-19 Yutong Li, Yitian Zhou, Xudong Wang, GuoChen, Caiyan Qin

Diffusion models have excelled in generating natural images and are now being adapted to a variety of data types, including graphs. However, conventional models often rely on Gaussian or categorical diffusion processes, which can struggle…

Machine Learning · Computer Science 2024-10-08 Xinyang Liu, Yilin He, Bo Chen, Mingyuan Zhou

The Design2Code problem, which involves converting digital designs into functional source code, is a significant challenge in software development due to its complexity and time-consuming nature. Traditional approaches often struggle with…

Machine Learning · Computer Science 2025-04-29 Tung D. Vu, Chung Hoang, Truong-Son Hy

Graph analytics elicits insights from large graphs to inform critical decisions for business, safety and security. Several large-scale graph processing frameworks feature efficient runtime systems; however, they often provide programming…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-05-29 Farzin Houshmand, Mohsen Lesani, Keval Vora

Modern deep learning frameworks provide imperative, eager execution programming interfaces embedded in Python to provide a productive development experience. However, deep learning practitioners sometimes need to capture and transform…

Machine Learning · Computer Science 2022-03-08 James K. Reed, Zachary DeVito, Horace He, Ansley Ussery, Jason Ansel

Sparse convolution plays a pivotal role in emerging workloads, including point cloud processing in AR/VR, autonomous driving, and graph understanding in recommendation systems. Since the computation pattern is sparse and irregular,…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-11-23 Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han

Robots operating in unstructured environments often require accurate and consistent object-level representations. This typically requires segmenting individual objects from the robot's surroundings. While recent large models such as Segment…

Robotics · Computer Science 2025-04-07 Haozhan Tang, Tianyi Zhang, Oliver Kroemer, Matthew Johnson-Roberson, Weiming Zhi

As deep learning models scale, their training cost has surged significantly. Due to both hardware advancements and limitations in current software stacks, the need for data efficiency has risen. Data efficiency refers to the effective…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-09 Kun Wu