Related papers: Speeding Up Mixed-Integer Programming Solvers with…

Hybrid Models for Learning to Branch

A recent Graph Neural Network (GNN) approach for learning to branch has been shown to successfully reduce the running time of branch-and-bound algorithms for Mixed Integer Linear Programming (MILP). While the GNN relies on a GPU for…

Machine Learning · Computer Science 2020-10-26 Prateek Gupta, Maxime Gasse, Elias B. Khalil, M. Pawan Kumar, Andrea Lodi, Yoshua Bengio

Sparse GPU Kernels for Deep Learning

Scientific workloads have traditionally exploited high levels of sparsity to accelerate computation and reduce memory requirements. While deep neural networks can be made sparse, achieving practical speedups on GPUs is difficult because…

Machine Learning · Computer Science 2020-09-02 Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen

A GPU-accelerated Nonlinear Branch-and-Bound Framework for Sparse Linear Models

We study exact sparse linear regression with an $\ell_0-\ell_2$ penalty and develop a branch-and-bound (BnB) algorithm explicitly designed for GPU execution. Starting from a perspective reformulation, we derive an interval relaxation that…

Optimization and Control · Mathematics 2026-02-05 Xiang Meng, Ryan Lucas, Rahul Mazumder

Solving Mixed Integer Programs Using Neural Networks

Mixed Integer Programming (MIP) solvers rely on an array of sophisticated heuristics developed with decades of research to solve large-scale MIP instances encountered in practice. Machine learning offers to automatically construct better…

Optimization and Control · Mathematics 2021-07-30 Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O'Donoghue, Nicolas Sonnerat, Christian Tjandraatmadja, Pengming Wang, Ravichandra Addanki, Tharindi Hapuarachchi, Thomas Keck, James Keeling, Pushmeet Kohli, Ira Ktena, Yujia Li, Oriol Vinyals, Yori Zwols

Load-balanced Gather-scatter Patterns for Sparse Deep Neural Networks

Deep neural networks (DNNs) have been proven to be effective in solving many real-life problems, but their high computation cost prohibits those models from being deployed to edge devices. Pruning, as a method to introduce zeros to model…

Machine Learning · Computer Science 2021-12-22 Fei Sun, Minghai Qin, Tianyun Zhang, Xiaolong Ma, Haoran Li, Junwen Luo, Zihao Zhao, Yen-Kuang Chen, Yuan Xie

Batched Sparse Matrix Multiplication for Accelerating Graph Convolutional Networks

Graph Convolutional Networks (GCNs) are recently getting much attention in bioinformatics and chemoinformatics as a state-of-the-art machine learning approach with high accuracy. GCNs process convolutional operations along with graph…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-28 Yusuke Nagasaka, Akira Nukada, Ryosuke Kojima, Satoshi Matsuoka

Learning for Spatial Branching: An Algorithm Selection Approach

The use of machine learning techniques to improve the performance of branch-and-bound optimization algorithms is a very active area in the context of mixed integer linear problems, but little has been done for non-linear optimization. To…

Optimization and Control · Mathematics 2022-04-25 Bissan Ghaddar, Ignacio Gómez-Casares, Julio González-Díaz, Brais González-Rodríguez, Beatriz Pateiro-López, Sofía Rodríguez-Ballesteros

Optimizing Sparse Matrix Multiplications for Graph Neural Networks

Graph neural networks (GNNs) are emerging as a powerful technique for modeling graph structures. Due to the sparsity of real-world graph data, GNN performance is limited by extensive sparse matrix multiplication (SpMM) operations involved…

Machine Learning · Computer Science 2021-11-02 Shenghao Qiu, You Liang, Zheng Wang

Adaptive Elastic Training for Sparse Deep Learning on Heterogeneous Multi-GPU Servers

Motivated by extreme multi-label classification applications, we consider training deep learning models over sparse data in multi-GPU servers. The variance in the number of non-zero features across training batches and the intrinsic GPU…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-10-15 Yujing Ma, Florin Rusu, Kesheng Wu, Alexander Sim

Sparse Decomposition of Graph Neural Networks

Graph Neural Networks (GNN) exhibit superior performance in graph representation learning, but their inference cost can be high, due to an aggregation operation that can require a memory fetch for a very large number of nodes. This…

Machine Learning · Computer Science 2025-03-18 Yaochen Hu, Mai Zeng, Ge Zhang, Pavel Rumiantsev, Liheng Ma, Yingxue Zhang, Mark Coates

Training Sparse Neural Networks

Deep neural networks with lots of parameters are typically used for large-scale computer vision tasks such as image classification. This is a result of using dense matrix multiplications and convolutions. However, sparse computations are…

Computer Vision and Pattern Recognition · Computer Science 2016-11-22 Suraj Srinivas, Akshayvarun Subramanya, R. Venkatesh Babu

Joint Edge-Model Sparse Learning is Provably Efficient for Graph Neural Networks

Due to the significant computational challenge of training large-scale graph neural networks (GNNs), various sparse learning techniques have been exploited to reduce memory and storage costs. Examples include graph sparsification…

Machine Learning · Computer Science 2023-02-07 Shuai Zhang, Meng Wang, Pin-Yu Chen, Sijia Liu, Songtao Lu, Miao Liu

Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip

Recurrent Neural Networks (RNNs) are powerful tools for solving sequence-based problems, but their efficacy and execution time are dependent on the size of the network. Following recent work in simplifying these networks with model pruning…

Neural and Evolutionary Computing · Computer Science 2018-04-30 Feiwen Zhu, Jeff Pool, Michael Andersch, Jeremy Appleyard, Fung Xie

Branch and Bound in Mixed Integer Linear Programming Problems: A Survey of Techniques and Trends

In this paper, we surveyed the existing literature studying different approaches and algorithms for the four critical components in the general branch and bound (B&B) algorithm, namely, branching variable selection, node selection, node…

Machine Learning · Computer Science 2021-11-12 Lingying Huang, Xiaomeng Chen, Wei Huo, Jiazheng Wang, Fan Zhang, Bo Bai, Ling Shi

Benchmarking GPU and TPU Performance with Graph Neural Networks

Many artificial intelligence (AI) devices have been developed to accelerate the training and inference of neural networks models. The most common ones are the Graphics Processing Unit (GPU) and Tensor Processing Unit (TPU). They are highly…

Machine Learning · Computer Science 2022-10-25 Xiangyang Ju, Yunsong Wang, Daniel Murnane, Nicholas Choma, Steven Farrell, Paolo Calafiura

Accurate, Efficient and Scalable Training of Graph Neural Networks

Graph Neural Networks (GNNs) are powerful deep learning models to generate node embeddings on graphs. When applying deep GNNs on large graphs, it is still challenging to perform training in an efficient and scalable way. We propose a novel…

Machine Learning · Computer Science 2020-10-08 Hanqing Zeng, Hongkuan Zhou, Ajitesh Srivastava, Rajgopal Kannan, Viktor Prasanna

Accelerating Training of Deep Neural Networks via Sparse Edge Processing

We propose a reconfigurable hardware architecture for deep neural networks (DNNs) capable of online training and inference, which uses algorithmically pre-determined, structured sparsity to significantly lower memory and computational…

Neural and Evolutionary Computing · Computer Science 2017-11-07 Sourya Dey, Yinan Shao, Keith M. Chugg, Peter A. Beerel

MGCNN: a learnable multigrid solver for sparse linear systems from PDEs on structured grids

This paper presents a learnable solver tailored to iteratively solve sparse linear systems from discretized partial differential equations (PDEs). Unlike traditional approaches relying on specialized expertise, our solver streamlines the…

Numerical Analysis · Mathematics 2024-05-10 Yan Xie, Minrui Lv, Chensong Zhang

Graph Neural Networks and Applied Linear Algebra

Sparse matrix computations are ubiquitous in scientific computing. With the recent interest in scientific machine learning, it is natural to ask how sparse matrix computations can leverage neural networks (NN). Unfortunately, multi-layer…

Numerical Analysis · Mathematics 2023-10-24 Nicholas S. Moore, Eric C. Cyr, Peter Ohm, Christopher M. Siefert, Raymond S. Tuminaro

Balanced Sparsity for Efficient DNN Inference on GPU

In trained deep neural networks, unstructured pruning can reduce redundant weights to lower storage cost. However, it requires the customization of hardwares to speed up practical inference. Another trend accelerates sparse model inference…

Computer Vision and Pattern Recognition · Computer Science 2020-10-30 Zhuliang Yao, Shijie Cao, Wencong Xiao, Chen Zhang, Lanshun Nie