Related papers: Fast Sparse ConvNets

SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity

To address the challenge of increasing network size, researchers have developed sparse models through network pruning. However, maintaining model accuracy while achieving significant speedups on general computing devices remains an open…

Artificial Intelligence · Computer Science 2023-10-31 Haitao Xu, Songwei Liu, Yuyang Xu, Shuai Wang, Jiashi Li, Chenqian Yan, Liangqiang Li, Lean Fu, Xin Pan, Fangmin Chen

Pre-defined Sparsity for Low-Complexity Convolutional Neural Networks

The high energy cost of processing deep convolutional neural networks impedes their ubiquitous deployment in energy-constrained platforms such as embedded systems and IoT devices. This work introduces convolutional layers with pre-defined…

Computer Vision and Pattern Recognition · Computer Science 2020-02-06 Souvik Kundu, Mahdi Nazemi, Massoud Pedram, Keith M. Chugg, Peter A. Beerel

Dynamic ConvNets on Tiny Devices via Nested Sparsity

This work introduces a new training and compression pipeline to build Nested Sparse ConvNets, a class of dynamic Convolutional Neural Networks (ConvNets) suited for inference tasks deployed on resource-constrained devices at the edge of the…

Machine Learning · Computer Science 2022-03-08 Matteo Grimaldi, Luca Mocerino, Antonio Cipolletta, Andrea Calimera

Faster CNNs with Direct Sparse Convolutions and Guided Pruning

Phenomenally successful in practical inference problems, convolutional neural networks (CNN) are widely deployed in mobile devices, data centers, and even supercomputers. The number of parameters needed in CNNs, however, are often large and…

Computer Vision and Pattern Recognition · Computer Science 2017-08-01 Jongsoo Park, Sheng Li, Wei Wen, Ping Tak Peter Tang, Hai Li, Yiran Chen, Pradeep Dubey

Sparse GPU Kernels for Deep Learning

Scientific workloads have traditionally exploited high levels of sparsity to accelerate computation and reduce memory requirements. While deep neural networks can be made sparse, achieving practical speedups on GPUs is difficult because…

Machine Learning · Computer Science 2020-09-02 Trevor Gale, Matei Zaharia, Cliff Young, Erich Elsen

FSCNN: A Fast Sparse Convolution Neural Network Inference System

Convolution neural networks (CNNs) have achieved remarkable success, but typically accompany high computation cost and numerous redundant weight parameters. To reduce the FLOPs, structure pruning is a popular approach to remove the entire…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Bo Ji, Tianyi Chen

DASS: Differentiable Architecture Search for Sparse neural networks

The deployment of Deep Neural Networks (DNNs) on edge devices is hindered by the substantial gap between performance requirements and available processing power. While recent research has made significant strides in developing pruning…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Hamid Mousavi, Mohammad Loni, Mina Alibeigi, Masoud Daneshtalab

QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures

We present QuickNet, a fast and accurate network architecture that is both faster and significantly more accurate than other fast deep architectures like SqueezeNet. Furthermore, it uses less parameters than previous networks, making it…

Machine Learning · Computer Science 2017-01-13 Tapabrata Ghosh

Penetrating the Fog: the Path to Efficient CNN Models

With the increasing demand to deploy convolutional neural networks (CNNs) on mobile platforms, the sparse kernel approach was proposed, which could save more parameters than the standard convolution while maintaining accuracy. However,…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Kun Wan, Boyuan Feng, Shu Yang, Yufei Ding

SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity

Contemporary Deep Neural Network (DNN) contains millions of synaptic connections with tens to hundreds of layers. The large computation and memory requirements pose a challenge to the hardware design. In this work, we leverage the intrinsic…

Machine Learning · Computer Science 2017-11-07 Jingyang Zhu, Jingbo Jiang, Xizi Chen, Chi-Ying Tsui

Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting

Deep neural networks (DNNs) have shown to provide superb performance in many real life applications, but their large computation cost and storage requirement have prevented them from being deployed to many edge and internet-of-things (IoT)…

Neural and Evolutionary Computing · Computer Science 2021-12-22 Minghai Qin, Tianyun Zhang, Fei Sun, Yen-Kuang Chen, Makan Fardad, Yanzhi Wang, Yuan Xie

Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet

Major winning Convolutional Neural Networks (CNNs), such as VGGNet, ResNet, DenseNet, etc., include tens to hundreds of millions of parameters, which impose considerable computation and memory overheads. This limits their practical usage in…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Seyyed Hossein Hasanpour, Mohammad Rouhani, Mohsen Fayyaz, Mohammad Sabokrou, Ehsan Adeli

Accelerating Sparse Deep Neural Networks

As neural network model sizes have dramatically increased, so has the interest in various techniques to reduce their parameter counts and accelerate their execution. An active area of research in this field is sparsity - encouraging zero…

Machine Learning · Computer Science 2021-04-20 Asit Mishra, Jorge Albericio Latorre, Jeff Pool, Darko Stosic, Dusan Stosic, Ganesh Venkatesh, Chong Yu, Paulius Micikevicius

Efficient Fusion of Sparse and Complementary Convolutions

We propose a new method to create compact convolutional neural networks (CNNs) by exploiting sparse convolutions. Different from previous works that learn sparsity in models, we directly employ hand-crafted kernels with regular sparse…

Computer Vision and Pattern Recognition · Computer Science 2018-09-12 Chun-Fu Chen, Quanfu Fan, Marco Pistoia, Gwo Giun Lee

Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators

Nowadays, increasingly larger Deep Neural Networks (DNNs) are being developed, trained, and utilized. These networks require significant computational resources, putting a strain on both advanced and limited devices. Our solution is to…

Machine Learning · Computer Science 2024-07-16 Paolo D'Alberto, Taehee Jeong, Akshai Jain, Shreyas Manjunath, Mrinal Sarmah, Samuel Hsu, Yaswanth Raparti, Nitesh Pipralia

SparseDNN: Fast Sparse Deep Learning Inference on CPUs

The last few years have seen gigantic leaps in algorithms and systems to support efficient deep learning inference. Pruning and quantization algorithms can now consistently compress neural networks by an order of magnitude. For a compressed…

Machine Learning · Computer Science 2021-07-22 Ziheng Wang

CondenseNet: An Efficient DenseNet using Learned Group Convolutions

Deep neural networks are increasingly used on mobile devices, where computational resources are limited. In this paper we develop CondenseNet, a novel network architecture with unprecedented efficiency. It combines dense connectivity with a…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Gao Huang, Shichen Liu, Laurens van der Maaten, Kilian Q. Weinberger

Sparsely Aggregated Convolutional Networks

We explore a key architectural aspect of deep convolutional neural networks: the pattern of internal skip connections used to aggregate outputs of earlier layers for consumption by deeper layers. Such aggregation is critical to facilitate…

Computer Vision and Pattern Recognition · Computer Science 2019-02-08 Ligeng Zhu, Ruizhi Deng, Michael Maire, Zhiwei Deng, Greg Mori, Ping Tan

Exploring the Regularity of Sparse Structure in Convolutional Neural Networks

Sparsity helps reduce the computational complexity of deep neural networks by skipping zeros. Taking advantage of sparsity is listed as a high priority in next generation DNN accelerators such as TPU. The structure of sparsity, i.e., the…

Machine Learning · Computer Science 2017-06-06 Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, Xingyu Liu, Yu Wang, William J. Dally

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

The growing energy and performance costs of deep learning have driven the community to reduce the size of neural networks by selectively pruning components. Similarly to their biological counterparts, sparse networks generalize just as…

Machine Learning · Computer Science 2021-02-02 Torsten Hoefler, Dan Alistarh, Tal Ben-Nun, Nikoli Dryden, Alexandra Peste