English
Related papers

Related papers: Soft Threshold Weight Reparameterization for Learn…

200 papers

We address the problem of reconstructing sparse signals from noisy and compressive measurements using a feed-forward deep neural network (DNN) with an architecture motivated by the iterative shrinkage-thresholding algorithm (ISTA). We…

Machine Learning · Computer Science 2017-05-23 Debabrata Mahapatra , Subhadip Mukherjee , Chandra Sekhar Seelamantula

Sparse Neural Networks (SNNs) have emerged as powerful tools for efficient feature selection. Leveraging the dynamic sparse training (DST) algorithms within SNNs has demonstrated promising feature selection capabilities while drastically…

Deep neural networks (DNNs) are used in many applications, but their large size and high computational cost make them hard to run on devices with limited resources. Two widely used techniques to address this challenge are weight…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Dan Liu , Nikita Dvornik , Xue Liu

Deep neural networks (DNNs) have shown to provide superb performance in many real life applications, but their large computation cost and storage requirement have prevented them from being deployed to many edge and internet-of-things (IoT)…

Neural and Evolutionary Computing · Computer Science 2021-12-22 Minghai Qin , Tianyun Zhang , Fei Sun , Yen-Kuang Chen , Makan Fardad , Yanzhi Wang , Yuan Xie

Turning the weights to zero when training a neural network helps in reducing the computational complexity at inference. To progressively increase the sparsity ratio in the network without causing sharp weight discontinuities during…

Computer Vision and Pattern Recognition · Computer Science 2023-01-25 Antoine Vanderschueren , Christophe De Vleeschouwer

The most common method for DNN pruning is hard thresholding of network weights, followed by retraining to recover any lost accuracy. Recently developed smart pruning algorithms use the DNN response over the training set for a variety of…

Machine Learning · Computer Science 2019-05-23 Konstantinos Pitas , Mike Davies , Pierre Vandergheynst

This paper addresses the topic of sparsifying deep neural networks (DNN's). While DNN's are powerful models that achieve state-of-the-art performance on a large number of tasks, the large number of model parameters poses serious storage and…

Machine Learning · Computer Science 2018-02-07 Igor Fedorov , Bhaskar D. Rao

The compression of deep neural networks (DNNs) to reduce inference cost becomes increasingly important to meet realistic deployment requirements of various applications. There have been a significant amount of work regarding network…

Machine Learning · Computer Science 2020-11-12 Tianyi Chen , Bo Ji , Yixin Shi , Tianyu Ding , Biyi Fang , Sheng Yi , Xiao Tu

Sparse neural networks have been widely applied to reduce the computational demands of training and deploying over-parameterized deep neural networks. For inference acceleration, methods that discover a sparse network from a pre-trained…

Machine Learning · Computer Science 2021-06-16 Shiwei Liu , Decebal Constantin Mocanu , Yulong Pei , Mykola Pechenizkiy

The success of DNN pruning has led to the development of energy-efficient inference accelerators that support pruned models with sparse weight and activation tensors. Because the memory layouts and dataflows in these architectures are…

Neural and Evolutionary Computing · Computer Science 2020-09-24 Dingqing Yang , Amin Ghasemazar , Xiaowei Ren , Maximilian Golub , Guy Lemieux , Mieszko Lis

As real-world graphs expand in size, larger GNN models with billions of parameters are deployed. High parameter count in such models makes training and inference on graphs expensive and challenging. To reduce the computational and memory…

Machine Learning · Computer Science 2023-02-27 Hongwu Peng , Deniz Gurevin , Shaoyi Huang , Tong Geng , Weiwen Jiang , Omer Khan , Caiwen Ding

Effectively scaling up deep reinforcement learning models has proven notoriously difficult due to network pathologies during training, motivating various targeted interventions such as periodic reset and architectural advances such as layer…

Machine Learning · Computer Science 2025-06-23 Guozheng Ma , Lu Li , Zilin Wang , Li Shen , Pierre-Luc Bacon , Dacheng Tao

Deep neural nets (DNNs) compression is crucial for adaptation to mobile devices. Though many successful algorithms exist to compress naturally trained DNNs, developing efficient and stable compression algorithms for robustly trained DNNs…

Machine Learning · Computer Science 2020-03-03 Thu Dinh , Bao Wang , Andrea L. Bertozzi , Stanley J. Osher

Pruning is a model compression method that removes redundant parameters in deep neural networks (DNNs) while maintaining accuracy. Most available filter pruning methods require complex treatments such as iterative pruning, features…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Yue Wu , Yuan Lan , Luchan Zhang , Yang Xiang

Sparse coding strategies have been lauded for their parsimonious representations of data that leverage low dimensional structure. However, inference of these codes typically relies on an optimization procedure with poor computational…

Machine Learning · Computer Science 2022-09-02 Kion Fallah , Christopher J. Rozell

High demand for computation resources severely hinders deployment of large-scale Deep Neural Networks (DNN) in resource constrained devices. In this work, we propose a Structured Sparsity Learning (SSL) method to regularize the structures…

Neural and Evolutionary Computing · Computer Science 2016-10-19 Wei Wen , Chunpeng Wu , Yandan Wang , Yiran Chen , Hai Li

Recurrent Neural Networks (RNNs) are used in state-of-the-art models in domains such as speech recognition, machine translation, and language modelling. Sparsity is a technique to reduce compute and memory requirements of deep learning…

Machine Learning · Computer Science 2017-11-09 Sharan Narang , Eric Undersander , Gregory Diamos

Sparsity helps reduce the computational complexity of deep neural networks by skipping zeros. Taking advantage of sparsity is listed as a high priority in next generation DNN accelerators such as TPU. The structure of sparsity, i.e., the…

Machine Learning · Computer Science 2017-06-06 Huizi Mao , Song Han , Jeff Pool , Wenshuo Li , Xingyu Liu , Yu Wang , William J. Dally

Convolution neural networks (CNNs) have achieved remarkable success, but typically accompany high computation cost and numerous redundant weight parameters. To reduce the FLOPs, structure pruning is a popular approach to remove the entire…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Bo Ji , Tianyi Chen

Recurrent Neural Networks (RNN) are widely used to solve a variety of problems and as the quantity of data and the amount of available compute have increased, so have model sizes. The number of parameters in recent state-of-the-art networks…

Machine Learning · Computer Science 2017-11-08 Sharan Narang , Erich Elsen , Gregory Diamos , Shubho Sengupta
‹ Prev 1 2 3 10 Next ›