English
Related papers

Related papers: Data-Efficient Structured Pruning via Submodular O…

200 papers

Model compression is crucial for deployment of neural networks on devices with limited computational and memory resources. Many different methods show comparable accuracy of the compressed model and similar compression rates. However, the…

Machine Learning · Computer Science 2020-08-21 Ben Mussay , Daniel Feldman , Samson Zhou , Vladimir Braverman , Margarita Osadchy

Structured pruning is a popular method to reduce the cost of convolutional neural networks, that are the state of the art in many computer vision tasks. However, depending on the architecture, pruning introduces dimensional discrepancies…

Neural and Evolutionary Computing · Computer Science 2022-12-13 Hugo Tessier , Vincent Gripon , Mathieu Léonardon , Matthieu Arzel , David Bertrand , Thomas Hannagan

Pruning methods have shown to be effective at reducing the size of deep neural networks while keeping accuracy almost intact. Among the most effective methods are those that prune a network while training it with a sparsity prior loss and…

Neural and Evolutionary Computing · Computer Science 2019-12-20 Carl Lemaire , Andrew Achkar , Pierre-Marc Jodoin

When deploying pre-trained neural network models in real-world applications, model consumers often encounter resource-constraint platforms such as mobile and smart devices. They typically use the pruning technique to reduce the size and…

Machine Learning · Computer Science 2025-06-19 Mark Huasong Meng , Guangdong Bai , Sin Gee Teo , Jin Song Dong

Structured pruning is an effective compression technique to reduce the computation of neural networks, which is usually achieved by adding perturbations to reduce network parameters at the cost of slightly increasing training loss. A more…

Machine Learning · Computer Science 2021-10-22 Yinchuan Li , Xiaofeng Liu , Yunfeng Shao , Qing Wang , Yanhui Geng

How to develop slim and accurate deep neural networks has become crucial for real- world applications, especially for those employed in embedded systems. Though previous work along this research line has shown some promising results, most…

Neural and Evolutionary Computing · Computer Science 2019-10-02 Xin Dong , Shangyu Chen , Sinno Jialin Pan

Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of…

Machine Learning · Statistics 2019-06-10 Elliot J. Crowley , Jack Turner , Amos Storkey , Michael O'Boyle

Recurrent neural networks (RNNs) have recently achieved remarkable successes in a number of applications. However, the huge sizes and computational burden of these models make it difficult for their deployment on edge devices. A practically…

Machine Learning · Computer Science 2019-12-10 Liangjian Wen , Xuanyang Zhang , Haoli Bai , Zenglin Xu

Pruning is a compression method which aims to improve the efficiency of neural networks by reducing their number of parameters while maintaining a good performance, thus enhancing the performance-to-cost ratio in nontrivial ways. Of…

Neural and Evolutionary Computing · Computer Science 2023-09-25 Hugo Tessier , Ghouti Boukli Hacene , Vincent Gripon

Existing high-performance deep learning models require very intensive computing. For this reason, it is difficult to embed a deep learning model into a system with limited resources. In this paper, we propose the novel idea of the network…

Machine Learning · Computer Science 2019-02-13 Dae-Woong Jeong , Jaehun Kim , Youngseok Kim , Tae-Ho Kim , Myungsu Chae

State-of-the-art neural networks are getting deeper and wider. While their performance increases with the increasing number of layers and neurons, it is crucial to design an efficient deep architecture in order to reduce computational and…

Neural and Evolutionary Computing · Computer Science 2016-07-13 Hengyuan Hu , Rui Peng , Yu-Wing Tai , Chi-Keung Tang

Network pruning reduces the size of neural networks by removing (pruning) neurons such that the performance drop is minimal. Traditional pruning approaches focus on designing metrics to quantify the usefulness of a neuron which is often…

Computer Vision and Pattern Recognition · Computer Science 2021-11-01 Shehryar Malik , Muhammad Umair Haider , Omer Iqbal , Murtaza Taj

Network compression is crucial to making the deep networks to be more efficient, faster, and generalizable to low-end hardware. Current network compression methods have two open problems: first, there lacks a theoretical framework to…

Machine Learning · Computer Science 2022-06-09 Ziqi Zhou , Li Lian , Yilong Yin , Ze Wang

Real time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular…

Neural and Evolutionary Computing · Computer Science 2015-12-31 Sajid Anwar , Kyuyeon Hwang , Wonyong Sung

Network pruning is an important research field aiming at reducing computational costs of neural networks. Conventional approaches follow a fixed paradigm which first trains a large and redundant network, and then determines which units…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Yulong Wang , Xiaolu Zhang , Lingxi Xie , Jun Zhou , Hang Su , Bo Zhang , Xiaolin Hu

Structured pruning efficiently compresses networks by identifying and removing unimportant neurons. While this can be elegantly achieved by applying sparsity-inducing regularisation on BatchNorm parameters, an L1 penalty would shrink all…

Machine Learning · Computer Science 2022-04-14 Mihai Suteu , Yike Guo

Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges…

Machine Learning · Computer Science 2024-10-22 Mostafa Hussien , Mahmoud Afifi , Kim Khoa Nguyen , Mohamed Cheriet

Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods are already…

Machine Learning · Computer Science 2023-04-17 Siddhant Garg , Lijun Zhang , Hui Guan

Neural network compression has gained increasing attention in recent years, particularly in computer vision applications, where the need for model reduction is crucial for overcoming deployment constraints. Pruning is a widely used…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Baptiste Bauvin , Loïc Baret , Ola Ahmad

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large amount of parameters, which makes…

Machine Learning · Computer Science 2022-06-29 Matteo Cacciola , Antonio Frangioni , Xinlin Li , Andrea Lodi
‹ Prev 1 2 3 10 Next ›