English
Related papers

Related papers: A Gradient Flow Framework For Analyzing Network Pr…

200 papers

In recent years, deep neural networks have known a wide success in various application domains. However, they require important computational and memory resources, which severely hinders their deployment, notably on mobile devices or for…

Computer Vision and Pattern Recognition · Computer Science 2021-12-16 Nathan Hubens , Matei Mancas , Bernard Gosselin , Marius Preda , Titus Zaharia

Overparameterization has been shown to benefit both the optimization and generalization of neural networks, but large networks are resource hungry at both training and test time. Network pruning can reduce test-time resource requirements,…

Machine Learning · Computer Science 2020-08-10 Chaoqi Wang , Guodong Zhang , Roger Grosse

Pruning techniques are used comprehensively to compress convolutional neural networks (CNNs) on image classification. However, the majority of pruning methods require a well pre-trained model to provide useful supporting parameters, such as…

Computer Vision and Pattern Recognition · Computer Science 2022-08-10 Yiheng Lu , Maoguo Gong , Wei Zhao , Kaiyuan Feng , Hao Li

Existing generalization measures that aim to capture a model's simplicity based on parameter counts or norms fail to explain generalization in overparameterized deep neural networks. In this paper, we introduce a new, theoretically…

Machine Learning · Computer Science 2021-03-11 Lorenz Kuhn , Clare Lyle , Aidan N. Gomez , Jonas Rothfuss , Yarin Gal

The advancement of convolutional neural networks (CNNs) on various vision applications has attracted lots of attention. Yet the majority of CNNs are unable to satisfy the strict requirement for real-world deployment. To overcome this, the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Wei He , Zhongzhan Huang , Mingfu Liang , Senwei Liang , Haizhao Yang

Diffusion Models (DMs) have impressive capabilities among generation models, but are limited to slower inference speeds and higher computational costs. Previous works utilize one-shot structure pruning to derive lightweight DMs from…

Machine Learning · Computer Science 2025-01-17 Ben Wan , Tianyi Zheng , Zhaoyu Chen , Yuxiao Wang , Jia Wang

A typical deep neural network (DNN) has a large number of trainable parameters. Choosing a network with proper capacity is challenging and generally a larger network with excessive capacity is trained. Pruning is an established approach to…

Neural and Evolutionary Computing · Computer Science 2021-03-01 Hojjat Salehinejad , Shahrokh Valaee

Model pruning aims to reduce the deep neural network (DNN) model size or computational overhead. Traditional model pruning methods such as l-1 pruning that evaluates the channel significance for DNN pay too much attention to the local…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Xinyu Liu , Baopu Li , Zhen Chen , Yixuan Yuan

Neural network pruning is a popular technique used to reduce the inference costs of modern, potentially overparameterized, networks. Starting from a pre-trained network, the process is as follows: remove redundant parameters, retrain, and…

Machine Learning · Computer Science 2021-03-05 Lucas Liebenwein , Cenk Baykal , Brandon Carter , David Gifford , Daniela Rus

The optimization of over-parameterized deep neural networks represents a large-scale, high-dimensional, and strongly non-convex decision problem that challenges existing optimization frameworks. Current evolutionary and gradient-based…

Neural and Evolutionary Computing · Computer Science 2026-04-02 Zak Khan , Azam Asilian Bidgoli

The enormous inference cost of deep neural networks can be scaled down by network compression. Pruning is one of the predominant approaches used for deep network compression. However, existing pruning techniques have one or more of the…

Machine Learning · Computer Science 2020-10-13 Sai Aparna Aketi , Sourjya Roy , Anand Raghunathan , Kaushik Roy

By removing parameters from deep neural networks, unstructured pruning methods aim at cutting down memory footprint and computational cost, while maintaining prediction accuracy. In order to tackle this otherwise intractable problem, many…

Machine Learning · Computer Science 2020-06-23 César Laurent , Camille Ballas , Thomas George , Nicolas Ballas , Pascal Vincent

In recent years, the increasing size of deep learning models and their growing demand for computational resources have drawn significant attention to the practice of pruning neural networks, while aiming to preserve their accuracy. In…

Computer Vision and Pattern Recognition · Computer Science 2024-11-11 Lingkai Zhu , Can Deniz Bezek , Orcun Goksel

This paper presents an efficient technique to prune deep and/or wide convolutional neural network models by eliminating redundant features (or filters). Previous studies have shown that over-sized deep neural network models tend to produce…

Computer Vision and Pattern Recognition · Computer Science 2018-02-22 Babajide O. Ayinde , Jacek M. Zurada

Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Sara Elkerdawy , Mostafa Elhoushi , Abhineet Singh , Hong Zhang , Nilanjan Ray

Machine learning optimization often depends on stochastic gradient descent, where the precision of gradient estimation is vital for model performance. Gradients are calculated from mini-batches formed by uniformly selecting data samples…

Machine Learning · Computer Science 2025-01-29 Corentin Salaün , Xingchang Huang , Iliyan Georgiev , Niloy J. Mitra , Gurprit Singh

The current landscape in time-series forecasting is dominated by Transformer-based models. Their high parameter count and corresponding demand in computational resources pose a challenge to real-world deployment, especially for commercial…

Machine Learning · Computer Science 2024-12-18 Nicholas Kiefer , Arvid Weyrauch , Muhammed Öz , Achim Streit , Markus Götz , Charlotte Debus

Network pruning is a widely-used compression technique that is able to significantly scale down overparameterized models with minimal loss of accuracy. This paper shows that pruning may create or exacerbate disparate impacts. The paper…

Machine Learning · Computer Science 2022-10-14 Cuong Tran , Ferdinando Fioretto , Jung-Eun Kim , Rakshit Naidu

Model pruning is an essential procedure for building compact and computationally-efficient machine learning models. A key feature of a good pruning algorithm is that it accurately quantifies the relative importance of the model weights.…

Machine Learning · Computer Science 2020-06-22 Mingchen Li , Yahya Sattar , Christos Thrampoulidis , Samet Oymak

Transformer-based NLP models are trained using hundreds of millions or even billions of parameters, limiting their applicability in computationally constrained environments. While the number of parameters generally correlates with…

Computation and Language · Computer Science 2022-08-16 Hassan Sajjad , Fahim Dalvi , Nadir Durrani , Preslav Nakov
‹ Prev 1 2 3 10 Next ›