Related papers: Structured Pattern Pruning Using Regularization

200 papers

In Machine Learning, Artificial Neural Networks (ANNs) are a very powerful tool, broadly used in many applications. Often, the selected (deep) architectures include many layers, and therefore a large number of parameters, which makes…

Machine Learning · Computer Science 2022-06-29 Matteo Cacciola, Antonio Frangioni, Xinlin Li, Andrea Lodi

Modern-day neural networks are famously large, yet also highly redundant and compressible; there exist numerous pruning strategies in the deep learning literature that yield over 90% sparser sub-networks of fully-trained, dense…

Machine Learning · Computer Science 2022-06-20 Aishwarya Balwani, Jakob Krzyston

Given the ever-increasing size of modern neural networks, the significance of sparse architectures has surged due to their accelerated inference speeds and minimal memory demands. When it comes to global pruning techniques, Iterative…

Machine Learning · Computer Science 2024-04-29 Moonseok Choi, Hyungi Lee, Giung Nam, Juho Lee

Large Language Models (LLMs) have achieved remarkable success across a wide spectrum of natural language processing tasks. However, their ever-growing scale introduces significant barriers to real-world deployment, including substantial…

Computation and Language · Computer Science 2026-01-07 Guangxin Wu, Hao Zhang, Zhang Zhibin, Jiafeng Guo, Xueqi Cheng

Iterative magnitude pruning methods (IMPs), proven to be successful in reducing the number of insignificant nodes in over-parameterized deep neural networks (DNNs), have been getting an enormous amount of attention with the rapid deployment…

Machine Learning · Computer Science 2025-01-28 Soheil Gharatappeh, Salimeh Yasaei Sekeh

Iterative pruning is one of the most effective compression methods for pre-trained language models. We discovered that finding the optimal pruning decision is an equality-constrained 0-1 Integer Linear Programming problem. The solution to…

Computation and Language · Computer Science 2023-05-23 Siyu Ren, Kenny Q. Zhu

Neural network compression has gained increasing attention in recent years, particularly in computer vision applications, where the need for model reduction is crucial for overcoming deployment constraints. Pruning is a widely used…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Baptiste Bauvin, Loïc Baret, Ola Ahmad

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi, Suraj Srinivas, Simon Lacoste-Julien

Pruning is a well-established technique for removing unnecessary structure from neural networks after training to improve the performance of inference. Several recent results have explored the possibility of pruning at initialization time…

Machine Learning · Computer Science 2020-09-29 Jonathan Frankle, Gintare Karolina Dziugaite, Daniel M. Roy, Michael Carbin

The deployment of large language models (LLMs) is often constrained by their substantial computational and memory demands. While structured pruning presents a viable approach by eliminating entire network components, existing methods suffer…

Machine Learning · Computer Science 2025-05-07 Hanyu Hu, Xiaoming Yuan

Large language models have recently achieved state of the art performance across a wide variety of natural language tasks. Meanwhile, the size of these models and their latency have significantly increased, which makes their usage costly,…

Computation and Language · Computer Science 2021-03-30 Ziheng Wang, Jeremy Wohlwend, Tao Lei

Deploying complex deep learning models on edge devices is challenging because they have substantial compute and memory resource requirements, whereas edge devices' resource budget is limited. To solve this problem, extensive pruning…

Machine Learning · Computer Science 2022-01-26 Kaiqi Zhao, Animesh Jain, Ming Zhao

Considering the hardware-friendly characteristics and broad applicability, structured pruning has emerged as an efficient solution to reduce the resource demands of large language models (LLMs) on resource-constrained devices. Traditional…

Machine Learning · Computer Science 2025-01-28 Zihuai Xu, Yang Xu, Hongli Xu, Yunming Liao, Zhiwei Yao, Zuan Xie

Image super-resolution (SR) has witnessed extensive neural network designs from CNN to transformer architectures. However, prevailing SR models suffer from prohibitive memory footprint and intensive computations, which limits further…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Jiamian Wang, Huan Wang, Yulun Zhang, Yun Fu, Zhiqiang Tao

Structural neural network pruning aims to remove the redundant channels in the deep convolutional neural networks (CNNs) by pruning the filters of less importance to the final output accuracy. To reduce the degradation of performance after…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Nanfei Jiang, Xu Zhao, Chaoyang Zhao, Yongqi An, Ming Tang, Jinqiao Wang

Pruning is a popular technique for reducing the model size and computational cost of convolutional neural networks (CNNs). However, a slow retraining or fine-tuning procedure is often required to recover the accuracy loss caused by pruning.…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Yaohui Cai, Weizhe Hua, Hongzheng Chen, G. Edward Suh, Christopher De Sa, Zhiru Zhang

Pruning deep neural networks is a widely used strategy to alleviate the computational burden in machine learning. Overwhelming empirical evidence suggests that pruned models retain very high accuracy even with a tiny fraction of parameters.…

Machine Learning · Computer Science 2023-09-27 Viplove Arora, Daniele Irto, Sebastian Goldt, Guido Sanguinetti

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science 2025-08-20 Mikołaj Janusz, Tomasz Wojnar, Yawei Li, Luca Benini, Kamil Adamczewski

Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of…

Machine Learning · Statistics 2019-06-10 Elliot J. Crowley, Jack Turner, Amos Storkey, Michael O'Boyle

Pruning methods have been shown to be effective at reducing the size of deep neural networks while keeping accuracy almost intact. Among the most effective methods are those that prune a network while training it with a sparsity prior loss and…

Neural and Evolutionary Computing · Computer Science 2019-12-20 Carl Lemaire, Andrew Achkar, Pierre-Marc Jodoin