English
Related papers

Related papers: Layer Pruning with Consensus: A Triple-Win Solutio…

200 papers

Recent advances in pruning of neural networks have made it possible to remove a large number of filters or weights without any perceptible drop in accuracy. The number of parameters and that of FLOPs are usually the reported metrics to…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Sara Elkerdawy , Mostafa Elhoushi , Abhineet Singh , Hong Zhang , Nilanjan Ray

Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the…

Machine Learning · Computer Science 2024-11-05 Ian Pons , Bruno Yamamoto , Anna H. Reali Costa , Artur Jordao

The proposed pruning strategy offers merits over weight-based pruning techniques: (1) it avoids irregular memory access since representations and matrices can be squeezed into their smaller but dense counterparts, leading to greater…

Computation and Language · Computer Science 2021-08-31 Chun Fan , Jiwei Li , Xiang Ao , Fei Wu , Yuxian Meng , Xiaofei Sun

As the need for more accurate and powerful Convolutional Neural Networks (CNNs) increases, so too does the size, execution time, memory footprint, and power consumption. To overcome this, solutions such as pruning have been proposed with…

Artificial Intelligence · Computer Science 2026-02-20 Joseph Bingham , Sam Helmich

Resource-efficient convolution neural networks enable not only the intelligence on edge devices but also opportunities in system-level optimization such as scheduling. In this work, we aim to improve the performance of resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2018-10-19 Ting-Wu Chin , Cha Zhang , Diana Marculescu

The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the…

Large Language Models (LLMs) exhibit strong reasoning abilities, but their high computational costs limit their practical deployment. Recent studies reveal significant redundancy in LLMs layers, making layer pruning an active research…

Computation and Language · Computer Science 2026-02-17 Hao Liu , Guangyan Li , Wensheng Zhang , Yongqiang Tang

Filter pruning has drawn more attention since resource constrained platform requires more compact model for deployment. However, current pruning methods suffer either from the inferior performance of one-shot methods, or the expensive time…

Computer Vision and Pattern Recognition · Computer Science 2020-10-15 Dong Li , Sitong Chen , Xudong Liu , Yunda Sun , Li Zhang

The advancement of convolutional neural networks (CNNs) on various vision applications has attracted lots of attention. Yet the majority of CNNs are unable to satisfy the strict requirement for real-world deployment. To overcome this, the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Wei He , Zhongzhan Huang , Mingfu Liang , Senwei Liang , Haizhao Yang

Correspondence selection aims to correctly select the consistent matches (inliers) from an initial set of putative correspondences. The selection is challenging since putative matches are typically extremely unbalanced, largely dominated by…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Chen Zhao , Yixiao Ge , Feng Zhu , Rui Zhao , Hongsheng Li , Mathieu Salzmann

Pruning is a compression method which aims to improve the efficiency of neural networks by reducing their number of parameters while maintaining a good performance, thus enhancing the performance-to-cost ratio in nontrivial ways. Of…

Neural and Evolutionary Computing · Computer Science 2023-09-25 Hugo Tessier , Ghouti Boukli Hacene , Vincent Gripon

LLM-based recommender systems have made significant progress; however, the deployment cost associated with the large parameter volume of LLMs still hinders their real-world applications. This work explores parameter pruning to improve…

Information Retrieval · Computer Science 2025-07-10 Shanle Zheng , Keqin Bao , Jizhi Zhang , Yang Zhang , Fuli Feng , Xiangnan He

Layer pruning has emerged as a promising technique for compressing large language models (LLMs) while achieving acceleration proportional to the pruning ratio. In this work, we identify that removing any layer induces a significant…

Computation and Language · Computer Science 2025-07-25 Xinrui Chen , Hongxing Zhang , Fanyi Zeng , Yongxian Wei , Yizhi Wang , Xitong Ling , Guanghao Li , Chun Yuan

Filter pruning is widely adopted to compress and accelerate the Convolutional Neural Networks (CNNs), but most previous works ignore the relationship between filters and channels in different layers. Processing each layer independently…

Computer Vision and Pattern Recognition · Computer Science 2023-04-27 Xiaorui Wang , Jun Wang , Xin Tang , Peng Gao , Rui Fang , Guotong Xie

Depth pruning improves the deployment efficiency of large language models (LLMs) by identifying and removing redundant layers. A widely accepted standard for this identification process is to measure the similarity between layers using…

Artificial Intelligence · Computer Science 2026-04-22 Yuli Chen , Shuhao Zhang , Fanshen Meng , Bo Cheng , Jiale Han , Qiang Tong , Xiulei Liu

Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2025-06-06 Gustavo Henrique do Nascimento , Ian Pons , Anna Helena Reali Costa , Artur Jordao

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science 2025-08-20 Mikołaj Janusz , Tomasz Wojnar , Yawei Li , Luca Benini , Kamil Adamczewski

Most neural network pruning methods, such as filter-level and layer-level prunings, prune the network model along one dimension (depth, width, or resolution) solely to meet a computational budget. However, such a pruning policy often leads…

Computer Vision and Pattern Recognition · Computer Science 2021-06-16 Wenxiao Wang , Minghao Chen , Shuai Zhao , Long Chen , Jinming Hu , Haifeng Liu , Deng Cai , Xiaofei He , Wei Liu

Kernel pruning methods have been proposed to speed up, simplify, and improve explanation of convolutional neural network (CNN) models. However, the effectiveness of a simplified model is often below the original one. In this letter, we…

Machine Learning · Computer Science 2021-08-19 D. Osaku , J. F. Gomes , A. X. Falcão

In order to deploy deep convolutional neural networks (CNNs) on resource-limited devices, many model pruning methods for filters and weights have been developed, while only a few to layer pruning. However, compared with filter pruning and…

Computer Vision and Pattern Recognition · Computer Science 2020-12-01 Pengtao Xu , Jian Cao , Fanhua Shang , Wenyu Sun , Pu Li
‹ Prev 1 2 3 10 Next ›