Related papers: A Generic Layer Pruning Method for Signal Modulati…

Multi-loss-aware Channel Pruning of Deep Networks

Channel pruning, which seeks to reduce the model size by removing redundant channels, is a popular solution for deep networks compression. Existing channel pruning methods usually conduct layer-wise channel selection by directly minimizing…

Computer Vision and Pattern Recognition · Computer Science 2019-05-14 Yiming Hu , Siyang Sun , Jianquan Li , Jiagang Zhu , Xingang Wang , Qingyi Gu

Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon

How to develop slim and accurate deep neural networks has become crucial for real- world applications, especially for those employed in embedded systems. Though previous work along this research line has shown some promising results, most…

Neural and Evolutionary Computing · Computer Science 2019-10-02 Xin Dong , Shangyu Chen , Sinno Jialin Pan

Pruning Everything, Everywhere, All at Once

Deep learning stands as the modern paradigm for solving cognitive tasks. However, as the problem complexity increases, models grow deeper and computationally prohibitive, hindering advancements in real-world and resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2025-06-06 Gustavo Henrique do Nascimento , Ian Pons , Anna Helena Reali Costa , Artur Jordao

Effective Layer Pruning Through Similarity Metric Perspective

Deep neural networks have been the predominant paradigm in machine learning for solving cognitive tasks. Such models, however, are restricted by a high computational overhead, limiting their applicability and hindering advancements in the…

Machine Learning · Computer Science 2024-11-05 Ian Pons , Bruno Yamamoto , Anna H. Reali Costa , Artur Jordao

A "Network Pruning Network" Approach to Deep Model Compression

We present a filter pruning approach for deep model compression, using a multitask network. Our approach is based on learning a a pruner network to prune a pre-trained target network. The pruner is essentially a multitask deep neural…

Computer Vision and Pattern Recognition · Computer Science 2020-01-17 Vinay Kumar Verma , Pravendra Singh , Vinay P. Namboodiri , Piyush Rai

Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling

Many efforts have been made to facilitate natural language processing tasks with pre-trained language models (LMs), and brought significant improvements to various applications. To fully leverage the nearly unlimited corpora and capture…

Computation and Language · Computer Science 2018-09-11 Liyuan Liu , Xiang Ren , Jingbo Shang , Jian Peng , Jiawei Han

Chain-Net: Learning Deep Model for Modulation Classification Under Synthetic Channel Impairment

Modulation classification, an intermediate process between signal detection and demodulation in a physical layer, is now attracting more interest to the cognitive radio field, wherein the performance is powered by artificial intelligence…

Signal Processing · Electrical Eng. & Systems 2020-09-07 Thien Huynh-The , Van-Sang Doan , Cam-Hao Hua , Quoc-Viet Pham , Dong-Seong Kim

Modular Representation of Layered Neural Networks

Layered neural networks have greatly improved the performance of various applications including image processing, speech recognition, natural language processing, and bioinformatics. However, it is still difficult to discover or interpret…

Machine Learning · Statistics 2017-10-05 Chihiro Watanabe , Kaoru Hiramatsu , Kunio Kashino

Modeling of Pruning Techniques for Deep Neural Networks Simplification

Convolutional Neural Networks (CNNs) suffer from different issues, such as computational complexity and the number of parameters. In recent years pruning techniques are employed to reduce the number of operations and model size in CNNs.…

Computer Vision and Pattern Recognition · Computer Science 2020-01-14 Morteza Mousa Pasandi , Mohsen Hajabdollahi , Nader Karimi , Shadrokh Samavi

Towards thinner convolutional neural networks through Gradually Global Pruning

Deep network pruning is an effective method to reduce the storage and computation cost of deep neural networks when applying them to resource-limited devices. Among many pruning granularities, neuron level pruning will remove redundant…

Computer Vision and Pattern Recognition · Computer Science 2017-03-30 Zhengtao Wang , Ce Zhu , Zhiqiang Xia , Qi Guo , Yipeng Liu

Effective Network Compression Using Simulation-Guided Iterative Pruning

Existing high-performance deep learning models require very intensive computing. For this reason, it is difficult to embed a deep learning model into a system with limited resources. In this paper, we propose the novel idea of the network…

Machine Learning · Computer Science 2019-02-13 Dae-Woong Jeong , Jaehun Kim , Youngseok Kim , Tae-Ho Kim , Myungsu Chae

Layer pruning has emerged as a potent approach to remove redundant layers in the pre-trained network on the purpose of reducing network size and improve computational efficiency. However, existing layer pruning methods mostly overlook the…

Machine Learning · Computer Science 2025-11-17 Yuqi Li , Yao Lu , Junhao Dong , Zeyu Dong , Chuanguang Yang , Xin Yin , Yihao Chen , Jianping Gou , Yingli Tian , Tingwen Huang

C2S2: Cost-aware Channel Sparse Selection for Progressive Network Pruning

This paper describes a channel-selection approach for simplifying deep neural networks. Specifically, we propose a new type of generic network layer, called pruning layer, to seamlessly augment a given pre-trained model for compression.…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Chih-Yao Chiu , Hwann-Tzong Chen , Tyng-Luh Liu

Pruning at a Glance: Global Neural Pruning for Model Compression

Deep Learning models have become the dominant approach in several areas due to their high performance. Unfortunately, the size and hence computational requirements of operating such models can be considerably high. Therefore, this…

Computer Vision and Pattern Recognition · Computer Science 2019-12-04 Abdullah Salama , Oleksiy Ostapenko , Tassilo Klein , Moin Nabi

Differentiable Mask for Pruning Convolutional and Recurrent Networks

Pruning is one of the most effective model reduction techniques. Deep networks require massive computation and such models need to be compressed to bring them on edge devices. Most existing pruning techniques are focused on vision-based…

Machine Learning · Computer Science 2020-04-30 Ramchalam Kinattinkara Ramakrishnan , Eyyüb Sari , Vahid Partovi Nia

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi , Suraj Srinivas , Simon Lacoste-Julien

Rethinking the Smaller-Norm-Less-Informative Assumption in Channel Pruning of Convolution Layers

Model pruning has become a useful technique that improves the computational efficiency of deep learning, making it possible to deploy solutions in resource-limited scenarios. A widely-used practice in relevant work assumes that a…

Machine Learning · Computer Science 2018-02-06 Jianbo Ye , Xin Lu , Zhe Lin , James Z. Wang

AutoPruning for Deep Neural Network with Dynamic Channel Masking

Modern deep neural network models are large and computationally intensive. One typical solution to this issue is model pruning. However, most current pruning algorithms depend on hand crafted rules or domain expertise. To overcome this…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Baopu Li , Yanwen Fan , Zhihong Pan , Gang Zhang

Model Preserving Compression for Neural Networks

After training complex deep learning models, a common task is to compress the model to reduce compute and storage demands. When compressing, it is desirable to preserve the original model's per-example decisions (e.g., to go beyond top-1…

Machine Learning · Computer Science 2022-10-18 Jerry Chee , Megan Renz , Anil Damle , Christopher De Sa

IG-Pruning: Input-Guided Block Pruning for Large Language Models

With the growing computational demands of large language models (LLMs), efficient inference has become increasingly critical for practical deployment. Depth pruning has emerged as a promising approach for reducing the computational costs of…

Computation and Language · Computer Science 2025-11-05 Kangyu Qiao , Shaolei Zhang , Yang Feng