Related papers: Building Efficient ConvNets using Redundant Featur…

SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy. Our algorithm uses a small batch of input points to construct a data-informed…

Machine Learning · Computer Science 2021-03-16 Cenk Baykal , Lucas Liebenwein , Igor Gilitschenski , Dan Feldman , Daniela Rus

Towards Compact and Robust Deep Neural Networks

Deep neural networks have achieved impressive performance in many applications but their large number of parameters lead to significant computational and storage overheads. Several recent works attempt to mitigate these overheads by…

Machine Learning · Computer Science 2019-06-17 Vikash Sehwag , Shiqi Wang , Prateek Mittal , Suman Jana

Blending Pruning Criteria for Convolutional Neural Networks

The advancement of convolutional neural networks (CNNs) on various vision applications has attracted lots of attention. Yet the majority of CNNs are unable to satisfy the strict requirement for real-world deployment. To overcome this, the…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Wei He , Zhongzhan Huang , Mingfu Liang , Senwei Liang , Haizhao Yang

A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs

Deep convolutional neural networks (CNNs) have achieved impressive performance in many computer vision tasks. However, their large model sizes require heavy computational resources, making pruning redundant filters from existing pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Kiran Purohit , Anurag Reddy Parvathgari , Sourangshu Bhattacharya

Resolution Adaptive Networks for Efficient Inference

Adaptive inference is an effective mechanism to achieve a dynamic tradeoff between accuracy and computational cost in deep networks. Existing works mainly exploit architecture redundancy in network depth or width. In this paper, we focus on…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Le Yang , Yizeng Han , Xi Chen , Shiji Song , Jifeng Dai , Gao Huang

Neuron Pruning for Compressing Deep Networks using Maxout Architectures

This paper presents an efficient and robust approach for reducing the size of deep neural networks by pruning entire neurons. It exploits maxout units for combining neurons into more complex convex functions and it makes use of a local…

Computer Vision and Pattern Recognition · Computer Science 2017-07-24 Fernando Moya Rueda , Rene Grzeszick , Gernot A. Fink

Advanced deep architecture pruning using single filter performance

Pruning the parameters and structure of neural networks reduces the computational complexity, energy consumption, and latency during inference. Recently, a novel underlying mechanism for successful deep learning (DL) was presented based on…

Machine Learning · Computer Science 2025-06-13 Yarden Tzach , Yuval Meir , Ronit D. Gross , Ofek Tevet , Ella Koresh , Ido Kanter

Partition Pruning: Parallelization-Aware Pruning for Deep Neural Networks

Parameters of recent neural networks require a huge amount of memory. These parameters are used by neural networks to perform machine learning tasks when processing inputs. To speed up inference, we develop Partition Pruning, an innovative…

Computer Vision and Pattern Recognition · Computer Science 2019-02-28 Sina Shahhosseini , Ahmad Albaqsami , Masoomeh Jasemi , Nader Bagherzadeh

Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration

Previous works utilized ''smaller-norm-less-important'' criterion to prune filters with smaller norm values in a convolutional neural network. In this paper, we analyze this norm-based criterion and point out that its effectiveness depends…

Computer Vision and Pattern Recognition · Computer Science 2019-07-16 Yang He , Ping Liu , Ziwei Wang , Zhilan Hu , Yi Yang

Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training

The subject of green AI has been gaining attention within the deep learning community given the recent trend of ever larger and more complex neural network models. Existing solutions for reducing the computational load of training at…

Machine Learning · Computer Science 2025-01-13 Xiaoying Zhi , Varun Babbar , Rundong Liu , Pheobe Sun , Fran Silavong , Ruibo Shi , Sean Moran

Toward domain generalized pruning by scoring out-of-distribution importance

Filter pruning has been widely used for compressing convolutional neural networks to reduce computation costs during the deployment stage. Recent studies have shown that filter pruning techniques can achieve lossless compression of deep…

Machine Learning · Computer Science 2022-10-26 Rizhao Cai , Haoliang Li , Alex Kot

Deep Network Pruning: A Comparative Study on CNNs in Face Recognition

The widespread use of mobile devices for all kinds of transactions makes necessary reliable and real-time identity authentication, leading to the adoption of face recognition (FR) via the cameras embedded in such devices. Progress of deep…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Fernando Alonso-Fernandez , Kevin Hernandez-Diaz , Jose Maria Buades Rubio , Prayag Tiwari , Josef Bigun

Network Pruning via Transformable Architecture Search

Network pruning reduces the computation costs of an over-parameterized network without performance damage. Prevailing pruning algorithms pre-define the width and depth of the pruned networks, and then transfer parameters from the unpruned…

Computer Vision and Pattern Recognition · Computer Science 2019-10-17 Xuanyi Dong , Yi Yang

Adaptive Pruning of Deep Neural Networks for Resource-Aware Embedded Intrusion Detection on the Edge

Artificial neural network pruning is a method in which artificial neural network sizes can be reduced while attempting to preserve the predicting capabilities of the network. This is done to make the model smaller or faster during inference…

Machine Learning · Computer Science 2025-05-21 Alexandre Broggi , Nathaniel Bastian , Lance Fiondella , Gokhan Kul

Towards Efficient Convolutional Network Models with Filter Distribution Templates

Increasing number of filters in deeper layers when feature maps are decreased is a widely adopted pattern in convolutional network design. It can be found in classical CNN architectures and in automatic discovered models. Even CNS methods…

Computer Vision and Pattern Recognition · Computer Science 2021-04-20 Ramon Izquierdo-Cordova , Walterio Mayol-Cuevas

Efficient Column-Wise N:M Pruning on RISC-V CPU

In deep learning frameworks, weight pruning is a widely used technique for improving computational efficiency by reducing the size of large models. This is especially critical for convolutional operators, which often act as performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-24 Chi-Wei Chu , Ding-Yong Hong , Jan-Jan Wu

Growing Deep Neural Network Considering with Similarity between Neurons

Deep learning has excelled in image recognition tasks through neural networks inspired by the human brain. However, the necessity for large models to improve prediction accuracy introduces significant computational demands and extended…

Computer Vision and Pattern Recognition · Computer Science 2024-08-27 Taigo Sakai , Kazuhiro Hotta

Distilled Pruning: Using Synthetic Data to Win the Lottery

This work introduces a novel approach to pruning deep learning models by using distilled data. Unlike conventional strategies which primarily focus on architectural or algorithmic optimization, our method reconsiders the role of data in…

Machine Learning · Computer Science 2023-08-10 Luke McDermott , Daniel Cummings

Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks

Convolutional neural networks are prevailing in deep learning tasks. However, they suffer from massive cost issues when working on mobile devices. Network pruning is an effective method of model compression to handle such problems. This…

Computer Vision and Pattern Recognition · Computer Science 2022-05-10 Zhaofeng Si , Honggang Qi , Xiaoyu Song

RRR-Net: Reusing, Reducing, and Recycling a Deep Backbone Network

It has become mainstream in computer vision and other machine learning domains to reuse backbone networks pre-trained on large datasets as preprocessors. Typically, the last layer is replaced by a shallow learning machine of sorts; the…

Machine Learning · Computer Science 2023-10-03 Haozhe Sun , Isabelle Guyon , Felix Mohr , Hedi Tabia