Related papers: Auto-Train-Once: Controller Network Guided Automat…

Only Train Once: A One-Shot Neural Network Training And Pruning Framework

Structured pruning is a commonly used technique in deploying deep neural networks (DNNs) onto resource-constrained devices. However, the existing pruning methods are usually heuristic, task-specified, and require an extra fine-tuning…

Machine Learning · Computer Science 2021-11-15 Tianyi Chen , Bo Ji , Tianyu Ding , Biyi Fang , Guanyi Wang , Zhihui Zhu , Luming Liang , Yixin Shi , Sheng Yi , Xiao Tu

OTOv3: Automatic Architecture-Agnostic Neural Network Training and Compression from Structured Pruning to Erasing Operators

Compressing a predefined deep neural network (DNN) into a compact sub-network with competitive performance is crucial in the efficient machine learning realm. This topic spans various techniques, from structured pruning to neural…

Machine Learning · Computer Science 2023-12-18 Tianyi Chen , Tianyu Ding , Zhihui Zhu , Zeyu Chen , HsiangTao Wu , Ilya Zharkov , Luming Liang

OTOV2: Automatic, Generic, User-Friendly

The existing model compression methods via structured pruning typically require complicated multi-stage procedures. Each individual stage necessitates numerous engineering efforts and domain-knowledge from the end-users which prevent their…

Computer Vision and Pattern Recognition · Computer Science 2023-06-26 Tianyi Chen , Luming Liang , Tianyu Ding , Zhihui Zhu , Ilya Zharkov

Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery

We propose a novel algorithm for combined unit and layer pruning of deep neural networks that functions during training and without requiring a pre-trained network to apply. Our algorithm optimally trades-off learning accuracy and pruning…

Machine Learning · Computer Science 2025-07-17 Valentin Frank Ingmar Guenter , Athanasios Sideris

HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and Pruning

Structured pruning is one of the most popular approaches to effectively compress the heavy deep neural networks (DNNs) into compact sub-networks while retaining performance. The existing methods suffer from multi-stage procedures along with…

Machine Learning · Computer Science 2025-05-09 Tianyi Chen , Xiaoyi Qu , David Aponte , Colby Banbury , Jongwoo Ko , Tianyu Ding , Yong Ma , Vladimir Lyapunov , Ilya Zharkov , Luming Liang

Once-for-All: Train One Network and Specialize it for Efficient Deployment

We address the challenging problem of efficient inference across many devices and resource constraints, especially on edge devices. Conventional approaches either manually design or use neural architecture search (NAS) to find a specialized…

Machine Learning · Computer Science 2020-05-01 Han Cai , Chuang Gan , Tianzhe Wang , Zhekai Zhang , Song Han

Automated Pruning for Deep Neural Network Compression

In this work we present a method to improve the pruning step of the current state-of-the-art methodology to compress neural networks. The novelty of the proposed pruning technique is in its differentiability, which allows pruning to be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-08 Franco Manessi , Alessandro Rozza , Simone Bianco , Paolo Napoletano , Raimondo Schettini

MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning

In this paper, we propose a novel meta learning approach for automatic channel pruning of very deep neural networks. We first train a PruningNet, a kind of meta network, which is able to generate weight parameters for any pruned structure…

Computer Vision and Pattern Recognition · Computer Science 2019-08-15 Zechun Liu , Haoyuan Mu , Xiangyu Zhang , Zichao Guo , Xin Yang , Tim Kwang-Ting Cheng , Jian Sun

Boosting the Convergence of Reinforcement Learning-based Auto-pruning Using Historical Data

Recently, neural network compression schemes like channel pruning have been widely used to reduce the model size and computational complexity of deep neural network (DNN) for applications in power-constrained scenarios such as embedded…

Machine Learning · Computer Science 2021-07-20 Jiandong Mu , Mengdi Wang , Feiwen Zhu , Jun Yang , Wei Lin , Wei Zhang

AutoSculpt: A Pattern-based Model Auto-pruning Framework Using Reinforcement Learning and Graph Learning

As deep neural networks (DNNs) are increasingly deployed on edge devices, optimizing models for constrained computational resources is critical. Existing auto-pruning methods face challenges due to the diversity of DNN models, various…

Artificial Intelligence · Computer Science 2026-04-21 Lixian Jing , Jianpeng Qi , Junyu Dong , Yanwei Yu

Plug-in, Trainable Gate for Streamlining Arbitrary Neural Networks

Architecture optimization, which is a technique for finding an efficient neural network that meets certain requirements, generally reduces to a set of multiple-choice selection problems among alternative sub-structures or parameters. The…

Machine Learning · Computer Science 2019-11-15 Jaedeok Kim , Chiyoun Park , Hyun-Joo Jung , Yoonsuck Choe

Network Pruning via Annealing and Direct Sparsity Control

Artificial neural networks (ANNs) especially deep convolutional networks are very popular these days and have been proved to successfully offer quite reliable solutions to many vision problems. However, the use of deep neural networks is…

Machine Learning · Computer Science 2020-07-28 Yangzi Guo , Yiyuan She , Adrian Barbu

Data-Efficient Structured Pruning via Submodular Optimization

Structured pruning is an effective approach for compressing large pre-trained neural networks without significantly affecting their performance. However, most current structured pruning methods do not provide any performance guarantees, and…

Machine Learning · Computer Science 2023-02-14 Marwa El Halabi , Suraj Srinivas , Simon Lacoste-Julien

Activation Density driven Energy-Efficient Pruning in Training

Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point…

Machine Learning · Computer Science 2020-10-13 Timothy Foldy-Porto , Yeshwanth Venkatesha , Priyadarshini Panda

AutoPruning for Deep Neural Network with Dynamic Channel Masking

Modern deep neural network models are large and computationally intensive. One typical solution to this issue is model pruning. However, most current pruning algorithms depend on hand crafted rules or domain expertise. To overcome this…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Baopu Li , Yanwen Fan , Zhihong Pan , Gang Zhang

Online Training and Pruning of Deep Reinforcement Learning Networks

Scaling deep neural networks (NN) of reinforcement learning (RL) algorithms has been shown to enhance performance when feature extraction networks are used but the gained performance comes at the significant expense of increased…

Machine Learning · Computer Science 2025-07-17 Valentin Frank Ingmar Guenter , Athanasios Sideris

Pruning from Scratch

Network pruning is an important research field aiming at reducing computational costs of neural networks. Conventional approaches follow a fixed paradigm which first trains a large and redundant network, and then determines which units…

Computer Vision and Pattern Recognition · Computer Science 2019-09-30 Yulong Wang , Xiaolu Zhang , Lingxi Xie , Jun Zhou , Hang Su , Bo Zhang , Xiaolin Hu

Network Trimming: A Data-Driven Neuron Pruning Approach towards Efficient Deep Architectures

State-of-the-art neural networks are getting deeper and wider. While their performance increases with the increasing number of layers and neurons, it is crucial to design an efficient deep architecture in order to reduce computational and…

Neural and Evolutionary Computing · Computer Science 2016-07-13 Hengyuan Hu , Rui Peng , Yu-Wing Tai , Chi-Keung Tang

A Framework For Pruning Deep Neural Networks Using Energy-Based Models

A typical deep neural network (DNN) has a large number of trainable parameters. Choosing a network with proper capacity is challenging and generally a larger network with excessive capacity is trained. Pruning is an established approach to…

Neural and Evolutionary Computing · Computer Science 2021-03-01 Hojjat Salehinejad , Shahrokh Valaee

Single-Shot Pruning for Offline Reinforcement Learning

Deep Reinforcement Learning (RL) is a powerful framework for solving complex real-world problems. Large neural networks employed in the framework are traditionally associated with better generalization capabilities, but their increased size…

Machine Learning · Computer Science 2022-01-03 Samin Yeasar Arnob , Riyasat Ohib , Sergey Plis , Doina Precup