Related papers: Advancing Model Pruning via Bi-level Optimization

Stabilizing the Lottery Ticket Hypothesis

Pruning is a well-established technique for removing unnecessary structure from neural networks after training to improve the performance of inference. Several recent results have explored the possibility of pruning at initialization time…

Machine Learning · Computer Science 2020-09-29 Jonathan Frankle, Gintare Karolina Dziugaite, Daniel M. Roy, Michael Carbin

Quantifying lottery tickets under label noise: accuracy, calibration, and complexity

Pruning deep neural networks is a widely used strategy to alleviate the computational burden in machine learning. Overwhelming empirical evidence suggests that pruned models retain very high accuracy even with a tiny fraction of parameters.…

Machine Learning · Computer Science 2023-09-27 Viplove Arora, Daniele Irto, Sebastian Goldt, Guido Sanguinetti

Lottery Tickets in Linear Models: An Analysis of Iterative Magnitude Pruning

We analyse the pruning procedure behind the lottery ticket hypothesis (arXiv:1803.03635v5), iterative magnitude pruning (IMP), when applied to linear models trained by gradient flow. We begin by presenting sufficient conditions on the…

Machine Learning · Computer Science 2021-07-06 Bryn Elesedy, Varun Kanade, Yee Whye Teh

Learning effective pruning at initialization from iterative pruning

Pruning at initialization (PaI) reduces training costs by removing weights before training, which becomes increasingly crucial with the growing network size. However, current PaI methods still have a large accuracy gap with iterative…

Computer Vision and Pattern Recognition · Computer Science 2025-12-10 Shengkai Liu, Yaofeng Cheng, Fusheng Zha, Wei Guo, Lining Sun, Zhenshan Bing, Chenguang Yang

PIP: Perturbation-based Iterative Pruning for Large Language Models

The rapid increase in the parameter counts of Large Language Models (LLMs), which often reach into the billions or even trillions, presents significant challenges for their practical deployment, particularly in resource-constrained…

Machine Learning · Computer Science 2025-11-18 Yi Cao, Wei-Jie Xu, Yucheng Shen, Weijie Shi, Chi-Min Chan, Jianfeng Qu, Jiajie Xu

Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?

Modern deep learning involves training costly, highly overparameterized networks, thus motivating the search for sparser networks that can still be trained to the same accuracy as the full network (i.e. matching). Iterative magnitude…

Machine Learning · Computer Science 2022-10-07 Mansheej Paul, Feng Chen, Brett W. Larsen, Jonathan Frankle, Surya Ganguli, Gintare Karolina Dziugaite

Iterative Magnitude Pruning as a Renormalisation Group: A Study in The Context of The Lottery Ticket Hypothesis

This thesis delves into the intricate world of Deep Neural Networks (DNNs), focusing on the exciting concept of the Lottery Ticket Hypothesis (LTH). The LTH posits that within extensive DNNs, smaller, trainable subnetworks termed "winning…

Machine Learning · Computer Science 2023-08-08 Abu-Al Hassan

Win the Lottery Ticket via Fourier Analysis: Frequencies Guided Network Pruning

With the remarkable success of deep learning recently, efficient network compression algorithms are urgently demanded for releasing the potential computational power of edge devices, such as smartphones or tablets. However, optimal network…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, Yan Yan

Instant Soup: Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models

Large pre-trained transformers have been receiving explosive attention in the past few years, due to their wide adaptability for numerous downstream applications via fine-tuning, but their exponentially increasing parameter counts are…

Machine Learning · Computer Science 2023-06-21 Ajay Jaiswal, Shiwei Liu, Tianlong Chen, Ying Ding, Zhangyang Wang

SHRIMP: Sparser Random Feature Models via Iterative Magnitude Pruning

Sparse shrunk additive models and sparse random feature models have been developed separately as methods to learn low-order functions, where there are few interactions between variables, but neither offers computational efficiency. On the…

Machine Learning · Computer Science 2021-12-09 Yuege Xie, Bobby Shi, Hayden Schaeffer, Rachel Ward

One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression

Pruning is a core technique for compressing neural networks to improve computational efficiency. This process is typically approached in two ways: one-shot pruning, which involves a single pass of training and pruning, and iterative…

Machine Learning · Computer Science 2025-08-20 Mikołaj Janusz, Tomasz Wojnar, Yawei Li, Luca Benini, Kamil Adamczewski

The Elastic Lottery Ticket Hypothesis

Lottery Ticket Hypothesis (LTH) raises keen attention to identifying sparse trainable subnetworks, or winning tickets, which can be trained in isolation to achieve similar or even better performance compared to the full models. Despite many…

Computer Vision and Pattern Recognition · Computer Science 2021-10-29 Xiaohan Chen, Yu Cheng, Shuohang Wang, Zhe Gan, Jingjing Liu, Zhangyang Wang

COLT: Cyclic Overlapping Lottery Tickets for Faster Pruning of Convolutional Neural Networks

Pruning refers to the elimination of trivial weights from neural networks. The sub-networks within an overparameterized model produced after pruning are often called Lottery tickets. This research aims to generate winning lottery tickets…

Computer Vision and Pattern Recognition · Computer Science 2025-01-27 Md. Ismail Hossain, Mohammed Rakib, M. M. Lutfe Elahi, Nabeel Mohammed, Shafin Rahman

Iterative Activation-based Structured Pruning

Deploying complex deep learning models on edge devices is challenging because they have substantial compute and memory resource requirements, whereas edge devices' resource budget is limited. To solve this problem, extensive pruning…

Machine Learning · Computer Science 2022-01-26 Kaiqi Zhao, Animesh Jain, Ming Zhao

Insights into the Lottery Ticket Hypothesis and Iterative Magnitude Pruning

Lottery ticket hypothesis for deep neural networks emphasizes the importance of initialization used to re-train the sparser networks obtained using the iterative magnitude pruning process. An explanation for why the specific initialization…

Machine Learning · Computer Science 2024-06-26 Tausifa Jan Saleem, Ramanjit Ahuja, Surendra Prasad, Brejesh Lall

Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff

As a widely adopted model compression technique, model pruning has demonstrated strong effectiveness across various architectures. However, we observe that when sparsity exceeds a certain threshold, both iterative and one-shot pruning…

Machine Learning · Computer Science 2025-11-18 Junchen Liu, Yi Sheng

Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network

Recently, Frankle & Carbin (2019) demonstrated that randomly-initialized dense networks contain subnetworks that once found can be trained to reach test accuracy comparable to the trained dense network. However, finding these high…

Machine Learning · Computer Science 2021-03-18 James Diffenderfer, Bhavya Kailkhura

Mutual Information Preserving Neural Network Pruning

Pruning has emerged as the primary approach used to limit the resource requirements of large neural networks (NNs). Since the proposal of the lottery ticket hypothesis, researchers have focused either on pruning at initialization or after…

Machine Learning · Computer Science 2025-02-04 Charles Westphal, Stephen Hailes, Mirco Musolesi

LOP: Learning Optimal Pruning for Efficient On-Demand MLLMs Scaling

Structural pruning techniques are essential for deploying multimodal large language models (MLLMs) across various hardware platforms, from edge devices to cloud servers. However, current pruning methods typically determine optimal…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Zhihan Zhang, Xiang Pan, Hongchen Wei, Zhenzhong Chen

DRoP: Distributionally Robust Data Pruning

In the era of exceptionally data-hungry models, careful selection of the training data is essential to mitigate the extensive costs of deep learning. Data pruning offers a solution by removing redundant or uninformative samples from the…

Machine Learning · Computer Science 2025-02-11 Artem Vysogorets, Kartik Ahuja, Julia Kempe