English
Related papers

Related papers: Efficient DNN-Powered Software with Fair Sparse Mo…

200 papers

Deep neural networks have achieved exceptional results across a range of applications. As the demand for efficient and sparse deep learning models escalates, the significance of model compression, particularly pruning, is increasingly…

Machine Learning · Computer Science 2025-04-01 Yucong Dai , Gen Li , Feng Luo , Xiaolong Ma , Yongkai Wu

This study introduces an innovative approach aimed at the efficient pruning of neural networks, with a particular focus on their deployment on edge devices. Our method involves the integration of the Lottery Ticket Hypothesis (LTH) with the…

Information Retrieval · Computer Science 2024-01-22 Rajaram R , Manoj Bharadhwaj , Vasan VS , Nargis Pervin

The \textit{lottery ticket hypothesis} (LTH) states that learning on a properly pruned network (the \textit{winning ticket}) improves test accuracy over the original unpruned network. Although LTH has been justified empirically in a broad…

Machine Learning · Computer Science 2021-12-06 Shuai Zhang , Meng Wang , Sijia Liu , Pin-Yu Chen , Jinjun Xiong

In deep model compression, the recent finding "Lottery Ticket Hypothesis" (LTH) (Frankle & Carbin, 2018) pointed out that there could exist a winning ticket (i.e., a properly pruned sub-network together with original weight initialization)…

Machine Learning · Computer Science 2021-07-20 Ning Liu , Geng Yuan , Zhengping Che , Xuan Shen , Xiaolong Ma , Qing Jin , Jian Ren , Jian Tang , Sijia Liu , Yanzhi Wang

Network pruning is a method for reducing test-time computational resource requirements with minimal performance degradation. Conventional wisdom of pruning algorithms suggests that: (1) Pruning methods exploit information from training data…

Machine Learning · Computer Science 2020-10-23 Jingtong Su , Yihang Chen , Tianle Cai , Tianhao Wu , Ruiqi Gao , Liwei Wang , Jason D. Lee

Quantization is an essential technique for making neural networks more efficient, yet our theoretical understanding of it remains limited. Previous works demonstrated that extremely low-precision networks, such as binary networks, can be…

Machine Learning · Computer Science 2025-08-18 Aakash Kumar , Emanuele Natale

Discovering a high-performing sparse network within a massive neural network is advantageous for deploying them on devices with limited storage, such as mobile phones. Additionally, model explainability is essential to fostering trust in…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Shantanu Ghosh , Kayhan Batmanghelich

The Strong Lottery Ticket Hypothesis (SLTH) states that randomly-initialised neural networks likely contain subnetworks that perform well without any training. Although unstructured pruning has been extensively studied in this context, its…

Machine Learning · Computer Science 2026-03-11 Arthur da Cunha , Francesco d'Amore , Emanuele Natale

Modern deep neural networks (DNNs) are vulnerable to adversarial attacks and adversarial training has been shown to be a promising method for improving the adversarial robustness of DNNs. Pruning methods have been considered in adversarial…

Machine Learning · Computer Science 2022-03-08 Xupeng Shi , Pengfei Zheng , A. Adam Ding , Yuan Gao , Weizhong Zhang

The most common method for DNN pruning is hard thresholding of network weights, followed by retraining to recover any lost accuracy. Recently developed smart pruning algorithms use the DNN response over the training set for a variety of…

Machine Learning · Computer Science 2019-05-23 Konstantinos Pitas , Mike Davies , Pierre Vandergheynst

When deploying pre-trained neural network models in real-world applications, model consumers often encounter resource-constraint platforms such as mobile and smart devices. They typically use the pruning technique to reduce the size and…

Machine Learning · Computer Science 2025-06-19 Mark Huasong Meng , Guangdong Bai , Sin Gee Teo , Jin Song Dong

Deep Neural Network (DNN) is powerful but computationally expensive and memory intensive, thus impeding its practical usage on resource-constrained front-end devices. DNN pruning is an approach for deep model compression, which aims at…

Machine Learning · Computer Science 2019-10-28 Xiaohan Ding , Guiguang Ding , Xiangxin Zhou , Yuchen Guo , Jungong Han , Ji Liu

Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep networks…

Machine Learning · Statistics 2017-11-15 Michael Zhu , Suyog Gupta

Sparse models require less memory for storage and enable a faster inference by reducing the necessary number of FLOPs. This is relevant both for time-critical and on-device computations using neural networks. The stabilized lottery ticket…

Machine Learning · Computer Science 2020-07-06 Christopher Brix , Parnia Bahar , Hermann Ney

Irrespective of the specific definition of fairness in a machine learning application, pruning the underlying model affects it. We investigate and document the emergence and exacerbation of undesirable per-class performance imbalances,…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Michela Paganini

The rapid growth of resource-constrained mobile platforms, including mobile robots, wearable systems, and Internet-of-Things devices, has increased the demand for computationally efficient neural network controllers (NNCs) that can operate…

Robotics · Computer Science 2025-08-12 Ganesh Sundaram , Jonas Ulmen , Amjad Haider , Daniel Görges

Transformer-based language models have shown state-of-the-art performance on a variety of natural language understanding tasks. To achieve this performance, these models are first pre-trained on general corpus and then fine-tuned on…

Computation and Language · Computer Science 2024-07-15 Mohammadreza Tayaranian , Seyyed Hasan Mozafari , Brett H. Meyer , James J. Clark , Warren J. Gross

Network pruning is widely used for reducing the heavy inference cost of deep models in low-resource settings. A typical pruning algorithm is a three-stage pipeline, i.e., training (a large model), pruning and fine-tuning. During pruning,…

Machine Learning · Computer Science 2019-03-06 Zhuang Liu , Mingjie Sun , Tinghui Zhou , Gao Huang , Trevor Darrell

Deep neural networks (DNNs) underpin many machine learning applications. Production quality DNN models achieve high inference accuracy by training millions of DNN parameters which has a significant resource footprint. This presents a…

Machine Learning · Computer Science 2025-04-09 Bailey J. Eccles , Philip Rodgers , Peter Kilpatrick , Ivor Spence , Blesson Varghese

Neural Network pruning is an increasingly popular way for producing compact and efficient models, suitable for resource-limited environments, while preserving high performance. While the pruning can be performed using a multi-cycle training…

Machine Learning · Computer Science 2025-01-22 Athanasios Glentis Georgoulakis , George Retsinas , Petros Maragos
‹ Prev 1 2 3 10 Next ›