Related papers: Prototype-based Personalized Pruning

Now that I can see, I can improve: Enabling data-driven finetuning of CNNs on the edge

In today's world, a vast amount of data is being generated by edge devices that can be used as valuable training data to improve the performance of machine learning algorithms in terms of the achieved accuracy or to reduce the compute…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Aditya Rajagopal , Christos-Savvas Bouganis

Supervised Robustness-preserving Data-free Neural Network Pruning

When deploying pre-trained neural network models in real-world applications, model consumers often encounter resource-constraint platforms such as mobile and smart devices. They typically use the pruning technique to reduce the size and…

Machine Learning · Computer Science 2025-06-19 Mark Huasong Meng , Guangdong Bai , Sin Gee Teo , Jin Song Dong

Model Pruning Enables Efficient Federated Learning on Edge Devices

Federated learning (FL) allows model training from local data collected by edge/mobile devices while preserving data privacy, which has wide applicability to image and vision applications. A challenge is that client devices in FL usually…

Machine Learning · Computer Science 2022-04-07 Yuang Jiang , Shiqiang Wang , Victor Valls , Bong Jun Ko , Wei-Han Lee , Kin K. Leung , Leandros Tassiulas

CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices

Multi-task learning has garnered widespread attention in the industry due to its efficient data utilization and strong generalization capabilities, making it particularly suitable for providing high-quality intelligent services to users.…

Machine Learning · Computer Science 2026-01-06 Jingxuan Zhou , Weidong Bao , Ji Wang , Zhengyi Zhong

Lightweight Edge Learning via Dataset Pruning

Edge learning facilitates ubiquitous intelligence by enabling model training and adaptation directly on data-generating devices, thereby mitigating privacy risks and communication latency. However, the high computational and energy overhead…

Machine Learning · Computer Science 2026-02-03 Laha Ale , Hu Luo , Mingsheng Cao , Shichao Li , Huanlai Xing , Haifeng Sun

A Privacy-Preserving-Oriented DNN Pruning and Mobile Acceleration Framework

Weight pruning of deep neural networks (DNNs) has been proposed to satisfy the limited storage and computing capability of mobile edge devices. However, previous pruning methods mainly focus on reducing the model size and/or improving…

Machine Learning · Computer Science 2022-03-29 Yifan Gong , Zheng Zhan , Zhengang Li , Wei Niu , Xiaolong Ma , Wenhao Wang , Bin Ren , Caiwen Ding , Xue Lin , Xiaolin Xu , Yanzhi Wang

Complexity-Driven CNN Compression for Resource-constrained Edge AI

Recent advances in Artificial Intelligence (AI) on the Internet of Things (IoT)-enabled network edge has realized edge intelligence in several applications such as smart agriculture, smart hospitals, and smart factories by enabling…

Machine Learning · Computer Science 2024-01-18 Muhammad Zawish , Steven Davy , Lizy Abraham

Differentiable Network Pruning for Microcontrollers

Embedded and personal IoT devices are powered by microcontroller units (MCUs), whose extreme resource scarcity is a major obstacle for applications relying on on-device deep learning inference. Orders of magnitude less storage, memory and…

Machine Learning · Computer Science 2022-12-09 Edgar Liberis , Nicholas D. Lane

Differentiable Transportation Pruning

Deep learning algorithms are increasingly employed at the edge. However, edge devices are resource constrained and thus require efficient deployment of deep neural networks. Pruning methods are a key tool for edge deployment as they can…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Yunqiang Li , Jan C. van Gemert , Torsten Hoefler , Bert Moons , Evangelos Eleftheriou , Bram-Ernst Verhoef

Robust error bounds for quantised and pruned neural networks

With the rise of smartphones and the internet-of-things, data is increasingly getting generated at the edge on local, personal devices. For privacy, latency and energy saving reasons, this shift is causing machine learning algorithms to…

Machine Learning · Computer Science 2021-04-29 Jiaqi Li , Ross Drummond , Stephen R. Duncan

SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks

We introduce a pruning algorithm that provably sparsifies the parameters of a trained model in a way that approximately preserves the model's predictive accuracy. Our algorithm uses a small batch of input points to construct a data-informed…

Machine Learning · Computer Science 2021-03-16 Cenk Baykal , Lucas Liebenwein , Igor Gilitschenski , Dan Feldman , Daniela Rus

Heterogeneous Federated Learning using Dynamic Model Pruning and Adaptive Gradient

Federated Learning (FL) has emerged as a new paradigm for training machine learning models distributively without sacrificing data security and privacy. Learning models on edge devices such as mobile phones is one of the most common use…

Machine Learning · Computer Science 2023-02-10 Sixing Yu , Phuong Nguyen , Ali Anwar , Ali Jannesari

PQK: Model Compression via Pruning, Quantization, and Knowledge Distillation

As edge devices become prevalent, deploying Deep Neural Networks (DNN) on edge devices has become a critical issue. However, DNN requires a high computational resource which is rarely available for edge devices. To handle this, we propose a…

Machine Learning · Computer Science 2021-06-29 Jangho Kim , Simyung Chang , Nojun Kwak

Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions

Pruning is a promising approach to compress deep learning models in order to deploy them on resource-constrained edge devices. However, many existing pruning solutions are based on unstructured pruning, which yields models that cannot…

Machine Learning · Computer Science 2023-03-16 Kaiqi Zhao , Animesh Jain , Ming Zhao

Local Learning at the Network Edge for Efficient & Secure Real-Time Predictive Analytics

The ability to perform computation on devices, such as smartphones, cars, or other nodes present at the Internet of Things leads to constraints regarding bandwidth, storage, and energy, as most of these devices are mobile and operate on…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-11-29 Natascha Harth , Hans-Joerg Voegel , Kostas Kolomvatsos , Christos Anagnostopoulos

Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization

Edge machine learning (ML) enables localized processing of data on devices and is underpinned by deep neural networks (DNNs). However, DNNs cannot be easily run on devices due to their substantial computing, memory and energy requirements…

Machine Learning · Computer Science 2025-04-09 Bailey J. Eccles , Leon Wong , Blesson Varghese

Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon

How to develop slim and accurate deep neural networks has become crucial for real- world applications, especially for those employed in embedded systems. Though previous work along this research line has shown some promising results, most…

Neural and Evolutionary Computing · Computer Science 2019-10-02 Xin Dong , Shangyu Chen , Sinno Jialin Pan

Structured Deep Neural Network Pruning via Matrix Pivoting

Deep Neural Networks (DNNs) are the key to the state-of-the-art machine vision, sensor fusion and audio/video signal processing. Unfortunately, their computation complexity and tight resource constraints on the Edge make them hard to…

Machine Learning · Computer Science 2017-12-05 Ranko Sredojevic , Shaoyi Cheng , Lazar Supic , Rawan Naous , Vladimir Stojanovic

Reweighted Proximal Pruning for Large-Scale Language Representation

Recently, pre-trained language representation flourishes as the mainstay of the natural language understanding community, e.g., BERT. These pre-trained language representations can create state-of-the-art results on a wide range of…

Machine Learning · Computer Science 2019-12-24 Fu-Ming Guo , Sijia Liu , Finlay S. Mungall , Xue Lin , Yanzhi Wang

Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning

Most existing pruning works are resource-intensive, requiring retraining or fine-tuning of the pruned models for accuracy. We propose a retraining-free pruning method based on hyperspherical learning and loss penalty terms. The proposed…

Computer Vision and Pattern Recognition · Computer Science 2022-12-27 Dan Liu , Xue Liu