Related papers: Joint Device-Edge Inference over Wireless Links wi…

SlimEdge: Performance and Device Aware Distributed DNN Deployment on Resource-Constrained Edge Hardware

Distributed deep neural networks (DNNs) have become central to modern computer vision, yet their deployment on resource-constrained edge devices remains hindered by substantial parameter counts, computational demands, and the probability of…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-02-17 Mahadev Sunil Kumar , Arnab Raha , Debayan Das , Gopakumar G , Rounak Chatterjee , Amitava Mukherjee

Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs

Semantic- and task-oriented communication has emerged as a promising approach to reducing the latency and bandwidth requirements of next-generation mobile networks by transmitting only the most relevant information needed to complete a…

Information Theory · Computer Science 2024-09-27 Deniz Gündüz , Michèle A. Wigger , Tze-Yang Tung , Ping Zhang , Yong Xiao

To prune, or not to prune: exploring the efficacy of pruning for model compression

Model pruning seeks to induce sparsity in a deep neural network's various connection matrices, thereby reducing the number of nonzero-valued parameters in the model. Recent reports (Han et al., 2015; Narang et al., 2017) prune deep networks…

Machine Learning · Statistics 2017-11-15 Michael Zhu , Suyog Gupta

Coding for Computation: Efficient Compression of Neural Networks for Reconfigurable Hardware

As state of the art neural networks (NNs) continue to grow in size, their resource-efficient implementation becomes ever more important. In this paper, we introduce a compression scheme that reduces the number of computations required for…

Machine Learning · Computer Science 2025-04-25 Hans Rosenberger , Rodrigo Fischer , Johanna S. Fröhlich , Ali Bereyhi , Ralf R. Müller

Communication-Efficient Edge AI Inference Over Wireless Networks

Given the fast growth of intelligent devices, it is expected that a large number of high-stake artificial intelligence (AI) applications, e.g., drones, autonomous cars, tactile robots, will be deployed at the edge of wireless networks in…

Signal Processing · Electrical Eng. & Systems 2020-04-29 Kai Yang , Yong Zhou , Zhanpeng Yang , Yuanming Shi

Retrospective: EIE: Efficient Inference Engine on Sparse and Compressed Neural Network

EIE proposed to accelerate pruned and compressed neural networks, exploiting weight sparsity, activation sparsity, and 4-bit weight-sharing in neural network accelerators. Since published in ISCA'16, it opened a new design space to…

Hardware Architecture · Computer Science 2023-06-19 Song Han , Xingyu Liu , Huizi Mao , Jing Pu , Ardavan Pedram , Mark A. Horowitz , William J. Dally

Energy-Efficient Classification at the Wireless Edge with Reliability Guarantees

Learning at the edge is a challenging task from several perspectives, since data must be collected by end devices (e.g. sensors), possibly pre-processed (e.g. data compression), and finally processed remotely to output the result of…

Signal Processing · Electrical Eng. & Systems 2022-04-26 Mattia Merluzzi , Claudio Battiloro , Paolo Di Lorenzo , Emilio Calvanese Strinati

Residual Diffusion Models for Variable-Rate Joint Source Channel Coding of MIMO CSI

Despite significant advancements in deep learning based CSI compression, some key limitations remain unaddressed. Current approaches predominantly treat CSI compression as a source-coding problem, thereby neglecting transmission errors.…

Information Theory · Computer Science 2026-02-19 Sravan Kumar Ankireddy , Heasung Kim , Joonyoung Cho , Hyeji Kim

Structured Pruning of Deep Convolutional Neural Networks

Real time application of deep learning algorithms is often hindered by high computational complexity and frequent memory accesses. Network pruning is a promising technique to solve this problem. However, pruning usually results in irregular…

Neural and Evolutionary Computing · Computer Science 2015-12-31 Sajid Anwar , Kyuyeon Hwang , Wonyong Sung

Distributed Deep Joint Source-Channel Coding with Decoder-Only Side Information

We consider low-latency image transmission over a noisy wireless channel when correlated side information is present only at the receiver side (the Wyner-Ziv scenario). In particular, we are interested in developing practical schemes using…

Computer Vision and Pattern Recognition · Computer Science 2024-02-28 Selim F. Yilmaz , Ezgi Ozyilkan , Deniz Gunduz , Elza Erkip

Lightweight Edge Learning via Dataset Pruning

Edge learning facilitates ubiquitous intelligence by enabling model training and adaptation directly on data-generating devices, thereby mitigating privacy risks and communication latency. However, the high computational and energy overhead…

Machine Learning · Computer Science 2026-02-03 Laha Ale , Hu Luo , Mingsheng Cao , Shichao Li , Huanlai Xing , Haifeng Sun

Efficient Model Compression for Bayesian Neural Networks

Model Compression has drawn much attention within the deep learning community recently. Compressing a dense neural network offers many advantages including lower computation cost, deployability to devices of limited storage and memories,…

Machine Learning · Computer Science 2024-11-04 Diptarka Saha , Zihe Liu , Feng Liang

Diffusion-Aided Joint Source Channel Coding For High Realism Wireless Image Transmission

Deep learning-based joint source-channel coding (deep JSCC) has been demonstrated to be an effective approach for wireless image transmission. Nevertheless, most existing work adopts an autoencoder framework to optimize conventional…

Signal Processing · Electrical Eng. & Systems 2025-03-25 Mingyu Yang , Bowen Liu , Boyang Wang , Hun-Seok Kim

Towards Compact CNNs via Collaborative Compression

Channel pruning and tensor decomposition have received extensive attention in convolutional neural network compression. However, these two techniques are traditionally deployed in an isolated manner, leading to significant accuracy drop…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Yuchao Li , Shaohui Lin , Jianzhuang Liu , Qixiang Ye , Mengdi Wang , Fei Chao , Fan Yang , Jincheng Ma , Qi Tian , Rongrong Ji

Branchy-GNN: a Device-Edge Co-Inference Framework for Efficient Point Cloud Processing

The recent advancements of three-dimensional (3D) data acquisition devices have spurred a new breed of applications that rely on point cloud data processing. However, processing a large volume of point cloud data brings a significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-06 Jiawei Shao , Haowei Zhang , Yuyi Mao , Jun Zhang

Pruning at a Glance: Global Neural Pruning for Model Compression

Deep Learning models have become the dominant approach in several areas due to their high performance. Unfortunately, the size and hence computational requirements of operating such models can be considerably high. Therefore, this…

Computer Vision and Pattern Recognition · Computer Science 2019-12-04 Abdullah Salama , Oleksiy Ostapenko , Tassilo Klein , Moin Nabi

Deep Neural Compression Via Concurrent Pruning and Self-Distillation

Pruning aims to reduce the number of parameters while maintaining performance close to the original network. This work proposes a novel \emph{self-distillation} based pruning strategy, whereby the representational similarity between the…

Machine Learning · Computer Science 2021-10-01 James O' Neill , Sourav Dutta , Haytham Assem

DeepJSCC-f: Deep Joint Source-Channel Coding of Images with Feedback

We consider wireless transmission of images in the presence of channel output feedback. From a Shannon theoretic perspective feedback does not improve the asymptotic end-to-end performance, and separate source coding followed by…

Information Theory · Computer Science 2020-04-13 David Burth Kurka , Deniz Gündüz

Pruning Convolutional Neural Networks for Resource Efficient Inference

We propose a new formulation for pruning convolutional kernels in neural networks to enable efficient inference. We interleave greedy criteria-based pruning with fine-tuning by backpropagation - a computationally efficient procedure that…

Machine Learning · Computer Science 2017-06-12 Pavlo Molchanov , Stephen Tyree , Tero Karras , Timo Aila , Jan Kautz

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing

The recent advances of hardware technology have made the intelligent analysis equipped at the front-end with deep learning more prevailing and practical. To better enable the intelligent sensing at the front-end, instead of compressing and…

Multimedia · Computer Science 2018-09-18 Zhuo Chen , Weisi Lin , Shiqi Wang , Lingyu Duan , Alex C. Kot