Related papers: Joint Device-Edge Inference over Wireless Links wi…

Attention-based Feature Compression for CNN Inference Offloading in Edge Computing

This paper studies the computational offloading of CNN inference in device-edge co-inference systems. Inspired by the emerging paradigm semantic communication, we propose a novel autoencoder-based CNN architecture (AECNN), for effective…

Computer Vision and Pattern Recognition · Computer Science 2023-02-13 Nan Li , Alexandros Iosifidis , Qi Zhang

Deep Joint Source-Channel Coding for Multi-Task Network

Multi-task learning (MTL) is an efficient way to improve the performance of related tasks by sharing knowledge. However, most existing MTL networks run on a single end and are not suitable for collaborative intelligence (CI) scenarios. In…

Computer Vision and Pattern Recognition · Computer Science 2021-11-03 Mengyang Wang , Zhicong Zhang , Jiahui Li , Mengyao Ma , Xiaopeng Fan

Bayes-Split-Edge: Bayesian Optimization for Constrained Collaborative Inference in Wireless Edge Systems

Mobile edge devices (e.g., AR/VR headsets) typically need to complete timely inference tasks while operating with limited on-board computing and energy resources. In this paper, we investigate the problem of collaborative inference in…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-28 Fatemeh Zahra Safaeipour , Jacob Chakareski , Morteza Hashemi

Successive Refinement of Images with Deep Joint Source-Channel Coding

We introduce deep learning based communication methods for successive refinement of images over wireless channels. We present three different strategies for progressive image transmission with deep JSCC, with different…

Information Theory · Computer Science 2019-05-30 David Burth Kurka , Deniz Gunduz

Group Fisher Pruning for Practical Network Compression

Network compression has been widely studied since it is able to reduce the memory and computation cost during inference. However, previous methods seldom deal with complicated structures like residual connections, group/depth-wise…

Computer Vision and Pattern Recognition · Computer Science 2021-08-03 Liyang Liu , Shilong Zhang , Zhanghui Kuang , Aojun Zhou , Jing-Hao Xue , Xinjiang Wang , Yimin Chen , Wenming Yang , Qingmin Liao , Wayne Zhang

Pruned Convolutional Attention Network Based Wideband Spectrum Sensing with Sub-Nyquist Sampling

Wideband spectrum sensing (WSS) is critical for orchestrating multitudinous wireless transmissions via spectrum sharing, but may incur excessive costs of hardware, power and computation due to the high sampling rate. In this article, a deep…

Signal Processing · Electrical Eng. & Systems 2024-12-03 Peihao Dong , Jibin Jia , Shen Gao , Fuhui Zhou , Qihui Wu

Supervised Compression for Resource-Constrained Edge Computing Systems

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Yoshitomo Matsubara , Ruihan Yang , Marco Levorato , Stephan Mandt

FINED: Fast Inference Network for Edge Detection

In this paper, we address the design of lightweight deep learning-based edge detection. The deep learning technology offers a significant improvement on the edge detection accuracy. However, typical neural network designs have very high…

Computer Vision and Pattern Recognition · Computer Science 2020-12-16 Jan Kristanto Wibisono , Hsueh-Ming Hang

DistrEdge: Speeding up Convolutional Neural Network Inference on Distributed Edge Devices

As the number of edge devices with computing resources (e.g., embedded GPUs, mobile phones, and laptops) increases, recent studies demonstrate that it can be beneficial to collaboratively run convolutional neural network (CNN) inference on…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-02-09 Xueyu Hou , Yongjie Guan , Tao Han , Ning Zhang

Pruning by Explaining: A Novel Criterion for Deep Neural Network Pruning

The success of convolutional neural networks (CNNs) in various applications is accompanied by a significant increase in computation and parameter storage costs. Recent efforts to reduce these overheads involve pruning and compressing the…

Machine Learning · Computer Science 2021-03-15 Seul-Ki Yeom , Philipp Seegerer , Sebastian Lapuschkin , Alexander Binder , Simon Wiedemann , Klaus-Robert Müller , Wojciech Samek

Integrated Sensing-Communication-Computation for Over-the-Air Edge AI Inference

Edge-device co-inference refers to deploying well-trained artificial intelligent (AI) models at the network edge under the cooperation of devices and edge servers for providing ambient intelligent services. For enhancing the utilization of…

Information Theory · Computer Science 2023-08-15 Zeming Zhuang , Dingzhu Wen , Yuanming Shi , Guangxu Zhu , Sheng Wu , Dusit Niyato

Differentiable Transportation Pruning

Deep learning algorithms are increasingly employed at the edge. However, edge devices are resource constrained and thus require efficient deployment of deep neural networks. Pruning methods are a key tool for edge deployment as they can…

Computer Vision and Pattern Recognition · Computer Science 2023-08-01 Yunqiang Li , Jan C. van Gemert , Torsten Hoefler , Bert Moons , Evangelos Eleftheriou , Bram-Ernst Verhoef

Distributed Image Transmission using Deep Joint Source-Channel Coding

We study the problem of deep joint source-channel coding (D-JSCC) for correlated image sources, where each source is transmitted through a noisy independent channel to the common receiver. In particular, we consider a pair of images…

Information Theory · Computer Science 2022-01-26 Sixian Wang , Ke Yang , Jincheng Dai , Kai Niu

The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks

The growing demand for large artificial intelligence model (LAIM) services is driving a paradigm shift from traditional cloud-based inference to edge-based inference for low-latency, privacy-preserving applications. In particular,…

Machine Learning · Computer Science 2025-05-15 Zhonghao Lyu , Ming Xiao , Jie Xu , Mikael Skoglund , Marco Di Renzo

Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks

As a paradigm of distributed machine learning, federated learning typically requires all edge devices to train a complete model locally. However, with the increasing scale of artificial intelligence models, the limited resources on edge…

Machine Learning · Computer Science 2024-12-11 Junhe Zhang , Wanli Ni , Dongyu Wang

Network Pruning for Low-Rank Binary Indexing

Pruning is an efficient model compression technique to remove redundancy in the connectivity of deep neural networks (DNNs). Computations using sparse matrices obtained by pruning parameters, however, exhibit vastly different parallelism…

Machine Learning · Computer Science 2019-05-15 Dongsoo Lee , Se Jung Kwon , Byeongwook Kim , Parichay Kapoor , Gu-Yeon Wei

Learning-Based Interface for Semantic Communication with Bit Importance Awareness

Joint source-channel coding (JSCC) is an effective approach for semantic communication. However, current JSCC methods are difficult to integrate with existing communication network architectures, where application and network providers are…

Information Theory · Computer Science 2025-07-18 Wenzheng Kong , Wenyi Zhang

Distilling with Performance Enhanced Students

The task of accelerating large neural networks on general purpose hardware has, in recent years, prompted the use of channel pruning to reduce network size. However, the efficacy of pruning based approaches has since been called into…

Machine Learning · Statistics 2019-03-08 Jack Turner , Elliot J. Crowley , Valentin Radu , José Cano , Amos Storkey , Michael O'Boyle

Automated Pruning for Deep Neural Network Compression

In this work we present a method to improve the pruning step of the current state-of-the-art methodology to compress neural networks. The novelty of the proposed pruning technique is in its differentiability, which allows pruning to be…

Computer Vision and Pattern Recognition · Computer Science 2019-01-08 Franco Manessi , Alessandro Rozza , Simone Bianco , Paolo Napoletano , Raimondo Schettini

Efficient Inference of CNNs via Channel Pruning

The deployment of Convolutional Neural Networks (CNNs) on resource constrained platforms such as mobile devices and embedded systems has been greatly hindered by their high implementation cost, and thus motivated a lot research interest in…

Computer Vision and Pattern Recognition · Computer Science 2019-08-12 Boyu Zhang , Azadeh Davoodi , Yu Hen Hu