Related papers: A Multi-task Supervised Compression Model for Spli…

SC2 Benchmark: Supervised Compression for Split Computing

With the increasing demand for deep learning models on mobile devices, splitting neural network computation between the device and a more powerful edge server has become an attractive solution. However, existing split computing approaches…

Machine Learning · Computer Science · 2025-01-06 · Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

Memory- and Latency-Constrained Inference of Large Language Models via Adaptive Split Computing

Large language models (LLMs) have achieved near-human performance across diverse reasoning tasks, yet their deployment on resource-constrained Internet-of-Things (IoT) devices remains impractical due to massive parameter footprints and…

Machine Learning · Computer Science · 2025-11-07 · Mingyu Sung, Vikas Palakonda, Suhwan Im, Sunghwan Moon, Il-Min Kim, Sangseok Yun, Jae-Mo Kang

Supervised Compression for Resource-Constrained Edge Computing Systems

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and…

Computer Vision and Pattern Recognition · Computer Science · 2022-09-20 · Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

MTL-Split: Multi-Task Learning for Edge Devices using Split Computing

Split Computing (SC), where a Deep Neural Network (DNN) is intelligently split with a part of it deployed on an edge device and the rest on a remote server, is emerging as a promising approach. It allows the power of DNNs to be leveraged for…

Machine Learning · Computer Science · 2024-07-09 · Luigi Capogrosso, Enrico Fraccaroli, Samarjit Chakraborty, Franco Fummi, Marco Cristani

Split Computing for Complex Object Detectors: Challenges and Preliminary Results

Following the trends of mobile and edge computing for DNN models, an intermediate option, split computing, has been attracting attention from the research community. Previous studies empirically showed that while mobile and edge computing…

Computer Vision and Pattern Recognition · Computer Science · 2021-11-17 · Yoshitomo Matsubara, Marco Levorato

A Low-Complexity Approach to Rate-Distortion Optimized Variable Bit-Rate Compression for Split DNN Computing

Split computing has emerged as a recent paradigm for implementation of DNN-based AI workloads, wherein a DNN model is split into two parts, one of which is executed on a mobile/client device and the other on an edge-server (or cloud). Data…

Machine Learning · Computer Science · 2022-08-25 · Parual Datta, Nilesh Ahuja, V. Srinivasa Somayazulu, Omesh Tickoo

High-Efficiency Split Computing for Cooperative Edge Systems: A Novel Compressed Sensing Bottleneck

The advent of big data and AI has precipitated a demand for computational frameworks that ensure real-time performance, accuracy, and privacy. While edge computing mitigates latency and privacy concerns, its scalability is constrained by…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-04-23 · Hailin Zhong, Donglong Chen

3D Point Cloud Object Detection on Edge Devices for Split Computing

The field of autonomous driving technology is rapidly advancing, with deep learning being a key component. Particularly in the field of sensing, 3D point cloud data collected by LiDAR is utilized to run deep neural network models for 3D…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-11-05 · Taisuke Noguchi, Takuya Azumi

SplitBrain: Hybrid Data and Model Parallel Deep Learning

The recent success of deep learning applications has coincided with the wide availability of powerful computational resources for training sophisticated machine learning models with huge datasets. Nonetheless, training large models such as…

Machine Learning · Computer Science · 2022-01-03 · Farley Lai, Asim Kadav, Erik Kruus

Communication Efficient Split Learning of ViTs with Attention-based Double Compression

This paper proposes a novel communication-efficient Split Learning (SL) framework, named Attention-based Double Compression (ADC), which reduces the communication overhead required for transmitting intermediate Vision Transformers…

Machine Learning · Computer Science · 2025-09-19 · Federico Alvetreti, Jary Pomponi, Paolo Di Lorenzo, Simone Scardapane

Dynamic Encoding and Decoding of Information for Split Learning in Mobile-Edge Computing: Leveraging Information Bottleneck Theory

Split learning is a privacy-preserving distributed learning paradigm in which an ML model (e.g., a neural network) is split into two parts (i.e., an encoder and a decoder). The encoder shares a so-called latent representation, rather than raw…

Machine Learning · Computer Science · 2023-09-07 · Omar Alhussein, Moshi Wei, Arashmid Akhavain

Split Computing and Early Exiting for Deep Learning Applications: Survey and Research Challenges

Mobile devices such as smartphones and autonomous vehicles increasingly rely on deep neural networks (DNNs) to execute complex inference tasks such as image classification and speech recognition, among others. However, continuously…

Signal Processing · Electrical Eng. & Systems · 2025-01-03 · Yoshitomo Matsubara, Marco Levorato, Francesco Restuccia

Fine-tuning Multimodal Transformers on Edge: A Parallel Split Learning Approach

Multimodal transformers integrate diverse data types like images, audio, and text, advancing tasks such as audio-visual understanding and image-text retrieval; yet their high parameterization limits deployment on resource-constrained edge…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-07-10 · Timo Fudala, Vasileios Tsouvalas, Nirvana Meratnia

Communication and Computation Reduction for Split Learning using Asynchronous Training

Split learning is a promising privacy-preserving distributed learning scheme that has a low computation requirement at the edge device but has the disadvantage of high communication overhead between the edge device and the server. To reduce the…

Machine Learning · Computer Science · 2022-03-10 · Xing Chen, Jingtao Li, Chaitali Chakrabarti

Split Learning in Computer Vision for Semantic Segmentation Delay Minimization

In this paper, we propose a novel approach to minimize the inference delay in semantic segmentation using split learning (SL), tailored to the needs of real-time computer vision (CV) applications for resource-constrained devices. Semantic…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-20 · Nikos G. Evgenidis, Nikos A. Mitsiou, Sotiris A. Tegos, Panagiotis D. Diamantoulakis, George K. Karagiannidis

Communication-and-Computation Efficient Split Federated Learning: Gradient Aggregation and Resource Management

With the prevalence of Large Learning Models (LLM), Split Federated Learning (SFL), which divides a learning model into server-side and client-side models, has emerged as an appealing technology to deal with the heavy computational burden…

Distributed, Parallel, and Cluster Computing · Computer Science · 2025-01-03 · Yipeng Liang, Qimei Chen, Guangxu Zhu, Muhammad Kaleem Awan, Hao Jiang

SL-ACC: A Communication-Efficient Split Learning Framework with Adaptive Channel-wise Compression

The increasing complexity of neural networks poses a significant barrier to the deployment of distributed machine learning (ML), such as federated learning (FL), on resource-constrained devices. Split learning (SL) offers a promising…

Machine Learning · Computer Science · 2025-08-19 · Zehang Lin, Zheng Lin, Miao Yang, Jianhao Huang, Yuxin Zhang, Zihan Fang, Xia Du, Zhe Chen, Shunzhi Zhu, Wei Ni

SplitCom: Communication-efficient Split Federated Fine-tuning of LLMs via Temporal Compression

Federated fine-tuning of on-device large language models (LLMs) mitigates privacy concerns by preventing raw data sharing. However, the intensive computational and memory demands pose significant challenges for resource-constrained edge…

Networking and Internet Architecture · Computer Science · 2026-02-13 · Tao Li, Yulin Tang, Yiyang Song, Cong Wu, Xihui Liu, Pan Li, Xianhao Chen

SL-FAC: A Communication-Efficient Split Learning Framework with Frequency-Aware Compression

The growing complexity of neural networks hinders the deployment of distributed machine learning on resource-constrained devices. Split learning (SL) offers a promising solution by partitioning the large model and offloading the primary…

Machine Learning · Computer Science · 2026-04-09 · Zehang Lin, Miao Yang, Haihan Zhu, Zheng Lin, Jianhao Huang, Jing Yang, Guangjin Pan, Dianxin Luan, Zihan Fang, Shunzhi Zhu, Wei Ni, John Thompson

SplitNets: Designing Neural Architectures for Efficient Distributed Computing on Head-Mounted Systems

We design deep neural networks (DNNs) and corresponding network splittings to distribute the DNNs' workload to camera sensors and a centralized aggregator on head-mounted devices to meet system performance targets in inference accuracy and…

Machine Learning · Computer Science · 2022-04-12 · Xin Dong, Barbara De Salvo, Meng Li, Chiao Liu, Zhongnan Qu, H. T. Kung, Ziyun Li