English
Related papers

Related papers: A QoE-Aware Split Inference Accelerating Algorithm…

200 papers

Splitting the inference model between device, edge server, and cloud can improve the performance of EI greatly. Additionally, the non-orthogonal multiple access (NOMA), which is the key supporting technologies of B5G/6G, can achieve massive…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-12-27 Xin Yuan , Ning Li , Tuo Zhang , Muqing Li , Yuwen Chen , Jose Fernan Martinez Ortega , Song Guo

With the proliferation of edge AI applications, satisfying user quality of experience (QoE) requirements, such as model inference latency, has become a first class objective, as these models operate in resource constrained settings and…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-12-15 Jianli Jin , Ziyang Lin , Qianli Dong , Yi Chen , Jayanth Srinivasa , Myungjin Lee , Zhaowei Tan , Fan Lai

In 5G smart cities, edge computing is employed to provide nearby computing services for end devices, and the large-scale models (e.g., GPT and LLaMA) can be deployed at the network edge to boost the service quality. However, due to the…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-12 Zuan Xie , Yang Xu , Hongli Xu , Yunming Liao , Zhiyuan Yao

Edge computing has emerged as a popular paradigm for supporting mobile and IoT applications with low latency or high bandwidth needs. The attractiveness of edge computing has been further enhanced due to the recent availability of…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-30 Qianlin Liang , Prashant Shenoy , David Irwin

Edge AI has been recently proposed to facilitate the training and deployment of Deep Neural Network (DNN) models in proximity to the sources of data. To enable the training of large models on resource-constraint edge devices and protect…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-26 Mingjin Zhang , Jiannong Cao , Yuvraj Sahni , Xiangchun Chen , Shan Jiang

Deploying deep neural networks (DNNs) on IoT and mobile devices is a challenging task due to their limited computational resources. Thus, demanding tasks are often entirely offloaded to edge servers which can accelerate inference, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-06-20 Arian Bakhtiarnia , Nemanja Milošević , Qi Zhang , Dragana Bajović , Alexandros Iosifidis

The edge intelligence (EI) has been widely applied recently. Spliting the model between device, edge server, and cloud can improve the performance of EI greatly. The model segmentation without user mobility has been investigated deeply by…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-12-29 Xin Yuan , Ning Li , kang Wei , Wenchao Xu , Quan Chen , Hao Chen , Song Guo

Mobile edge computing pushes computationally-intensive services closer to the user to provide reduced delay due to physical proximity. This has led many to consider deploying deep learning models on the edge -- commonly known as edge…

Networking and Internet Architecture · Computer Science 2021-05-03 Nathaniel Hudson , Hana Khamfroush , Daniel E. Lucani

The inference of Neural Networks is usually restricted by the resources (e.g., computing power, memory, bandwidth) on edge devices. In addition to improving the hardware design and deploying efficient models, it is possible to aggregate the…

Machine Learning · Computer Science 2021-11-05 Jun-Liang Lin , Sheng-De Wang

The growth of Artificial Intelligence (AI) and large language models has enabled the use of Generative AI (GenAI) in cloud data centers for diverse AI-Generated Content (AIGC) tasks. Models like Stable Diffusion introduce unavoidable delays…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-07-15 Zhifei Xu , Zhiqing Tang , Jiong Lou , Zhi Yao , Xuan Xie , Tian Wang , Yinglong Wang , Weijia Jia

The recent breakthrough in artificial intelligence (AI), especially deep neural networks (DNNs), has affected every branch of science and technology. Particularly, edge AI has been envisioned as a major application scenario to provide…

Machine Learning · Computer Science 2024-10-30 Jiawei Shao , Jun Zhang

Cooperative inference in Mobile Edge Computing (MEC), achieved by deploying partitioned Deep Neural Network (DNN) models between resource-constrained user equipments (UEs) and edge servers (ESs), has emerged as a promising paradigm.…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-20 Xinrui Ye , Yanzan Sun , Dingzhu Wen , Guanjin Pan , Shunqing Zhang

Edge intelligence enables AI inference at the network edge, co-located with or near the radio access network, rather than in centralized clouds or on mobile devices. It targets low-latency, resource-constrained applications with large data…

Networking and Internet Architecture · Computer Science 2026-01-26 Jaume Anguera Peris , Joakim Jaldén

The deployment of ML models on edge devices is challenged by limited computational resources and energy availability. While split computing enables the decomposition of large neural networks (NNs) and allows partial computation on both edge…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-11-01 Daniel May , Alessandro Tundo , Shashikant Ilager , Ivona Brandic

Deep Neural Network (DNN) splitting is one of the key enablers of edge Artificial Intelligence (AI), as it allows end users to pre-process data and offload part of the computational burden to nearby Edge Cloud Servers (ECSs). This opens new…

Signal Processing · Electrical Eng. & Systems 2024-01-31 Francesco Binucci , Mattia Merluzzi , Paolo Banelli , Emilio Calvanese Strinati , Paolo Di Lorenzo

In the resource-constrained IoT-edge computing environment, Split Federated (SplitFed) learning is implemented to enhance training efficiency. This method involves each terminal device dividing its full DNN model at a designated layer into…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-23 Binbin Huang , Hailiang Zhao , Lingbin Wang , Wenzhuo Qian , Yuyu Yin , Shuiguang Deng

Mobile edge devices (e.g., AR/VR headsets) typically need to complete timely inference tasks while operating with limited on-board computing and energy resources. In this paper, we investigate the problem of collaborative inference in…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-28 Fatemeh Zahra Safaeipour , Jacob Chakareski , Morteza Hashemi

Recently, deep neural networks have been outperforming conventional machine learning algorithms in many computer vision-related tasks. However, it is not computationally acceptable to implement these models on mobile and IoT devices and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-24 Behnam Zeinali , Di Zhuang , J. Morris Chang

With the fast development of Internet of things (IoT), the fifth generation (5G) wireless networks need to provide massive connectivity of IoT devices and meet the demand for low latency. To satisfy these requirements, Non-Orthogonal…

Networking and Internet Architecture · Computer Science 2018-02-07 Abbas Kiani , Nirwan Ansari

Deep Neural Networks (DNNs) have drawn attention because of their outstanding performance on various tasks. However, deploying full-fledged DNNs in resource-constrained devices (edge, mobile, IoT) is difficult due to their large size. To…

Machine Learning · Computer Science 2023-09-19 Divya J. Bajpai , Vivek K. Trivedi , Sohan L. Yadav , Manjesh K. Hanawal
‹ Prev 1 2 3 10 Next ›