English
Related papers

Related papers: Towards Collaborative Intelligence Friendly Archit…

200 papers

As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for deep learning model inference. Historically, the models run on mobile devices have been smaller…

Machine Learning · Computer Science 2023-06-27 Mateen Ulhaq

Deep learning models are being deployed in many mobile intelligent applications. End-side services, such as intelligent personal assistants, autonomous cars, and smart home services often employ either simple local models on the mobile or…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-06 Amir Erfan Eshratifar , Mohammad Saeed Abrishami , Massoud Pedram

A promising way to deploy Artificial Intelligence (AI)-based services on mobile devices is to run a part of the AI model (a deep neural network) on the mobile itself, and the rest in the cloud. This is sometimes referred to as collaborative…

Multimedia · Computer Science 2019-05-17 Saeed Ranjbar Alvar , Ivan V. Bajić

Recent studies have shown the latency and energy consumption of deep neural networks can be significantly improved by splitting the network between the mobile device and cloud. This paper introduces a new deep learning architecture, called…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-05 Amir Erfan Eshratifar , Amirhossein Esmaili , Massoud Pedram

Modern mobile applications are benefiting significantly from the advancement in deep learning, e.g., implementing real-time image recognition and conversational system. Given a trained deep learning model, applications usually need to…

Performance · Computer Science 2019-03-01 Tian Guo

Collaborative intelligence is a new paradigm for efficient deployment of deep neural networks across the mobile-cloud infrastructure. By dividing the network between the mobile and the cloud, it is possible to distribute the computational…

Image and Video Processing · Electrical Eng. & Systems 2018-06-19 Hyomin Choi , Ivan V. Bajic

As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for neural model inference. Historically, the models run on mobile devices have been smaller and…

Artificial Intelligence · Computer Science 2020-02-04 Mateen Ulhaq , Ivan V. Bajić

We propose a learning algorithm to design a light-weight neural multiplexer that given the input and computational resource requirements, calls the model that will consume the minimum compute resources for a successful inference. Mobile…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-09-18 Amir Erfan Eshratifar , Massoud Pedram

Recent studies have shown that the efficiency of deep neural networks in mobile applications can be significantly improved by distributing the computational workload between the mobile device and the cloud. This paradigm, termed…

Computer Vision and Pattern Recognition · Computer Science 2018-02-13 Hyomin Choi , Ivan V. Bajic

Recently, deep neural networks have been outperforming conventional machine learning algorithms in many computer vision-related tasks. However, it is not computationally acceptable to implement these models on mobile and IoT devices and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-24 Behnam Zeinali , Di Zhuang , J. Morris Chang

Large language models (LLMs) are transforming society, powering applications from smartphone assistants to autonomous driving. Yet cloud-based LLM services alone cannot serve a growing class of applications, including those operating under…

Signal Processing · Electrical Eng. & Systems 2026-05-12 Liangqi Yuan , Wenzhi Fang , Shiqiang Wang , H. Vincent Poor , Christopher G. Brinton

Deep Neural Networks are allowing mobile devices to incorporate a wide range of features into user applications. However, the computational complexity of these models makes it difficult to run them effectively on resource-constrained mobile…

Performance · Computer Science 2020-04-02 Samuel S. Ogden , Tian Guo

The collaboration of large artificial intelligence (AI) models in mobile edge networks has emerged as a promising paradigm to meet the growing demand for intelligent services at the network edge. By enabling multiple devices to…

Networking and Internet Architecture · Computer Science 2026-02-17 Peichun Li , Liping Qian , Dusit Niyato , Shiwen Mao , Yuan Wu

Modern mobile devices have access to a wealth of data suitable for learning models, which in turn can greatly improve the user experience on the device. For example, language models can improve speech recognition and text entry, and image…

Machine Learning · Computer Science 2023-01-30 H. Brendan McMahan , Eider Moore , Daniel Ramage , Seth Hampson , Blaise Agüera y Arcas

Deploying deep neural networks on mobile devices is increasingly important but remains challenging due to limited computing resources. On the other hand, their unified memory architecture and narrower gap between CPU and GPU performance…

Machine Learning · Computer Science 2026-02-20 Zhuojin Li , Marco Paolieri , Leana Golubchik

Today's mobile applications are increasingly leveraging deep neural networks to provide novel features, such as image and speech recognitions. To use a pre-trained deep neural network, mobile developers can either host it in a cloud server,…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-12 Samuel S. Ogden , Tian Guo

Although the computing power of mobile devices is increasing, machine learning models are also growing in size. This trend creates problems for mobile devices due to limitations like their memory capacity and battery life. While many…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-27 Ruiqi Xu , Tianchi Zhang

This paper presents AppealNet, a novel edge/cloud collaborative architecture that runs deep learning (DL) tasks more efficiently than state-of-the-art solutions. For a given input, AppealNet accurately predicts on-the-fly whether it can be…

Machine Learning · Computer Science 2021-11-29 Min Li , Yu Li , Ye Tian , Li Jiang , Qiang Xu

Recently, deep neural networks (DNNs) have been widely applied in mobile intelligent applications. The inference for the DNNs is usually performed in the cloud. However, it leads to a large overhead of transmitting data via wireless…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-19 Guangli Li , Lei Liu , Xueying Wang , Xiao Dong , Peng Zhao , Xiaobing Feng

Recent breakthrough technological progressions of powerful mobile computing resources such as low-cost mobile GPUs along with cutting-edge, open-source software architectures have enabled high-performance deep learning on mobile platforms.…

Machine Learning · Computer Science 2023-02-02 G. Dumindu Samaraweera , Hung Nguyen , Hadi Zanddizari , Behnam Zeinali , J. Morris Chang
‹ Prev 1 2 3 10 Next ›