Related papers: Shared Mobile-Cloud Inference for Collaborative In…

Mobile-Cloud Inference for Collaborative Intelligence

As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for deep learning model inference. Historically, the models run on mobile devices have been smaller…

Machine Learning · Computer Science 2023-06-27 Mateen Ulhaq

Towards Collaborative Intelligence Friendly Architectures for Deep Learning

Modern mobile devices are equipped with high-performance hardware resources such as graphics processing units (GPUs), making the end-side intelligent services more feasible. Even recently, specialized silicons as neural engines are being…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-04 Amir Erfan Eshratifar, Amirhossein Esmaili, Massoud Pedram

Multi-task learning with compressible features for Collaborative Intelligence

A promising way to deploy Artificial Intelligence (AI)-based services on mobile devices is to run a part of the AI model (a deep neural network) on the mobile itself, and the rest in the cloud. This is sometimes referred to as collaborative…

Multimedia · Computer Science 2019-05-17 Saeed Ranjbar Alvar, Ivan V. Bajić

Near-Lossless Deep Feature Compression for Collaborative Intelligence

Collaborative intelligence is a new paradigm for efficient deployment of deep neural networks across the mobile-cloud infrastructure. By dividing the network between the mobile and the cloud, it is possible to distribute the computational…

Image and Video Processing · Electrical Eng. & Systems 2018-06-19 Hyomin Choi, Ivan V. Bajic

Cloud-based or On-device: An Empirical Study of Mobile Deep Inference

Modern mobile applications are benefiting significantly from the advancement in deep learning, e.g., implementing real-time image recognition and conversational systems. Given a trained deep learning model, applications usually need to…

Performance · Computer Science 2019-03-01 Tian Guo

Runtime Deep Model Multiplexing for Reduced Latency and Energy Consumption Inference

We propose a learning algorithm to design a light-weight neural multiplexer that, given the input and computational resource requirements, calls the model that will consume the minimum compute resources for a successful inference. Mobile…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-09-18 Amir Erfan Eshratifar, Massoud Pedram

Combining Cloud and Mobile Computing for Machine Learning

Although the computing power of mobile devices is increasing, machine learning models are also growing in size. This trend creates problems for mobile devices due to limitations like their memory capacity and battery life. While many…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-27 Ruiqi Xu, Tianchi Zhang

MDInference: Balancing Inference Accuracy and Latency for Mobile Applications

Deep Neural Networks are allowing mobile devices to incorporate a wide range of features into user applications. However, the computational complexity of these models makes it difficult to run them effectively on resource-constrained mobile…

Performance · Computer Science 2020-04-02 Samuel S. Ogden, Tian Guo

JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services

Deep learning models are being deployed in many mobile intelligent applications. End-side services, such as intelligent personal assistants, autonomous cars, and smart home services often employ either simple local models on the mobile or…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-02-06 Amir Erfan Eshratifar, Mohammad Saeed Abrishami, Massoud Pedram

Privacy-Preserving Deep Inference for Rich User Data on The Cloud

Deep neural networks are increasingly being used in a variety of machine learning applications applied to rich user data on the cloud. However, this approach introduces a number of privacy and efficiency challenges, as the cloud operator…

Computer Vision and Pattern Recognition · Computer Science 2017-10-13 Seyed Ali Osia, Ali Shahin Shamsabadi, Ali Taheri, Kleomenis Katevas, Hamid R. Rabiee, Nicholas D. Lane, Hamed Haddadi

Collaborative Inference for AI-Empowered IoT Devices

Artificial intelligence (AI) technologies, and particularly deep learning systems, are traditionally the domain of large-scale cloud servers, which have access to high computational and energy resources. Nonetheless, in Internet-of-Things…

Signal Processing · Electrical Eng. & Systems 2022-07-26 Nir Shlezinger, Ivan V. Bajic

Distributed Inference on Mobile Edge and Cloud: A Data-Cartography based Clustering Approach

The large size of DNNs poses a significant challenge for deployment on devices with limited resources, such as mobile, edge, and IoT platforms. To address this issue, a distributed inference framework can be utilized. In this framework, a…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-12-24 Divya Jyoti Bajpai, Manjesh Kumar Hanawal

PriMask: Cascadable and Collusion-Resilient Data Masking for Mobile Cloud Inference

Mobile cloud offloading is indispensable for inference tasks based on large-scale deep models. However, transmitting privacy-rich inference data to the cloud incurs concerns. This paper presents the design of a system called PriMask, in…

Cryptography and Security · Computer Science 2022-11-15 Linshan Jiang, Qun Song, Rui Tan, Mo Li

Exploiting Local and Cloud Sensor Fusion in Intermittently Connected Sensor Networks

We consider a detection problem where sensors experience noisy measurements and intermittent communication opportunities to a centralized fusion center (or cloud). The objective of the problem is to arrive at the correct estimate of event…

Systems and Control · Electrical Eng. & Systems 2020-09-23 Michal Yemini, Stephanie Gil, Andrea Goldsmith

PrivaScissors: Enhance the Privacy of Collaborative Inference through the Lens of Mutual Information

Edge-cloud collaborative inference empowers resource-limited IoT devices to support deep learning applications without disclosing their raw data to the cloud server, thus preserving privacy. Nevertheless, prior research has shown that…

Cryptography and Security · Computer Science 2023-06-16 Lin Duan, Jingwei Sun, Yiran Chen, Maria Gorlatova

Collaborative Inference over Wireless Channels with Feature Differential Privacy

Collaborative inference among multiple wireless edge devices has the potential to significantly enhance Artificial Intelligence (AI) applications, particularly for sensing and computer vision. This approach typically involves a three-stage…

Cryptography and Security · Computer Science 2024-10-29 Mohamed Seif, Yuqi Nie, Andrea J. Goldsmith, H. Vincent Poor

Not Just Privacy: Improving Performance of Private Deep Learning in Mobile Cloud

The increasing demand for on-device deep learning services calls for a highly efficient manner to deploy deep neural networks (DNNs) on mobile devices with limited capacity. The cloud-based solution is a promising approach to enabling deep…

Machine Learning · Computer Science 2019-01-08 Ji Wang, Jianguo Zhang, Weidong Bao, Xiaomin Zhu, Bokai Cao, Philip S. Yu

A Framework for Hybrid Collective Inference in Distributed Sensor Networks

With the ever-increasing range of applications of Internet of Things (IoT) and sensor networks, challenges are emerging in various categories of classification tasks. Applications such as vehicular networking, UAV swarm coordination and…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-01 Andrew Nash, Dirk Pesch, Krishnendu Guha

Deep feature compression for collaborative object detection

Recent studies have shown that the efficiency of deep neural networks in mobile applications can be significantly improved by distributing the computational workload between the mobile device and the cloud. This paradigm, termed…

Computer Vision and Pattern Recognition · Computer Science 2018-02-13 Hyomin Choi, Ivan V. Bajic

Cloud Is Closer Than It Appears: Revisiting the Tradeoffs of Distributed Real-Time Inference

The increasing deployment of deep neural networks (DNNs) in cyber-physical systems (CPS) enhances perception fidelity, but imposes substantial computational demands on execution platforms, posing challenges to real-time control deadlines.…

Machine Learning · Computer Science 2026-05-04 Pragya Sharma, Hang Qiu, Mani Srivastava