Related papers: Multi-task learning with compressible features for…

Deep feature compression for collaborative object detection

Recent studies have shown that the efficiency of deep neural networks in mobile applications can be significantly improved by distributing the computational workload between the mobile device and the cloud. This paradigm, termed…

Computer Vision and Pattern Recognition · Computer Science 2018-02-13 Hyomin Choi, Ivan V. Bajic

Near-Lossless Deep Feature Compression for Collaborative Intelligence

Collaborative intelligence is a new paradigm for efficient deployment of deep neural networks across the mobile-cloud infrastructure. By dividing the network between the mobile and the cloud, it is possible to distribute the computational…

Image and Video Processing · Electrical Eng. & Systems 2018-06-19 Hyomin Choi, Ivan V. Bajic

Bit Allocation for Multi-Task Collaborative Intelligence

Recent studies have shown that collaborative intelligence (CI) is a promising framework for deployment of Artificial Intelligence (AI)-based services on mobile devices. In CI, a deep neural network is split between the mobile device and the…

Machine Learning · Computer Science 2020-02-18 Saeed Ranjbar Alvar, Ivan V. Bajić

Towards Collaborative Intelligence Friendly Architectures for Deep Learning

Modern mobile devices are equipped with high-performance hardware resources such as graphics processing units (GPUs), making the end-side intelligent services more feasible. Even recently, specialized silicons as neural engines are being…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-04 Amir Erfan Eshratifar, Amirhossein Esmaili, Massoud Pedram

Mobile-Cloud Inference for Collaborative Intelligence

As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for deep learning model inference. Historically, the models run on mobile devices have been smaller…

Machine Learning · Computer Science 2023-06-27 Mateen Ulhaq

Shared Mobile-Cloud Inference for Collaborative Intelligence

As AI applications for mobile devices become more prevalent, there is an increasing need for faster execution and lower energy consumption for neural model inference. Historically, the models run on mobile devices have been smaller and…

Artificial Intelligence · Computer Science 2020-02-04 Mateen Ulhaq, Ivan V. Bajić

Intermediate Deep Feature Compression: the Next Battlefield of Intelligent Sensing

The recent advances of hardware technology have made the intelligent analysis equipped at the front-end with deep learning more prevailing and practical. To better enable the intelligent sensing at the front-end, instead of compressing and…

Multimedia · Computer Science 2018-09-18 Zhuo Chen, Weisi Lin, Shiqi Wang, Lingyu Duan, Alex C. Kot

Compressing Representations for Embedded Deep Learning

Despite recent advances in architectures for mobile devices, deep learning computational requirements remain prohibitive for most embedded devices. To address that issue, we envision sharing the computational costs of inference between…

Machine Learning · Computer Science 2019-11-26 Juliano S. Assine, Alan Godoy, Eduardo Valle

Supervised Compression for Resource-Constrained Edge Computing Systems

There has been much interest in deploying deep learning algorithms on low-powered devices, including smartphones, drones, and medical sensors. However, full-scale deep neural networks are often too resource-intensive in terms of energy and…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Yoshitomo Matsubara, Ruihan Yang, Marco Levorato, Stephan Mandt

Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency

In recent years, the integration of artificial intelligence (AI) and cloud computing has emerged as a promising avenue for addressing the growing computational demands of AI applications. This paper presents a comprehensive study of…

Machine Learning · Computer Science 2023-04-28 Neelesh Mungoli

BottleNet: A Deep Learning Architecture for Intelligent Mobile Cloud Computing Services

Recent studies have shown the latency and energy consumption of deep neural networks can be significantly improved by splitting the network between the mobile device and cloud. This paper introduces a new deep learning architecture, called…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-02-05 Amir Erfan Eshratifar, Amirhossein Esmaili, Massoud Pedram

Toward Resource-Efficient Collaboration of Large AI Models in Mobile Edge Networks

The collaboration of large artificial intelligence (AI) models in mobile edge networks has emerged as a promising paradigm to meet the growing demand for intelligent services at the network edge. By enabling multiple devices to…

Networking and Internet Architecture · Computer Science 2026-02-17 Peichun Li, Liping Qian, Dusit Niyato, Shiwen Mao, Yuan Wu

Collaborative Inference for AI-Empowered IoT Devices

Artificial intelligence (AI) technologies, and particularly deep learning systems, are traditionally the domain of large-scale cloud servers, which have access to high computational and energy resources. Nonetheless, in Internet-of-Things…

Signal Processing · Electrical Eng. & Systems 2022-07-26 Nir Shlezinger, Ivan V. Bajic

Auto-tuning Neural Network Quantization Framework for Collaborative Inference Between the Cloud and Edge

Recently, deep neural networks (DNNs) have been widely applied in mobile intelligent applications. The inference for the DNNs is usually performed in the cloud. However, it leads to a large overhead of transmitting data via wireless…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-19 Guangli Li, Lei Liu, Xueying Wang, Xiao Dong, Peng Zhao, Xiaobing Feng

Fully-adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Multi-task learning aims to improve generalization performance of multiple prediction tasks by appropriately sharing relevant information across them. In the context of deep neural networks, this idea is often realized by hand-designed…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris

Federated learning compression designed for lightweight communications

Federated Learning (FL) is a promising distributed method for edge-level machine learning, particularly for privacy-sensitive applications such as those in military and medical domains, where client data cannot be shared or transferred to a…

Machine Learning · Computer Science 2024-06-27 Lucas Grativol Ribeiro, Mathieu Leonardon, Guillaume Muller, Virginie Fresse, Matthieu Arzel

Combining Cloud and Mobile Computing for Machine Learning

Although the computing power of mobile devices is increasing, machine learning models are also growing in size. This trend creates problems for mobile devices due to limitations like their memory capacity and battery life. While many…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-02-27 Ruiqi Xu, Tianchi Zhang

Compressive Feature Selection for Remote Visual Multi-Task Inference

Deep models produce a number of features in each internal layer. A key problem in applications such as feature compression for remote inference is determining how important each feature is for the task(s) performed by the model. The problem…

Image and Video Processing · Electrical Eng. & Systems 2024-05-16 Saeed Ranjbar Alvar, Ivan V. Bajić

ESAI: Efficient Split Artificial Intelligence via Early Exiting Using Neural Architecture Search

Recently, deep neural networks have been outperforming conventional machine learning algorithms in many computer vision-related tasks. However, it is not computationally acceptable to implement these models on mobile and IoT devices and the…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-06-24 Behnam Zeinali, Di Zhuang, J. Morris Chang

Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets

Machine Learning and Artificial Intelligence (ML/AI) techniques have become increasingly prevalent in high performance computing (HPC). However, these methods depend on vast volumes of floating point data for training and validation which need…

Machine Learning · Computer Science 2024-03-26 Robert Underwood, Jon C. Calhoun, Sheng Di, Franck Cappello