Related papers: Optimising Resource Management for Embedded Machin…

Dynamic DNNs Meet Runtime Resource Management on Mobile and Embedded Platforms

Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to low latency and better privacy. However, efficient deployment on these platforms is challenging due to the intensive computation and…

Hardware Architecture · Computer Science 2022-06-08 Lei Xun, Bashir M. Al-Hashimi, Jonathon Hare, Geoff V. Merrett

Resource-Efficient Neural Networks for Embedded Systems

While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation, and the vision of the Internet of Things fuel the interest in resource-efficient approaches. These approaches aim for a carefully…

Machine Learning · Statistics 2024-04-09 Wolfgang Roth, Günther Schindler, Bernhard Klein, Robert Peharz, Sebastian Tschiatschek, Holger Fröning, Franz Pernkopf, Zoubin Ghahramani

Efficient and Robust Machine Learning for Real-World Systems

While machine learning is traditionally a resource intensive task, embedded systems, autonomous navigation and the vision of the Internet-of-Things fuel the interest in resource efficient approaches. These approaches require a carefully…

Machine Learning · Computer Science 2018-12-07 Franz Pernkopf, Wolfgang Roth, Matthias Zoehrer, Lukas Pfeifenberger, Guenther Schindler, Holger Froening, Sebastian Tschiatschek, Robert Peharz, Matthew Mattina, Zoubin Ghahramani

Dynamic DNNs and Runtime Management for Efficient Inference on Mobile/Embedded Devices

Deep neural network (DNN) inference is increasingly being executed on mobile and embedded platforms due to several key advantages in latency, privacy and always-on availability. However, due to limited computing resources, efficient DNN…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Lei Xun, Jonathon Hare, Geoff V. Merrett

Compilation and Optimizations for Efficient Machine Learning on Embedded Systems

Deep Neural Networks (DNNs) have achieved great success in a variety of machine learning (ML) applications, delivering high-quality inferencing solutions in computer vision, natural language processing, and virtual reality, etc. However,…

Machine Learning · Computer Science 2022-08-29 Xiaofan Zhang, Yao Chen, Cong Hao, Sitao Huang, Yuhong Li, Deming Chen

GOGH: Correlation-Guided Orchestration of GPUs in Heterogeneous Clusters

The growing demand for computational resources in machine learning has made efficient resource allocation a critical challenge, especially in heterogeneous hardware clusters where devices vary in capability, age, and energy efficiency.…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-10-20 Ahmad Raeisi, Mahdi Dolati, Sina Darabi, Sadegh Talebi, Patrick Eugster, Ahmad Khonsari

Research on Heterogeneous Computation Resource Allocation based on Data-driven Method

The rapid development of the mobile Internet and the Internet of Things is leading to a diversification of user devices and the emergence of new mobile applications on a regular basis. Such applications include those that are…

Computational Engineering, Finance, and Science · Computer Science 2024-08-13 Xirui Tang, Zeyu Wang, Xiaowei Cai, Honghua Su, Changsong Wei

Performance landscape of resource-constrained platforms targeting DNNs

Over the recent years, a significant number of complex, deep neural networks have been developed for a variety of applications including speech and face recognition, computer vision in the areas of health-care, automatic translation, image…

Performance · Computer Science 2021-11-04 Panagiotis Miliadis, Christos-Savvas Bouganis, Dionisios Pnevmatikatos

A Note on Latency Variability of Deep Neural Networks for Mobile Inference

Running deep neural network (DNN) inference on mobile devices, i.e., mobile inference, has become a growing trend, making inference less dependent on network connections and keeping private data locally. The prior studies on optimizing DNNs…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-04 Luting Yang, Bingqian Lu, Shaolei Ren

Embedded Distributed Inference of Deep Neural Networks: A Systematic Review

Embedded distributed inference of Neural Networks has emerged as a promising approach for deploying machine-learning models on resource-constrained devices in an efficient and scalable manner. The inference task is distributed across a…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-05-07 Federico Nicolás Peccia, Oliver Bringmann

Automated Deep Neural Network Inference Partitioning for Distributed Embedded Systems

Distributed systems can be found in various applications, e.g., in robotics or autonomous driving, to achieve higher flexibility and robustness. Thereby, data flow centric applications such as Deep Neural Network (DNN) inference benefit…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-14 Fabian Kreß, El Mahdi El Annabi, Tim Hotfilter, Julian Hoefer, Tanja Harbaum, Juergen Becker

Optimizing Streaming Parallelism on Heterogeneous Many-Core Architectures: A Machine Learning Based Approach

This article presents an automatic approach to quickly derive a good solution for hardware resource partition and task granularity for task-based parallel applications on heterogeneous many-core architectures. Our approach employs a…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-10 Peng Zhang, Jianbin Fang, Canqun Yang, Chun Huang, Tao Tang, Zheng Wang

Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls

There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, the…

Machine Learning · Computer Science 2024-05-06 Sicong Liu, Wentao Zhou, Zimu Zhou, Bin Guo, Minfan Wang, Cheng Fang, Zheng Lin, Zhiwen Yu

Towards Implementing Energy-aware Data-driven Intelligence for Smart Health Applications on Mobile Platforms

Recent breakthrough technological progressions of powerful mobile computing resources such as low-cost mobile GPUs along with cutting-edge, open-source software architectures have enabled high-performance deep learning on mobile platforms.…

Machine Learning · Computer Science 2023-02-02 G. Dumindu Samaraweera, Hung Nguyen, Hadi Zanddizari, Behnam Zeinali, J. Morris Chang

Designing Interpretable Approximations to Deep Reinforcement Learning

In an ever expanding set of research and application areas, deep neural networks (DNNs) set the bar for algorithm performance. However, depending upon additional constraints such as processing power and execution time limits, or…

Machine Learning · Computer Science 2021-06-22 Nathan Dahlin, Krishna Chaitanya Kalagarla, Nikhil Naik, Rahul Jain, Pierluigi Nuzzo

EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices

In recent years, advances in deep learning have resulted in unprecedented leaps in diverse tasks spanning from speech and object recognition to context awareness and health monitoring. As a result, an increasing number of AI-enabled…

Machine Learning · Computer Science 2019-05-20 Mario Almeida, Stefanos Laskaridis, Ilias Leontiadis, Stylianos I. Venieris, Nicholas D. Lane

Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)

Almost in every heavily computation-dependent application, from 6G communication systems to autonomous driving platforms, a large portion of computing should be near to the client side. Edge computing (AI at Edge) in mobile devices is one…

Hardware Architecture · Computer Science 2024-07-29 Seyed Nima Omidsajedi, Rekha Reddy, Jianming Yi, Jan Herbst, Christoph Lipps, Hans Dieter Schotten

Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms

With the rapid development of Deep Learning, more and more applications on the cloud and edge tend to utilize large DNN (Deep Neural Network) models for improved task execution efficiency as well as decision-making quality. Due to memory…

Machine Learning · Computer Science 2024-07-02 Jingran Shen, Nikos Tziritas, Georgios Theodoropoulos

Performance Analysis and Characterization of Training Deep Learning Models on Mobile Devices

Training deep learning models on mobile devices recently becomes possible, because of increasing computation power on mobile hardware and the advantages of enabling high user experiences. Most of the existing work on machine learning at…

Machine Learning · Computer Science 2019-09-10 Jie Liu, Jiawen Liu, Wan Du, Dong Li

Enabling Deep Learning on Edge Devices

Deep neural networks (DNNs) have succeeded in many different perception tasks, e.g., computer vision, natural language processing, reinforcement learning, etc. The high-performed DNNs heavily rely on intensive resource consumption. For…

Machine Learning · Computer Science 2022-10-10 Zhongnan Qu