English
Related papers

Related papers: On Accelerating Edge AI: Optimizing Resource-Const…

200 papers

The deep neural network (DNN) based AI applications on the edge require both low-cost computing platforms and high-quality services. However, the limited memory, computing resources, and power budget of the edge devices constrain the…

Machine Learning · Computer Science 2021-05-14 Yao Chen , Cole Hawkins , Kaiqi Zhang , Zheng Zhang , Cong Hao

Large Language Models (LLMs) have revolutionized many areas of artificial intelligence (AI), but their substantial resource requirements limit their deployment on mobile and edge devices. This survey paper provides a comprehensive overview…

Machine Learning · Computer Science 2025-09-03 Sanjay Surendranath Girija , Shashank Kapoor , Lakshit Arora , Dipen Pradhan , Aman Raj , Ankit Shetgaonkar

Edge computing aims to enable edge devices, such as IoT devices, to process data locally instead of relying on the cloud. However, deep learning techniques like computer vision and natural language processing can be computationally…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Oshin Dutta , Tanu Kanvar , Sumeet Agarwal

Artificial intelligence and machine learning models deployed on edge devices, e.g., for quality control in Additive Manufacturing (AM), are frequently small in size. Such models usually have to deliver highly accurate results within a short…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-26 Marcel Aach , Cyril Blanc , Andreas Lintermann , Kurt De Grave

Edge AI, which brings artificial intelligence to the edge of the network for real-time processing and decision-making, has emerged as a transformative technology across various applications. However, the deployment of Edge AI systems faces…

Signal Processing · Electrical Eng. & Systems 2025-11-11 Zhiyuan Zhai , Wei Ni , Xin Wang

The rapid proliferation of computing domains relying on Internet of Things (IoT) devices has created a pressing need for efficient and accurate deep-learning (DL) models that can run on low-power devices. However, traditional DL models tend…

Recent progress in deep learning has been driven by increasingly larger models. However, their computational and energy demands have grown proportionally, creating significant barriers to their deployment and to a wider adoption of deep…

Machine Learning · Computer Science 2025-09-16 Pedro Savarese

Deep neural networks (DNNs) have achieved unprecedented success in the field of artificial intelligence (AI), including computer vision, natural language processing and speech recognition. However, their superior performance comes at the…

Machine Learning · Computer Science 2022-04-26 Han Cai , Ji Lin , Yujun Lin , Zhijian Liu , Haotian Tang , Hanrui Wang , Ligeng Zhu , Song Han

Since emerging edge applications such as Internet of Things (IoT) analytics and augmented reality have tight latency constraints, hardware AI accelerators have been recently proposed to speed up deep neural network (DNN) inference run by…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-20 Qianlin Liang , Walid A. Hanafy , Ahmed Ali-Eldin , Prashant Shenoy

Training task in classical machine learning models, such as deep neural networks, is generally implemented at a remote cloud center for centralized learning, which is typically time-consuming and resource-hungry. It also incurs serious…

Machine Learning · Computer Science 2020-10-27 Jinke Ren , Guanding Yu , Guangyao Ding

Computing-in-memory (CIM) is an emerging computing paradigm, offering noteworthy potential for accelerating neural networks with high parallelism, low latency, and energy efficiency compared to conventional von Neumann architectures.…

Neural and Evolutionary Computing · Computer Science 2024-09-30 Kam Chi Loong , Shihao Han , Sishuo Liu , Ning Lin , Zhongrui Wang

Recent advances in Artificial Intelligence (AI) on the Internet of Things (IoT)-enabled network edge has realized edge intelligence in several applications such as smart agriculture, smart hospitals, and smart factories by enabling…

Machine Learning · Computer Science 2024-01-18 Muhammad Zawish , Steven Davy , Lizy Abraham

The surging development of Artificial Intelligence-Generated Content (AIGC) marks a transformative era of the content creation and production. Edge servers promise attractive benefits, e.g., reduced service delay and backhaul traffic load,…

Machine Learning · Computer Science 2024-09-10 Yuxin Liang , Peng Yang , Yuanyuan He , Feng Lyu

Artificial intelligence have contributed to advancements across various industries. However, the rapid growth of artificial intelligence technologies also raises concerns about their environmental impact, due to associated carbon footprints…

Image and Video Processing · Electrical Eng. & Systems 2024-05-28 Szymon Mazurek , Monika Pytlarz , Sylwia Malec , Alessandro Crimi

Neural Architecture Search (NAS) is quickly becoming the go-to approach to optimize the structure of Deep Learning (DL) models for complex tasks such as Image Classification or Object Detection. However, many other relevant applications of…

Machine learning at the edge offers great benefits such as increased privacy and security, low latency, and more autonomy. However, a major challenge is that many devices, in particular edge devices, have very limited memory, weak…

Machine Learning · Computer Science 2019-09-05 Yang Li , Thomas Strohmer

Designing deep networks that meet strict latency and accuracy constraints on edge accelerators increasingly relies on hardware-aware optimization, including neural architecture search (NAS) guided by device-level metrics. Yet most…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Parampuneet Kaur Thind , Vaibhav Katturu , Giacomo Zema , Roberto Del Prete

Optimizing resource utilization in target platforms is key to achieving high performance during DNN inference. While optimizations have been proposed for inference latency, memory footprint, and energy consumption, prior hardware-aware…

Machine Learning · Computer Science 2022-03-24 Ahmet Caner Yüzügüler , Nikolaos Dimitriadis , Pascal Frossard

Emerging research in edge devices and micro-controller units (MCU) enables on-device computation of Deep Learning Training and Inferencing tasks. More recently, contemporary trends focus on making the Deep Neural Net (DNN) Models runnable…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-01-30 Ziliang Zhang

Training time budget and size of the dataset are among the factors affecting the performance of a Deep Neural Network (DNN). This paper shows that Neural Architecture Search (NAS), Hyper Parameters Optimization (HPO), and Data Augmentation…

Machine Learning · Computer Science 2023-01-24 Mahdi Zolnouri , Dounia Lakhmiri , Christophe Tribes , Eyyüb Sari , Sébastien Le Digabel
‹ Prev 1 2 3 10 Next ›