English
Related papers

Related papers: CoCoPIE XGen: A Full-Stack AI-Oriented Optimizing …

200 papers

Assuming hardware is the major constraint for enabling real-time mobile intelligence, the industry has mainly dedicated their efforts to developing specialized hardware accelerators for machine learning and inference. This article…

Machine Learning · Computer Science 2020-05-18 Shaoshan Liu , Bin Ren , Xipeng Shen , Yanzhi Wang

Deep neural networks (DNNs) have been increasingly deployed on and integrated with edge devices, such as mobile phones, drones, robots and wearables. To run DNN inference directly on edge devices (a.k.a. edge inference) with a satisfactory…

Machine Learning · Computer Science 2020-09-18 Bingqian Lu , Jianyi Yang , Shaolei Ren

Almost in every heavily computation-dependent application, from 6G communication systems to autonomous driving platforms, a large portion of computing should be near to the client side. Edge computing (AI at Edge) in mobile devices is one…

Hardware Architecture · Computer Science 2024-07-29 Seyed Nima Omidsajedi , Rekha Reddy , Jianming Yi , Jan Herbst , Christoph Lipps , Hans Dieter Schotten

The unprecedented performance of deep neural networks (DNNs) has led to large strides in various Artificial Intelligence (AI) inference tasks, such as object and speech recognition. Nevertheless, deploying such AI models across commodity…

Machine Learning · Computer Science 2021-06-30 Stylianos I. Venieris , Ioannis Panopoulos , Ilias Leontiadis , Iakovos S. Venieris

As the backbone technology of machine learning, deep neural networks (DNNs) have have quickly ascended to the spotlight. Running DNNs on resource-constrained mobile devices is, however, by no means trivial, since it incurs high performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-31 En Li , Zhi Zhou , Xu Chen

The rapid advancement of deep learning has catalyzed the development of novel IoT applications, which often deploy pre-trained deep neural network (DNN) models across multiple edge devices for collaborative inference.

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-24 Runhua Zhang , Hongxu Jiang , Jinkun Geng , Yuhang Ma , Chenhui Zhu , Haojie Wang

Deep neural networks (DNNs) have been ubiquitously applied in many applications, and accelerators are emerged as an enabler to support the fast and efficient inference tasks of these applications. However, to achieve high model coverage…

Machine Learning · Computer Science 2021-05-10 Zhi Chen , Cody Hao Yu , Trevor Morris , Jorn Tuyls , Yi-Hsiang Lai , Jared Roesch , Elliott Delaye , Vin Sharma , Yida Wang

As artificial intelligence (AI) applications continue to expand in next-generation networks, there is a growing need for deep neural network (DNN) models. Although DNN models deployed at the edge are promising for providing AI as a service…

Networking and Internet Architecture · Computer Science 2024-08-22 Alireza Maleki , Hamed Shah-Mansouri , Babak H. Khalaj

Recent advances in artificial intelligence have driven increasing intelligent applications at the network edge, such as smart home, smart factory, and smart city. To deploy computationally intensive Deep Neural Networks (DNNs) on…

Networking and Internet Architecture · Computer Science 2020-12-08 Liekang Zeng , Xu Chen , Zhi Zhou , Lei Yang , Junshan Zhang

Deep neural networks (DNNs) are essential for performing advanced tasks on edge or mobile devices, yet their deployment is often hindered by severe resource constraints, including limited memory, energy, and computational power. While…

Machine Learning · Computer Science 2026-03-04 Qunyou Liu , Pengbo Yu , Marina Zapater , David Atienza

Ensembles of Deep Neural Networks (DNNs) have achieved qualitative predictions but they are computing and memory intensive. Therefore, the demand is growing to make them answer a heavy workload of requests with available computational…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-08-31 Pierrick Pochelu , Serge G. Petiton , Bruno Conche

With recent advancements in deep neural networks (DNNs), we are able to solve traditionally challenging problems. Since DNNs are compute intensive, consumers, to deploy a service, need to rely on expensive and scarce compute resources in…

Computer Vision and Pattern Recognition · Computer Science 2019-01-10 Ramyad Hadidi , Jiashen Cao , Micheal S. Ryoo , Hyesoon Kim

As a key technology of enabling Artificial Intelligence (AI) applications in 5G era, Deep Neural Networks (DNNs) have quickly attracted widespread attention. However, it is challenging to run computation-intensive DNN-based tasks on mobile…

Networking and Internet Architecture · Computer Science 2019-10-14 En Li , Liekang Zeng , Zhi Zhou , Xu Chen

The recent advancements of three-dimensional (3D) data acquisition devices have spurred a new breed of applications that rely on point cloud data processing. However, processing a large volume of point cloud data brings a significant…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-06-06 Jiawei Shao , Haowei Zhang , Yuyi Mao , Jun Zhang

In recent years, the use of artificial intelligence on resource-constrained IoT devices has grown significantly. However, existing approaches to DNN partitioning and offloading across the edge-cloud continuum typically rely on static…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-12 Akuen Akoi Deng , Eimantas Butkus , Alfreds Lapkovskis , Praveen Kumar Donta

Hardware accelerations of deep learning systems have been extensively investigated in industry and academia. The aim of this paper is to achieve ultra-high energy efficiency and performance for hardware implementations of deep neural…

Machine Learning · Computer Science 2018-02-20 Yanzhi Wang , Caiwen Ding , Zhe Li , Geng Yuan , Siyu Liao , Xiaolong Ma , Bo Yuan , Xuehai Qian , Jian Tang , Qinru Qiu , Xue Lin

Ubiquitous artificial intelligence (AI) is considered one of the key services in 6G systems. AI services typically rely on deep neural network (DNN) requiring heavy computation. Hence, in order to support ubiquitous AI, it is crucial to…

Networking and Internet Architecture · Computer Science 2022-07-27 Sehun Jung , Hyang-Won Lee

Generative Artificial Intelligence (GenAI) applies models and algorithms such as Large Language Model (LLM) and Foundation Model (FM) to generate new data. GenAI, as a promising approach, enables advanced capabilities in various…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-02-25 Mozhgan Navardi , Romina Aalishah , Yuzhe Fu , Yueqian Lin , Hai Li , Yiran Chen , Tinoosh Mohsenin

The rise of power-efficient embedded computers based on highly-parallel accelerators opens a number of opportunities and challenges for researchers and engineers, and paved the way to the era of edge computing. At the same time, advances in…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-10-13 Paolo Burgio , Gianluca Brilli

Running Deep Neural Network (DNN) models on devices with limited computational capability is a challenge due to large compute and memory requirements. Quantized Neural Networks (QNNs) have emerged as a potential solution to this problem,…

Computer Vision and Pattern Recognition · Computer Science 2018-05-31 Yaman Umuroglu , Magnus Jahre
‹ Prev 1 2 3 10 Next ›