English
Related papers

Related papers: Deep Learning Inference Frameworks Benchmark

200 papers

Big data powered Deep Learning (DL) and its applications have blossomed in recent years, fueled by three technological trends: a large amount of digitized data openly accessible, a growing number of DL software frameworks in open source and…

Performance · Computer Science 2019-08-20 Yanzhao Wu , Ling Liu , Calton Pu , Wenqi Cao , Semih Sahin , Wenqi Wei , Qi Zhang

Deep Learning (DL) models have achieved superior performance. Meanwhile, computing hardware like NVIDIA GPUs also demonstrated strong computing scaling trends with 2x throughput and memory bandwidth for each generation. With such strong…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-05-26 Fuxun Yu , Di Wang , Longfei Shangguan , Minjia Zhang , Chenchen Liu , Xiang Chen

Deep Learning (DL) has recently achieved tremendous success. A variety of DL frameworks and platforms play a key role to catalyze such progress. However, the differences in architecture designs and implementations of existing frameworks and…

Machine Learning · Computer Science 2019-09-17 Qianyu Guo , Sen Chen , Xiaofei Xie , Lei Ma , Qiang Hu , Hongtao Liu , Yang Liu , Jianjun Zhao , Xiaohong Li

There is a growing demand to deploy computation-intensive deep learning (DL) models on resource-constrained mobile devices for real-time intelligent applications. Equipped with a variety of processing units such as CPUs, GPUs, and NPUs, the…

Machine Learning · Computer Science 2024-05-06 Sicong Liu , Wentao Zhou , Zimu Zhou , Bin Guo , Minfan Wang , Cheng Fang , Zheng Lin , Zhiwen Yu

Web applications have increasingly adopted Deep Learning (DL) through in-browser inference, wherein DL inference performs directly within Web browsers. The actual performance of in-browser inference and its impacts on the quality of…

Machine Learning · Computer Science 2024-07-26 Qipeng Wang , Shiqi Jiang , Zhenpeng Chen , Xu Cao , Yuanchun Li , Aoyu Li , Yun Ma , Ting Cao , Xuanzhe Liu

Deep learning (DL) models have become core modules for many applications. However, deploying these models without careful performance benchmarking that considers both hardware and software's impact often leads to poor service and costly…

Machine Learning · Computer Science 2021-01-06 Huaizheng Zhang , Yizheng Huang , Yonggang Wen , Jianxiong Yin , Kyle Guan

The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields.…

Machine Learning · Computer Science 2022-07-04 Daniel Nichols , Siddharth Singh , Shu-Huai Lin , Abhinav Bhatele

Deep learning frameworks have been widely deployed on GPU servers for deep learning applications in both academia and industry. In training deep neural networks (DNNs), there are many standard processes or algorithms, such as convolution…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-08-21 Shaohuai Shi , Qiang Wang , Xiaowen Chu

Currently, Machine Learning (ML) is becoming ubiquitous in everyday life. Deep Learning (DL) is already present in many applications ranging from computer vision for medicine to autonomous driving of modern cars as well as other sectors in…

Hardware Architecture · Computer Science 2020-12-22 Maurizio Capra , Beatrice Bussolino , Alberto Marchisio , Guido Masera , Maurizio Martina , Muhammad Shafique

The deployment of large-scale models, such as large language models (LLMs), incurs substantial costs due to their computational demands. To mitigate these costs and address challenges related to scalability and data security, there is a…

Deep Learning (DL) has developed to become a corner-stone in many everyday applications that we are now relying on. However, making sure that the DL model uses the underlying hardware efficiently takes a lot of effort. Knowledge about…

Performance · Computer Science 2023-03-22 Karthick Panner Selvam , Mats Brorsson

The widespread use of Deep Learning (DL) applications in science and industry has created a large demand for efficient inference systems. This has resulted in a rapid increase of available Hardware Accelerators (HWAs) making comparison…

In the field of sequential recommendation, deep learning (DL)-based methods have received a lot of attention in the past few years and surpassed traditional models such as Markov chain-based and factorization-based ones. However, there is…

Information Retrieval · Computer Science 2020-10-13 Hui Fang , Danning Zhang , Yiheng Shu , Guibing Guo

Deploying deep learning (DL) on mobile devices has been a notable trend in recent years. To support fast inference of on-device DL, DL libraries play a critical role as algorithms and hardware do. Unfortunately, no prior work ever dives…

Machine Learning · Computer Science 2022-07-07 Qiyang Zhang , Xiang Li , Xiangying Che , Xiao Ma , Ao Zhou , Mengwei Xu , Shangguang Wang , Yun Ma , Xuanzhe Liu

Deep Learning (DL) has had an immense success in the recent past, leading to state-of-the-art results in various domains such as image recognition and natural language processing. One of the reasons for this success is the increasing size…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-09-26 Ruben Mayer , Hans-Arno Jacobsen

Deep Learning (DL) has been widely adopted in diverse industrial domains, including autonomous driving, intelligent healthcare, and aided programming. Like traditional software, DL systems are also prone to faults, whose malfunctioning may…

Machine Learning · Computer Science 2026-01-01 Hanmo You , Zan Wang , Zishuo Dong , Luanqi Mo , Jianjun Zhao , Junjie Chen

Graphics processing units (GPUs) are the de facto standard for processing deep learning (DL) tasks. Meanwhile, GPU failures, which are inevitable, cause severe consequences in DL tasks: they disrupt distributed trainings, crash inference…

Machine Learning · Computer Science 2022-01-31 Heting Liu , Zhichao Li , Cheng Tan , Rongqiu Yang , Guohong Cao , Zherui Liu , Chuanxiong Guo

Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges,…

The deployment of large-scale models, such as large language models (LLMs) and sophisticated image generation systems, incurs substantial costs due to their computational demands. To mitigate these costs and address challenges related to…

Machine Learning · Computer Science 2024-10-30 Yuzhe Yang , Yipeng Du , Ahmad Farhan , Claudio Angione , Yue Zhao , Harry Yang , Fielding Johnston , James Buban , Patrick Colangelo

Deep learning (DL) has recently achieved tremendous success in a variety of cutting-edge applications, e.g., image recognition, speech and natural language processing, and autonomous driving. Besides the available big data and hardware…

Machine Learning · Computer Science 2018-11-18 Qianyu Guo , Xiaofei Xie , Lei Ma , Qiang Hu , Ruitao Feng , Li Li , Yang Liu , Jianjun Zhao , Xiaohong Li
‹ Prev 1 2 3 10 Next ›