Performance · Computer Science
A Comparative Measurement Study of Deep Learning as a Service Framework
Yanzhao Wu, Ling Liu, Calton Pu, Wenqi Cao +3
2019-08-20
Distributed, Parallel, and Cluster Computing · Computer Science
A Survey of Multi-Tenant Deep Learning Inference on GPU
Fuxun Yu, Di Wang, Longfei Shangguan, Minjia Zhang +2
2022-05-26
Machine Learning · Computer Science
An Empirical Study towards Characterizing Deep Learning Development and Deployment across Different Frameworks and Platforms
Qianyu Guo, Sen Chen, Xiaofei Xie, Lei Ma +5
2019-09-17
Machine Learning · Computer Science
Deep Learning Inference on Heterogeneous Mobile Processors: Potentials and Pitfalls
Sicong Liu, Wentao Zhou, Zimu Zhou, Bin Guo +4
2024-05-06
Machine Learning · Computer Science
Anatomizing Deep Learning Inference in Web Browsers
Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao +5
2024-07-26
Machine Learning · Computer Science
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System
Huaizheng Zhang, Yizheng Huang, Yonggang Wen, Jianxiong Yin +1
2021-01-06
Machine Learning · Computer Science
A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks
Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele
2022-07-04
Distributed, Parallel, and Cluster Computing · Computer Science
Performance Modeling and Evaluation of Distributed Deep Learning Frameworks on GPUs
Shaohuai Shi, Qiang Wang, Xiaowen Chu
2018-08-21
Hardware Architecture · Computer Science
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra, Beatrice Bussolino, Alberto Marchisio, Guido Masera +2
2020-12-22
Machine Learning · Computer Science
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments
Yipeng Du, Zihao Wang, Ahmad Farhan, Claudio Angione +6
2025-08-14
Machine Learning · Computer Science
Bosch Deep Learning Hardware Benchmark
Armin Runge, Thomas Wenzel, Dimitrios Bariamis, Benedikt Sebastian Staffler +2
2020-08-25
Information Retrieval · Computer Science
Deep Learning for Sequential Recommendation: Algorithms, Influential Factors, and Evaluations
Hui Fang, Danning Zhang, Yiheng Shu, Guibing Guo
2020-10-13
Machine Learning · Computer Science
Benchmarking of DL Libraries and Models on Mobile Devices
Qiyang Zhang, Xiang Li, Xiangying Che, Xiao Ma +5
2022-07-07
Distributed, Parallel, and Cluster Computing · Computer Science
Scalable Deep Learning on Distributed Infrastructures: Challenges, Techniques and Tools
Ruben Mayer, Hans-Arno Jacobsen
2019-09-26
Machine Learning · Computer Science
A Comprehensive Study of Deep Learning Model Fixing Approaches
Hanmo You, Zan Wang, Zishuo Dong, Luanqi Mo +2
2026-01-01
Machine Learning · Computer Science
Prediction of GPU Failures Under Deep Learning Workloads
Heting Liu, Zhichao Li, Cheng Tan, Rongqiu Yang +3
2022-01-31
Machine Learning · Computer Science
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators
Krishna Teja Chitty-Venkata, Siddhisanket Raskar, Bharat Kale, Farah Ferdaus +5
2024-11-04
Machine Learning · Computer Science
Meta-Learning for Speeding Up Large Model Inference in Decentralized Environments
Yuzhe Yang, Yipeng Du, Ahmad Farhan, Claudio Angione +5
2024-10-30
Machine Learning · Computer Science
An Orchestrated Empirical Study on Deep Learning Frameworks and Platforms
Qianyu Guo, Xiaofei Xie, Lei Ma, Qiang Hu +5
2018-11-18
Distributed, Parallel, and Cluster Computing · Computer Science
An efficient and flexible inference system for serving heterogeneous ensembles of deep neural networks
Pierrick Pochelu, Serge G. Petiton, Bruno Conche
2022-08-31
Machine Learning · Computer Science
A detailed comparative study of open source deep learning frameworks
Ghadeer Al-Bdour, Raffi Al-Qurran, Mahmoud Al-Ayyoub, Ali Shatnawi
2020-05-07
Machine Learning · Computer Science
A Survey of Large-Scale Deep Learning Serving System Optimization: Challenges and Opportunities
Fuxun Yu, Di Wang, Longfei Shangguan, Minjia Zhang +3
2022-02-22
Distributed, Parallel, and Cluster Computing · Computer Science
SpecInF: Exploiting Idle GPU Resources in Distributed DL Training via Speculative Inference Filling
Cunchi Lv, Xiao Shi, Dong Liang, Wenting Tan +1
2025-03-27