English
Related papers

Related papers: Accelerating Deep Learning Inference via Freezing

200 papers

The success of deep neural networks (DNN) in machine perception applications such as image classification and speech recognition comes at the cost of high computation and storage complexity. Inference of uncompressed large scale DNN models…

Machine Learning · Computer Science 2020-07-06 Yihao Fang , Shervin Manzuri Shalmani , Rong Zheng

Deep Neural Networks (DNNs) are witnessing increased adoption in multiple domains owing to their high accuracy in solving real-world problems. However, this high accuracy has been achieved by building deeper networks, posing a fundamental…

Machine Learning · Computer Science 2021-01-20 Arjun Balasubramanian , Adarsh Kumar , Yuhan Liu , Han Cao , Shivaram Venkataraman , Aditya Akella

Deep Neural Networks (DNNs) have become an essential component in many application domains including web-based services. A variety of these services require high throughput and (close to) real-time features, for instance, to respond or…

Machine Learning · Computer Science 2022-09-20 Mohammadamin Abedi , Yanni Iouannou , Pooyan Jamshidi , Hadi Hemmati

With the growing size of deep neural networks and datasets, the computational costs of training have significantly increased. The layer-freezing technique has recently attracted great attention as a promising method to effectively reduce…

Machine Learning · Computer Science 2025-08-22 Chence Yang , Ci Zhang , Lei Lu , Qitao Tan , Sheng Li , Ao Li , Xulong Tang , Shaoyi Huang , Jinzhen Wang , Guoming Li , Jundong Li , Xiaoming Zhai , Jin Lu , Geng Yuan

The early layers of a deep neural net have the fewest parameters, but take up the most computation. In this extended abstract, we propose to only train the hidden layers for a set portion of the training run, freezing them out one-by-one…

Machine Learning · Statistics 2017-06-20 Andrew Brock , Theodore Lim , J. M. Ritchie , Nick Weston

Graphics rendering applications increasingly leverage neural networks in tasks such as denoising, supersampling, and frame extrapolation to improve image quality while maintaining frame rates. The temporal coherence inherent in these tasks…

Graphics · Computer Science 2025-06-18 Lufei Liu , Tor M. Aamodt

The widespread application of deep learning has changed the landscape of computation in the data center. In particular, personalized recommendation for content ranking is now largely accomplished leveraging deep neural networks. However,…

Many techniques have been developed, such as model compression, to make Deep Neural Networks (DNNs) inference more efficiently. Nevertheless, DNNs still lack excellent run-time dynamic inference capability to enable users trade-off accuracy…

Computer Vision and Pattern Recognition · Computer Science 2020-09-15 Li Yang , Zhezhi He , Yu Cao , Deliang Fan

With the edge computing becoming an increasingly adopted concept in system architectures, it is expected its utilization will be additionally heightened when combined with deep learning (DL) techniques. The idea behind integrating demanding…

Networking and Internet Architecture · Computer Science 2020-03-12 Mounir Bensalem , Jasenka Dizdarević , Admela Jukan

Modern machine learning tools such as deep neural networks (DNNs) are playing a revolutionary role in many fields such as natural language processing, computer vision, and the internet of things. Once they are trained, deep learning models…

Machine Learning · Computer Science 2022-01-19 Arjun Parthasarathy , Bhaskar Krishnamachari

Deep neural networks ( DNNs ) are becoming a key enabling technology for many application domains. However, on-device inference on battery-powered, resource-constrained embedding systems is often infeasible due to prohibitively long…

Machine Learning · Computer Science 2019-11-13 Vicent Sanz Marco , Ben Taylor , Zheng Wang , Yehia Elkhatib

Training deep neural networks (DNNs) is time-consuming. While most existing solutions try to overlap/schedule computation and communication for efficient training, this paper goes one step further by skipping computing and communication…

Machine Learning · Computer Science 2023-03-14 Yiding Wang , Decang Sun , Kai Chen , Fan Lai , Mosharaf Chowdhury

Deep convolution Neural Network (DCNN) has been widely used in computer vision tasks. However, for edge devices even inference has too large computational complexity and data access amount. The inference latency of state-of-the-art models…

Hardware Architecture · Computer Science 2025-09-09 Kuan-Ting Lin , Ching-Te Chiu , Jheng-Yi Chang , Shi-Zong Huang , Yu-Ting Li

Recently, deep neural networks (DNNs) have been widely applied in mobile intelligent applications. The inference for the DNNs is usually performed in the cloud. However, it leads to a large overhead of transmitting data via wireless…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-12-19 Guangli Li , Lei Liu , Xueying Wang , Xiao Dong , Peng Zhao , Xiaobing Feng

Accelerating deep neural network (DNN) inference on resource-limited devices is one of the most important barriers to ensuring a wider and more inclusive adoption. To alleviate this, DNN binary quantization for faster convolution and memory…

Machine Learning · Computer Science 2021-08-24 Meshia Cédric Oveneke

The large computing and memory cost of deep neural networks (DNNs) often precludes their use in resource-constrained devices. Quantizing the parameters and operations to lower bit-precision offers substantial memory and energy savings for…

Machine Learning · Computer Science 2023-09-01 Clemens JS Schaefer , Siddharth Joshi , Shan Li , Raul Blazquez

DNNs are ubiquitous on edge devices nowadays. With its increasing importance and use cases, it's not likely to pack all DNNs into device memory and expect that each inference has been warmed up. Therefore, cold inference, the process to…

Machine Learning · Computer Science 2023-08-29 Rongjie Yi , Ting Cao , Ao Zhou , Xiao Ma , Shangguang Wang , Mengwei Xu

Reducing inference time and energy usage while maintaining prediction accuracy has become a significant concern for deep neural networks (DNN) inference on resource-constrained edge devices. To address this problem, we propose a novel…

Machine Learning · Computer Science 2024-03-13 Hasanul Mahmud , Peng Kang , Kevin Desai , Palden Lama , Sushil Prasad

Convolutional Neural Networks (CNNs) has revolutionized computer vision, but training very deep networks has been challenging due to the vanishing gradient problem. This paper explores Residual Networks (ResNet), introduced by He et al.…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Xingyu Liu , Kun Ming Goh

The success of deep neural networks (DNNs) is heavily dependent on computational resources. While DNNs are often employed on cloud servers, there is a growing need to operate DNNs on edge devices. Edge devices are typically limited in their…

Machine Learning · Computer Science 2022-06-08 May Malka , Erez Farhan , Hai Morgenstern , Nir Shlezinger
‹ Prev 1 2 3 10 Next ›