English
Related papers

Related papers: Deep Neural Network inference with reduced word le…

200 papers

The state-of-the-art hardware platforms for training Deep Neural Networks (DNNs) are moving from traditional single precision (32-bit) computations towards 16 bits of precision -- in large part due to the high energy efficiency and smaller…

Machine Learning · Computer Science 2018-12-20 Naigang Wang , Jungwook Choi , Daniel Brand , Chia-Yu Chen , Kailash Gopalakrishnan

Reduced precision computation for deep neural networks is one of the key areas addressing the widening compute gap driven by an exponential growth in model size. In recent years, deep learning training has largely migrated to 16-bit…

Machine Learning · Computer Science 2019-05-30 Naveen Mellempudi , Sudarshan Srinivasan , Dipankar Das , Bharat Kaul

While Deep Neural Networks (DNNs) push the state-of-the-art in many machine learning applications, they often require millions of expensive floating-point operations for each input classification. This computation overhead limits the…

Neural and Evolutionary Computing · Computer Science 2017-05-12 Hokchhay Tann , Soheil Hashemi , Iris Bahar , Sherief Reda

With the increasing size of Deep Neural Network (DNN) models, the high memory space requirements and computational complexity have become an obstacle for efficient DNN implementations. To ease this problem, using reduced-precision…

Machine Learning · Computer Science 2019-09-10 Jinming Lu , Siyuan Lu , Zhisheng Wang , Chao Fang , Jun Lin , Zhongfeng Wang , Li Du

Deep Neural Networks (DNN) represent a performance-hungry application. Floating-Point (FP) and custom floating-point-like arithmetic satisfies this hunger. While there is need for speed, inference in DNNs does not seem to have any need for…

Machine Learning · Computer Science 2020-02-11 Christoph Lauter , Anastasia Volkova

The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-23 Zachariah Carmichael , Hamed F. Langroudi , Char Khazanov , Jeffrey Lillie , John L. Gustafson , Dhireesha Kudithipudi

Deep convolutional neural network (CNN) inference requires significant amount of memory and computation, which limits its deployment on embedded devices. To alleviate these problems to some extent, prior research utilize low precision…

Machine Learning · Computer Science 2017-03-10 Liangzhen Lai , Naveen Suda , Vikas Chandra

Reduced numerical precision is a common technique to reduce computational cost in many Deep Neural Networks (DNNs). While it has been observed that DNNs are resilient to small errors and noise, no general result exists that is capable of…

Machine Learning · Statistics 2018-05-04 Zhaoqi Li , Yu Ma , Catalina Vajiac , Yunkai Zhang

Training with larger number of parameters while keeping fast iterations is an increasingly adopted strategy and trend for developing better performing Deep Neural Network (DNN) models. This necessitates increased memory footprint and…

Machine Learning · Computer Science 2020-01-17 Léopold Cambier , Anahita Bhiwandiwalla , Ting Gong , Mehran Nekuii , Oguz H Elibol , Hanlin Tang

Large-scale deep neural networks (DNN) have been successfully used in a number of tasks from image recognition to natural language processing. They are trained using large training sets on large models, making them computationally and…

Machine Learning · Computer Science 2017-03-28 Sek Chai , Aswin Raghavan , David Zhang , Mohamed Amer , Tim Shields

Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-27 Zachariah Carmichael , Hamed F. Langroudi , Char Khazanov , Jeffrey Lillie , John L. Gustafson , Dhireesha Kudithipudi

Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision ([5..8]-bit). However, majority of studies focus only on DNN inference. In this work, we propose DNN training using…

Machine Learning · Computer Science 2019-08-01 Hamed F. Langroudi , Zachariah Carmichael , Dhireesha Kudithipudi

Deep neural networks (DNNs) have demonstrated remarkable success in various fields. However, the large number of floating-point operations (FLOPs) in DNNs poses challenges for their deployment in resource-constrained applications, e.g.,…

Artificial Intelligence · Computer Science 2024-02-20 Mengnan Jiang , Jingcun Wang , Amro Eldebiky , Xunzhao Yin , Cheng Zhuo , Ing-Chao Lin , Grace Li Zhang

Fixed-point optimization of deep neural networks plays an important role in hardware based design and low-power implementations. Many deep neural networks show fairly good performance even with 2- or 3-bit precision when quantized weights…

Machine Learning · Computer Science 2017-02-28 Sungho Shin , Yoonho Boo , Wonyong Sung

Deep Neural Networks (DNNs) are computationally and memory intensive, which makes their hardware implementation a challenging task especially for resource constrained devices such as IoT nodes. To address this challenge, this paper…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Mohammed F. Tolba , Huruy Tekle Tesfai , Hani Saleh , Baker Mohammad , Mahmoud Al-Qutayri

Modern deep neural network (DNN) models generally require a huge amount of weight and activation values to achieve good inference outcomes. Those data inevitably demand a massive off-chip memory capacity/bandwidth, and the situation gets…

Machine Learning · Computer Science 2021-04-27 Cheng-Wei Huang , Tim-Wei Chen , Juinn-Dar Huang

Deep neural networks ( DNNs ) are becoming a key enabling technology for many application domains. However, on-device inference on battery-powered, resource-constrained embedding systems is often infeasible due to prohibitively long…

Machine Learning · Computer Science 2019-11-13 Vicent Sanz Marco , Ben Taylor , Zheng Wang , Yehia Elkhatib

The majority of the research on the quantization of Deep Neural Networks (DNNs) is focused on reducing the precision of tensors visible by high-level frameworks (e.g., weights, activations, and gradients). However, current hardware still…

Machine Learning · Computer Science 2024-01-26 Yaniv Blumenfeld , Itay Hubara , Daniel Soudry

Deep neural networks (DNNs) are successful in many computer vision tasks. However, the most accurate DNNs require millions of parameters and operations, making them energy, computation and memory intensive. This impedes the deployment of…

Computer Vision and Pattern Recognition · Computer Science 2020-03-26 Abhinav Goel , Caleb Tung , Yung-Hsiang Lu , George K. Thiruvathukal

Low-precision formats have proven to be an efficient way to reduce not only the memory footprint but also the hardware resources and power consumption of deep learning computations. Under this premise, the posit numerical format appears to…

Machine Learning · Computer Science 2021-05-17 Gonçalo Raposo , Pedro Tomás , Nuno Roma
‹ Prev 1 2 3 10 Next ›