Related papers: Deep Neural Network inference with reduced word le…

Training Deep Neural Networks with 8-bit Floating Point Numbers

The state-of-the-art hardware platforms for training Deep Neural Networks (DNNs) are moving from traditional single precision (32-bit) computations towards 16 bits of precision -- in large part due to the high energy efficiency and smaller…

Machine Learning · Computer Science 2018-12-20 Naigang Wang , Jungwook Choi , Daniel Brand , Chia-Yu Chen , Kailash Gopalakrishnan

Mixed Precision Training With 8-bit Floating Point

Reduced precision computation for deep neural networks is one of the key areas addressing the widening compute gap driven by an exponential growth in model size. In recent years, deep learning training has largely migrated to 16-bit…

Machine Learning · Computer Science 2019-05-30 Naveen Mellempudi , Sudarshan Srinivasan , Dipankar Das , Bharat Kaul

Hardware-Software Codesign of Accurate, Multiplier-free Deep Neural Networks

While Deep Neural Networks (DNNs) push the state-of-the-art in many machine learning applications, they often require millions of expensive floating-point operations for each input classification. This computation overhead limits the…

Neural and Evolutionary Computing · Computer Science 2017-05-12 Hokchhay Tann , Soheil Hashemi , Iris Bahar , Sherief Reda

Training Deep Neural Networks Using Posit Number System

With the increasing size of Deep Neural Network (DNN) models, the high memory space requirements and computational complexity have become an obstacle for efficient DNN implementations. To ease this problem, using reduced-precision…

Machine Learning · Computer Science 2019-09-10 Jinming Lu , Siyuan Lu , Zhisheng Wang , Chao Fang , Jun Lin , Zhongfeng Wang , Li Du

A Framework for Semi-Automatic Precision and Accuracy Analysis for Fast and Rigorous Deep Learning

Deep Neural Networks (DNN) represent a performance-hungry application. Floating-Point (FP) and custom floating-point-like arithmetic satisfies this hunger. While there is need for speed, inference in DNNs does not seem to have any need for…

Machine Learning · Computer Science 2020-02-11 Christoph Lauter , Anastasia Volkova

Deep Positron: A Deep Neural Network Using the Posit Number System

The recent surge of interest in Deep Neural Networks (DNNs) has led to increasingly complex networks that tax computational and memory resources. Many DNNs presently use 16-bit or 32-bit floating point operations. Significant performance…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-01-23 Zachariah Carmichael , Hamed F. Langroudi , Char Khazanov , Jeffrey Lillie , John L. Gustafson , Dhireesha Kudithipudi

Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations

Deep convolutional neural network (CNN) inference requires significant amount of memory and computation, which limits its deployment on embedded devices. To alleviate these problems to some extent, prior research utilize low precision…

Machine Learning · Computer Science 2017-03-10 Liangzhen Lai , Naveen Suda , Vikas Chandra

Exploration of Numerical Precision in Deep Neural Networks

Reduced numerical precision is a common technique to reduce computational cost in many Deep Neural Networks (DNNs). While it has been observed that DNNs are resilient to small errors and noise, no general result exists that is capable of…

Machine Learning · Statistics 2018-05-04 Zhaoqi Li , Yu Ma , Catalina Vajiac , Yunkai Zhang

Shifted and Squeezed 8-bit Floating Point format for Low-Precision Training of Deep Neural Networks

Training with larger number of parameters while keeping fast iterations is an increasingly adopted strategy and trend for developing better performing Deep Neural Network (DNN) models. This necessitates increased memory footprint and…

Machine Learning · Computer Science 2020-01-17 Léopold Cambier , Anahita Bhiwandiwalla , Ting Gong , Mehran Nekuii , Oguz H Elibol , Hanlin Tang

Low Precision Neural Networks using Subband Decomposition

Large-scale deep neural networks (DNN) have been successfully used in a number of tasks from image recognition to natural language processing. They are trained using large training sets on large models, making them computationally and…

Machine Learning · Computer Science 2017-03-28 Sek Chai , Aswin Raghavan , David Zhang , Mohamed Amer , Tim Shields

Performance-Efficiency Trade-off of Low-Precision Numerical Formats in Deep Neural Networks

Deep neural networks (DNNs) have been demonstrated as effective prognostic models across various domains, e.g. natural language processing, computer vision, and genomics. However, modern-day DNNs demand high compute and memory storage for…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-03-27 Zachariah Carmichael , Hamed F. Langroudi , Char Khazanov , Jeffrey Lillie , John L. Gustafson , Dhireesha Kudithipudi

Deep Learning Training on the Edge with Low-Precision Posits

Recently, the posit numerical format has shown promise for DNN data representation and compute with ultra-low precision ([5..8]-bit). However, majority of studies focus only on DNN inference. In this work, we propose DNN training using…

Machine Learning · Computer Science 2019-08-01 Hamed F. Langroudi , Zachariah Carmichael , Dhireesha Kudithipudi

Class-Aware Pruning for Efficient Neural Networks

Deep neural networks (DNNs) have demonstrated remarkable success in various fields. However, the large number of floating-point operations (FLOPs) in DNNs poses challenges for their deployment in resource-constrained applications, e.g.,…

Artificial Intelligence · Computer Science 2024-02-20 Mengnan Jiang , Jingcun Wang , Amro Eldebiky , Xunzhao Yin , Cheng Zhuo , Ing-Chao Lin , Grace Li Zhang

Fixed-point optimization of deep neural networks with adaptive step size retraining

Fixed-point optimization of deep neural networks plays an important role in hardware based design and low-power implementations. Many deep neural networks show fairly good performance even with 2- or 3-bit precision when quantized weights…

Machine Learning · Computer Science 2017-02-28 Sungho Shin , Yoonho Boo , Wonyong Sung

Deep Neural Networks Based Weight Approximation and Computation Reuse for 2-D Image Classification

Deep Neural Networks (DNNs) are computationally and memory intensive, which makes their hardware implementation a challenging task especially for resource constrained devices such as IoT nodes. To address this challenge, this paper…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Mohammed F. Tolba , Huruy Tekle Tesfai , Hani Saleh , Baker Mohammad , Mahmoud Al-Qutayri

All-You-Can-Fit 8-Bit Flexible Floating-Point Format for Accurate and Memory-Efficient Inference of Deep Neural Networks

Modern deep neural network (DNN) models generally require a huge amount of weight and activation values to achieve good inference outcomes. Those data inevitably demand a massive off-chip memory capacity/bandwidth, and the situation gets…

Machine Learning · Computer Science 2021-04-27 Cheng-Wei Huang , Tim-Wei Chen , Juinn-Dar Huang

Optimizing Deep Learning Inference on Embedded Systems Through Adaptive Model Selection

Deep neural networks ( DNNs ) are becoming a key enabling technology for many application domains. However, on-device inference on battery-powered, resource-constrained embedding systems is often infeasible due to prohibitively long…

Machine Learning · Computer Science 2019-11-13 Vicent Sanz Marco , Ben Taylor , Zheng Wang , Yehia Elkhatib

Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators

The majority of the research on the quantization of Deep Neural Networks (DNNs) is focused on reducing the precision of tensors visible by high-level frameworks (e.g., weights, activations, and gradients). However, current hardware still…

Machine Learning · Computer Science 2024-01-26 Yaniv Blumenfeld , Itay Hubara , Daniel Soudry

A Survey of Methods for Low-Power Deep Learning and Computer Vision

Deep neural networks (DNNs) are successful in many computer vision tasks. However, the most accurate DNNs require millions of parameters and operations, making them energy, computation and memory intensive. This impedes the deployment of…

Computer Vision and Pattern Recognition · Computer Science 2020-03-26 Abhinav Goel , Caleb Tung , Yung-Hsiang Lu , George K. Thiruvathukal

PositNN: Training Deep Neural Networks with Mixed Low-Precision Posit

Low-precision formats have proven to be an efficient way to reduce not only the memory footprint but also the hardware resources and power consumption of deep learning computations. Under this premise, the posit numerical format appears to…

Machine Learning · Computer Science 2021-05-17 Gonçalo Raposo , Pedro Tomás , Nuno Roma