Related papers: Convolutional Neural Networks using Logarithmic Da…

Towards Effective Low-bitwidth Convolutional Neural Networks

This paper tackles the problem of training a deep convolutional neural network with both low-precision weights and low-bitwidth activations. Optimizing a low-precision network is very challenging since the training process can easily get…

Computer Vision and Pattern Recognition · Computer Science 2021-06-05 Bohan Zhuang , Chunhua Shen , Mingkui Tan , Lingqiao Liu , Ian Reid

Deep Learning with Limited Numerical Precision

Training of large-scale deep neural networks is often constrained by the available computational resources. We study the effect of limited precision data representation and computation on neural network training. Within the context of…

Machine Learning · Computer Science 2015-02-11 Suyog Gupta , Ankur Agrawal , Kailash Gopalakrishnan , Pritish Narayanan

Convolutional Networks for Fast, Energy-Efficient Neuromorphic Computing

Deep networks are now able to achieve human-level performance on a broad spectrum of recognition tasks. Independently, neuromorphic computing has now demonstrated unprecedented energy-efficiency through a new chip architecture based on…

Neural and Evolutionary Computing · Computer Science 2016-10-13 Steven K. Esser , Paul A. Merolla , John V. Arthur , Andrew S. Cassidy , Rathinakumar Appuswamy , Alexander Andreopoulos , David J. Berg , Jeffrey L. McKinstry , Timothy Melano , Davis R. Barch , Carmelo di Nolfo , Pallab Datta , Arnon Amir , Brian Taba , Myron D. Flickner , Dharmendra S. Modha

Neural Network Training with Approximate Logarithmic Computations

The high computational complexity associated with training deep neural networks limits online and real-time training on edge devices. This paper proposes an end-to-end training and inference scheme that eliminates multiplications by…

Machine Learning · Computer Science 2026-05-05 Arnab Sanyal , Peter A. Beerel , Keith M. Chugg

Low-Precision Floating-Point Schemes for Neural Network Training

The use of low-precision fixed-point arithmetic along with stochastic rounding has been proposed as a promising alternative to the commonly used 32-bit floating-point arithmetic to enhance neural network training in terms of…

Machine Learning · Computer Science 2018-04-17 Marc Ortiz , Adrián Cristal , Eduard Ayguadé , Marc Casas

Evolving Deep Convolutional Neural Networks for Image Classification

Evolutionary computation methods have been successfully applied to neural networks for two decades, but those methods do not scale well to modern deep neural networks due to the complicated architectures and large quantities of…

Neural and Evolutionary Computing · Computer Science 2019-03-12 Yanan Sun , Bing Xue , Mengjie Zhang , Gary G. Yen

Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations

This paper tackles the problem of training a deep convolutional neural network with both low-bitwidth weights and activations. Optimizing a low-precision network is very challenging due to the non-differentiability of the quantizer, which may…

Computer Vision and Pattern Recognition · Computer Science 2021-06-07 Bohan Zhuang , Jing Liu , Mingkui Tan , Lingqiao Liu , Ian Reid , Chunhua Shen

Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations

Deep convolutional neural network (CNN) inference requires a significant amount of memory and computation, which limits its deployment on embedded devices. To alleviate these problems to some extent, prior research utilizes low precision…

Machine Learning · Computer Science 2017-03-10 Liangzhen Lai , Naveen Suda , Vikas Chandra

Mixed Precision Training With 8-bit Floating Point

Reduced precision computation for deep neural networks is one of the key areas addressing the widening compute gap driven by an exponential growth in model size. In recent years, deep learning training has largely migrated to 16-bit…

Machine Learning · Computer Science 2019-05-30 Naveen Mellempudi , Sudarshan Srinivasan , Dipankar Das , Bharat Kaul

Binarized Convolutional Neural Networks for Efficient Inference on GPUs

Convolutional neural networks have recently achieved significant breakthroughs in various image classification tasks. However, they are computationally expensive, which can make their feasible implementation on embedded and low-power devices…

Machine Learning · Computer Science 2018-08-02 Mir Khan , Heikki Huttunen , Jani Boutellier

Learning the Non-linearity in Convolutional Neural Networks

We propose the introduction of nonlinear operation into the feature generation process in convolutional neural networks. This nonlinearity can be implemented in various ways. First we discuss the use of nonlinearities in the process of data…

Machine Learning · Computer Science 2019-05-30 Gavneet Singh Chadha , Andreas Schwung

8-bit Numerical Formats for Deep Neural Networks

Given the current trend of increasing size and complexity of machine learning architectures, it has become of critical importance to identify new approaches to improve the computational efficiency of model training. In this context, we…

Machine Learning · Computer Science 2022-06-08 Badreddine Noune , Philip Jones , Daniel Justus , Dominic Masters , Carlo Luschi

Training Deep Neural Networks with 8-bit Floating Point Numbers

The state-of-the-art hardware platforms for training Deep Neural Networks (DNNs) are moving from traditional single precision (32-bit) computations towards 16 bits of precision -- in large part due to the high energy efficiency and smaller…

Machine Learning · Computer Science 2018-12-20 Naigang Wang , Jungwook Choi , Daniel Brand , Chia-Yu Chen , Kailash Gopalakrishnan

Deep Learning Inference on Embedded Devices: Fixed-Point vs Posit

Performing the inference step of deep learning in resource constrained environments, such as embedded devices, is challenging. Success requires optimization at both software and hardware levels. Low precision arithmetic and specifically low…

Computer Vision and Pattern Recognition · Computer Science 2018-05-23 Seyed H. F. Langroudi , Tej Pandit , Dhireesha Kudithipudi

Bit-wise Training of Neural Network Weights

We introduce an algorithm where the individual bits representing the weights of a neural network are learned. This method allows training weights with integer values on arbitrary bit-depths and naturally uncovers sparse networks, without…

Machine Learning · Computer Science 2022-02-22 Cristian Ivan

Low-Resolution Neural Networks

The expanding scale of large neural network models introduces significant challenges, driving efforts to reduce memory usage and enhance computational efficiency. Such measures are crucial to ensure the practical implementation and…

Machine Learning · Computer Science 2025-02-14 Eduardo Lobo Lustosa Cabral , Larissa Driemeier

Random 2.5D U-net for Fully 3D Segmentation

Convolutional neural networks are state-of-the-art for various segmentation tasks. While for 2D images these networks are also computationally efficient, 3D convolutions have huge storage requirements and therefore, end-to-end training is…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Christoph Angermann , Markus Haltmeier

Exploring the Potential of Low-bit Training of Convolutional Neural Networks

In this work, we propose a low-bit training framework for convolutional neural networks, which is built around a novel multi-level scaling (MLS) tensor format. Our framework focuses on reducing the energy consumption of convolution…

Machine Learning · Computer Science 2021-07-15 Kai Zhong , Xuefei Ning , Guohao Dai , Zhenhua Zhu , Tianchen Zhao , Shulin Zeng , Yu Wang , Huazhong Yang

Post-training 4-bit quantization of convolution networks for rapid-deployment

Convolutional neural networks require significant memory bandwidth and storage for intermediate computations, apart from substantial computing resources. Neural network quantization has significant benefits in reducing the amount of…

Computer Vision and Pattern Recognition · Computer Science 2019-05-30 Ron Banner , Yury Nahshan , Elad Hoffer , Daniel Soudry

Power-of-Two Quantization for Low Bitwidth and Hardware Compliant Neural Networks

Deploying Deep Neural Networks in low-power embedded devices for real time-constrained applications requires optimization of memory and computational complexity of the networks, usually by quantizing the weights. Most of the existing works…

Machine Learning · Computer Science 2022-03-11 Dominika Przewlocka-Rus , Syed Shakib Sarwar , H. Ekin Sumbul , Yuecheng Li , Barbara De Salvo