English
Related papers

Related papers: Learning Recurrent Binary/Ternary Weights

200 papers

Binary and sparse ternary weights in neural networks enable faster computations and lighter representations, facilitating their use on edge devices with limited computational power. Meanwhile, vanilla RNNs are highly sensitive to changes in…

Machine Learning · Computer Science 2025-05-07 Armand Foucault , Franck Mamalet , François Malgouyres

Associative memory using fast weights is a short-term memory mechanism that substantially improves the memory capacity and time scale of recurrent neural networks (RNNs). As recent studies introduced fast weights only to regular RNNs, it is…

Neural and Evolutionary Computing · Computer Science 2018-04-19 T. Anderson Keller , Sharath Nittur Sridhar , Xin Wang

This paper presents a storage-efficient learning model titled Recursive Binary Neural Networks for sensing devices having a limited amount of on-chip data storage such as < 100's kilo-Bytes. The main idea of the proposed model is to…

Neural and Evolutionary Computing · Computer Science 2017-09-18 Tianchan Guan , Xiaoyang Zeng , Mingoo Seok

The design of systems implementing low precision neural networks with emerging memories such as resistive random access memory (RRAM) is a major lead for reducing the energy consumption of artificial intelligence (AI). Multiple works have…

The design of systems implementing low precision neural networks with emerging memories such as resistive random access memory (RRAM) is a significant lead for reducing the energy consumption of artificial intelligence. To achieve maximum…

Recurrent Neural Networks (RNNs) produce state-of-art performance on many machine learning tasks but their demand on resources in terms of memory and computational power are often high. Therefore, there is a great interest in optimizing the…

Neural and Evolutionary Computing · Computer Science 2017-02-28 Joachim Ott , Zhouhan Lin , Ying Zhang , Shih-Chii Liu , Yoshua Bengio

Recurrent Neural Networks (RNNs) with Long Short-Term Memory units (LSTM) are widely used because they are expressive and are easy to train. Our interest lies in empirically evaluating the expressiveness and the learnability of LSTMs in the…

Neural and Evolutionary Computing · Computer Science 2015-11-24 Wojciech Zaremba , Ilya Sutskever

Long short-term memory (LSTM) is one of the robust recurrent neural network architectures for learning sequential data. However, it requires considerable computational power to learn and implement both software and hardware aspects. This…

Machine Learning · Computer Science 2023-01-13 Nelly Elsayed , Zag ElSayed , Anthony S. Maida

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

Despite their tremendous success and versatility, Deep Neural Networks (DNNs) such as Large Language Models (LLMs) suffer from inference inefficiency and rely on advanced computational infrastructure. To address these challenges and make…

Machine Learning · Computer Science 2025-05-05 Mohsen Dehghankar , Mahdi Erfanian , Abolfazl Asudeh

Recent breakthroughs in computer vision make use of large deep neural networks, utilizing the substantial speedup offered by GPUs. For applications running on limited hardware, however, high precision real-time processing can still be a…

Machine Learning · Computer Science 2018-02-05 Oran Shayer , Dan Levi , Ethan Fetaya

Binary neural networks (BNNs) have received ever-increasing popularity for their great capability of reducing storage burden as well as quickening inference time. However, there is a severe performance drop compared with real-valued…

Machine Learning · Computer Science 2023-02-07 Sheng Xu , Yanjing Li , Teli Ma , Mingbao Lin , Hao Dong , Baochang Zhang , Peng Gao , Jinhu Lv

Low-bit quantized neural networks are of great interest in practical applications because they significantly reduce the consumption of both memory and computational resources. Binary neural networks are memory and computationally efficient…

Machine Learning · Computer Science 2022-05-20 Anton Trusov , Elena Limonova , Dmitry Nikolaev , Vladimir V. Arlazarov

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

Recurrent Neural Networks (RNN) are widely used for learning sequences in applications such as EEG classification. Complex RNNs could be hardly deployed on wearable devices due to their computation and memory-intensive processing patterns.…

Signal Processing · Electrical Eng. & Systems 2020-04-21 Seyed Ahmad Mirsalari , Sima Sinaei , Mostafa E. Salehi , Masoud Daneshtalab

The computation and storage requirements for Deep Neural Networks (DNNs) are usually high. This issue limits their deployability on ubiquitous computing devices such as smart phones, wearables and autonomous drones. In this paper, we…

Machine Learning · Computer Science 2017-02-28 Hande Alemdar , Vincent Leroy , Adrien Prost-Boucle , Frédéric Pétrot

Deep Neural Networks (DNN) have achieved state-of-the-art results in a wide range of tasks, with the best results obtained with large training sets and large models. In the past, GPUs enabled these breakthroughs because of their greater…

Machine Learning · Computer Science 2016-04-19 Matthieu Courbariaux , Yoshua Bengio , Jean-Pierre David

Recurrent neural networks (RNNs), including long short-term memory (LSTM) RNNs, have produced state-of-the-art results on a variety of speech recognition tasks. However, these models are often too large in size for deployment on mobile…

Machine Learning · Computer Science 2016-04-12 Zhiyun Lu , Vikas Sindhwani , Tara N. Sainath

In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional…

Machine Learning · Computer Science 2023-02-17 Tayfun Gokmen , Malte Rasch , Wilfried Haensch

Recurrent Neural Networks (RNNs) produce state-of-art performance on many machine learning tasks but their demand on resources in terms of memory and computational power are often high. Therefore, there is a great interest in optimizing the…

Neural and Evolutionary Computing · Computer Science 2017-02-28 Joachim Ott , Zhouhan Lin , Ying Zhang , Shih-Chii Liu , Yoshua Bengio
‹ Prev 1 2 3 10 Next ›