English
Related papers

Related papers: TinyProp -- Adaptive Sparse Backpropagation for Ef…

200 papers

This study introduces TinyPropv2, an innovative algorithm optimized for on-device learning in deep neural networks, specifically designed for low-power microcontroller units. TinyPropv2 refines sparse backpropagation by dynamically…

Machine Learning · Computer Science 2024-09-12 Marcus Rüb , Axel Sikora , Daniel Mueller-Gritschneder

We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and…

Machine Learning · Computer Science 2023-02-10 Mahdi Nikdan , Tommaso Pegolotti , Eugenia Iofinova , Eldar Kurtic , Dan Alistarh

On-device training is essential for user personalisation and privacy. With the pervasiveness of IoT devices and microcontroller units (MCUs), this task becomes more challenging due to the constrained memory and compute resources, and the…

Machine Learning · Computer Science 2024-06-12 Young D. Kwon , Rui Li , Stylianos I. Venieris , Jagmohan Chauhan , Nicholas D. Lane , Cecilia Mascolo

Recently, deep learning has made remarkable strides, especially with generative modeling, such as large language models and probabilistic diffusion models. However, training these models often involves significant computational resources,…

Machine Learning · Computer Science 2024-12-31 Lujia Zhong , Shuo Huang , Yonggang Shi

The growing size of datasets and deep learning models has made faster and memory-efficient training crucial. Reversible transformers have recently been introduced as an exciting new method for extremely memory-efficient training, but they…

Machine Learning · Computer Science 2023-06-16 Tyler Zhu , Karttikeya Mangalam

Recent advances in deep neural networks (DNNs) owe their success to training algorithms that use backpropagation and gradient-descent. Backpropagation, while highly effective on von Neumann architectures, becomes inefficient when scaling to…

Neural and Evolutionary Computing · Computer Science 2019-05-10 Brian Crafton , Abhinav Parihar , Evan Gebhardt , Arijit Raychowdhury

Deep Neural Networks are successful but highly computationally expensive learning systems. One of the main sources of time and energy drains is the well known backpropagation (backprop) algorithm, which roughly accounts for 2/3 of the…

Machine Learning · Computer Science 2020-04-17 Simon Wiedemann , Temesgen Mehari , Kevin Kepp , Wojciech Samek

We propose a simple yet effective technique for neural network learning. The forward propagation is computed as usual. In back propagation, only a small subset of the full gradient is computed to update the model parameters. The gradient…

Machine Learning · Computer Science 2019-03-12 Xu Sun , Xuancheng Ren , Shuming Ma , Houfeng Wang

Training convolutional neural network models is memory intensive since back-propagation requires storing activations of all intermediate layers. This presents a practical concern when seeking to deploy very deep architectures in production,…

Machine Learning · Computer Science 2019-10-30 Ayan Chakrabarti , Benjamin Moseley

We propose proximal backpropagation (ProxProp) as a novel algorithm that takes implicit instead of explicit gradient steps to update the network parameters during neural network training. Our algorithm is motivated by the step size…

Machine Learning · Computer Science 2018-02-21 Thomas Frerix , Thomas Möllenhoff , Michael Moeller , Daniel Cremers

Spiking neural networks combine analog computation with event-based communication using discrete spikes. While the impressive advances of deep learning are enabled by training non-spiking artificial neural networks using the backpropagation…

Neurons and Cognition · Quantitative Biology 2021-06-22 Timo C. Wunderlich , Christian Pehle

On-device learning enables edge devices to continually adapt the AI models to new data, which requires a small memory footprint to fit the tight memory constraint of edge devices. Existing work solves this problem by reducing the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Han Cai , Chuang Gan , Ligeng Zhu , Song Han

Deep neural networks are powerful parametric models that can be trained efficiently using the backpropagation algorithm. Stochastic neural networks combine the power of large parametric functions with that of graphical models, which makes…

Machine Learning · Computer Science 2016-02-26 Shixiang Gu , Sergey Levine , Ilya Sutskever , Andriy Mnih

Spiking Neural Networks (SNNs) are biologically-inspired models that are capable of processing information in streams of action potentials. However, simulating and training SNNs is computationally expensive due to the need to solve large…

Neurons and Cognition · Quantitative Biology 2023-12-29 Rainer Engelken

With increasing scale in model and dataset size, the training of deep neural networks becomes a massive computational burden. One approach to speed up the training process is Selective Backprop. For this approach, we perform a forward pass…

Machine Learning · Computer Science 2023-12-11 Lukas Balles , Cedric Archambeau , Giovanni Zappella

This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward…

Neural network learning is usually time-consuming since backpropagation needs to compute full gradients and backpropagate them across multiple layers. Despite its success of existing works in accelerating propagation through sparseness, the…

Machine Learning · Computer Science 2020-10-28 Zhiyuan Zhang , Pengcheng Yang , Xuancheng Ren , Qi Su , Xu Sun

We demonstrate the possibility of what we call sparse learning: accelerated training of deep neural networks that maintain sparse weights throughout training while achieving dense performance levels. We accomplish this by developing sparse…

Machine Learning · Computer Science 2019-08-27 Tim Dettmers , Luke Zettlemoyer

Deep neural networks are state-of-the-art models for understanding the content of images, video and raw input data. However, implementing a deep neural network in embedded systems is a challenging task, because a typical deep neural…

Machine Learning · Computer Science 2016-04-22 Xichuan Zhou , Shengli Li , Kai Qin , Kunping Li , Fang Tang , Shengdong Hu , Shujun Liu , Zhi Lin

Spiking Neural Networks (SNNs) are being explored for their potential energy efficiency resulting from sparse, event-driven computations. Many recent works have demonstrated effective backpropagation for deep Spiking Neural Networks (SNNs)…

Neural and Evolutionary Computing · Computer Science 2020-03-04 Jason M. Allred , Steven J. Spencer , Gopalakrishnan Srinivasan , Kaushik Roy
‹ Prev 1 2 3 10 Next ›