Related papers: TinyProp -- Adaptive Sparse Backpropagation for Ef…

Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation

This study introduces TinyPropv2, an innovative algorithm optimized for on-device learning in deep neural networks, specifically designed for low-power microcontroller units. TinyPropv2 refines sparse backpropagation by dynamically…

Machine Learning · Computer Science 2024-09-12 Marcus Rüb , Axel Sikora , Daniel Mueller-Gritschneder

SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and…

Machine Learning · Computer Science 2023-02-10 Mahdi Nikdan , Tommaso Pegolotti , Eugenia Iofinova , Eldar Kurtic , Dan Alistarh

TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge

On-device training is essential for user personalisation and privacy. With the pervasiveness of IoT devices and microcontroller units (MCUs), this task becomes more challenging due to the constrained memory and compute resources, and the…

Machine Learning · Computer Science 2024-06-12 Young D. Kwon , Rui Li , Stylianos I. Venieris , Jagmohan Chauhan , Nicholas D. Lane , Cecilia Mascolo

ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation

Recently, deep learning has made remarkable strides, especially with generative modeling, such as large language models and probabilistic diffusion models. However, training these models often involves significant computational resources,…

Machine Learning · Computer Science 2024-12-31 Lujia Zhong , Shuo Huang , Yonggang Shi

PaReprop: Fast Parallelized Reversible Backpropagation

The growing size of datasets and deep learning models has made faster and memory-efficient training crucial. Reversible transformers have recently been introduced as an exciting new method for extremely memory-efficient training, but they…

Machine Learning · Computer Science 2023-06-16 Tyler Zhu , Karttikeya Mangalam

Direct Feedback Alignment with Sparse Connections for Local Learning

Recent advances in deep neural networks (DNNs) owe their success to training algorithms that use backpropagation and gradient-descent. Backpropagation, while highly effective on von Neumann architectures, becomes inefficient when scaling to…

Neural and Evolutionary Computing · Computer Science 2019-05-10 Brian Crafton , Abhinav Parihar , Evan Gebhardt , Arijit Raychowdhury

Dithered backprop: A sparse and quantized backpropagation algorithm for more efficient deep neural network training

Deep Neural Networks are successful but highly computationally expensive learning systems. One of the main sources of time and energy drains is the well known backpropagation (backprop) algorithm, which roughly accounts for 2/3 of the…

Machine Learning · Computer Science 2020-04-17 Simon Wiedemann , Temesgen Mehari , Kevin Kepp , Wojciech Samek

meProp: Sparsified Back Propagation for Accelerated Deep Learning with Reduced Overfitting

We propose a simple yet effective technique for neural network learning. The forward propagation is computed as usual. In back propagation, only a small subset of the full gradient is computed to update the model parameters. The gradient…

Machine Learning · Computer Science 2019-03-12 Xu Sun , Xuancheng Ren , Shuming Ma , Houfeng Wang

Backprop with Approximate Activations for Memory-efficient Network Training

Training convolutional neural network models is memory intensive since back-propagation requires storing activations of all intermediate layers. This presents a practical concern when seeking to deploy very deep architectures in production,…

Machine Learning · Computer Science 2019-10-30 Ayan Chakrabarti , Benjamin Moseley

Proximal Backpropagation

We propose proximal backpropagation (ProxProp) as a novel algorithm that takes implicit instead of explicit gradient steps to update the network parameters during neural network training. Our algorithm is motivated by the step size…

Machine Learning · Computer Science 2018-02-21 Thomas Frerix , Thomas Möllenhoff , Michael Moeller , Daniel Cremers

Event-Based Backpropagation can compute Exact Gradients for Spiking Neural Networks

Spiking neural networks combine analog computation with event-based communication using discrete spikes. While the impressive advances of deep learning are enabled by training non-spiking artificial neural networks using the backpropagation…

Neurons and Cognition · Quantitative Biology 2021-06-22 Timo C. Wunderlich , Christian Pehle

TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning

On-device learning enables edge devices to continually adapt the AI models to new data, which requires a small memory footprint to fit the tight memory constraint of edge devices. Existing work solves this problem by reducing the number of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Han Cai , Chuang Gan , Ligeng Zhu , Song Han

MuProp: Unbiased Backpropagation for Stochastic Neural Networks

Deep neural networks are powerful parametric models that can be trained efficiently using the backpropagation algorithm. Stochastic neural networks combine the power of large parametric functions with that of graphical models, which makes…

Machine Learning · Computer Science 2016-02-26 Shixiang Gu , Sergey Levine , Ilya Sutskever , Andriy Mnih

SparseProp: Efficient Event-Based Simulation and Training of Sparse Recurrent Spiking Neural Networks

Spiking Neural Networks (SNNs) are biologically-inspired models that are capable of processing information in streams of action potentials. However, simulating and training SNNs is computationally expensive due to the need to solve large…

Neurons and Cognition · Quantitative Biology 2023-12-29 Rainer Engelken

A Negative Result on Gradient Matching for Selective Backprop

With increasing scale in model and dataset size, the training of deep neural networks becomes a massive computational burden. One approach to speed up the training process is Selective Backprop. For this approach, we perform a forward pass…

Machine Learning · Computer Science 2023-12-11 Lukas Balles , Cedric Archambeau , Giovanni Zappella

Accelerating Deep Learning by Focusing on the Biggest Losers

This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward…

Machine Learning · Computer Science 2019-10-03 Angela H. Jiang , Daniel L. -K. Wong , Giulio Zhou , David G. Andersen , Jeffrey Dean , Gregory R. Ganger , Gauri Joshi , Michael Kaminksy , Michael Kozuch , Zachary C. Lipton , Padmanabhan Pillai

Memorized Sparse Backpropagation

Neural network learning is usually time-consuming since backpropagation needs to compute full gradients and backpropagate them across multiple layers. Despite its success of existing works in accelerating propagation through sparseness, the…

Machine Learning · Computer Science 2020-10-28 Zhiyuan Zhang , Pengcheng Yang , Xuancheng Ren , Qi Su , Xu Sun

Sparse Networks from Scratch: Faster Training without Losing Performance

We demonstrate the possibility of what we call sparse learning: accelerated training of deep neural networks that maintain sparse weights throughout training while achieving dense performance levels. We accomplish this by developing sparse…

Machine Learning · Computer Science 2019-08-27 Tim Dettmers , Luke Zettlemoyer

Deep Adaptive Network: An Efficient Deep Neural Network with Sparse Binary Connections

Deep neural networks are state-of-the-art models for understanding the content of images, video and raw input data. However, implementing a deep neural network in embedded systems is a challenging task, because a typical deep neural…

Machine Learning · Computer Science 2016-04-22 Xichuan Zhou , Shengli Li , Kai Qin , Kunping Li , Fang Tang , Shengdong Hu , Shujun Liu , Zhi Lin

Explicitly Trained Spiking Sparsity in Spiking Neural Networks with Backpropagation

Spiking Neural Networks (SNNs) are being explored for their potential energy efficiency resulting from sparse, event-driven computations. Many recent works have demonstrated effective backpropagation for deep Spiking Neural Networks (SNNs)…

Neural and Evolutionary Computing · Computer Science 2020-03-04 Jason M. Allred , Steven J. Spencer , Gopalakrishnan Srinivasan , Kaushik Roy