English
Related papers

Related papers: Backpropagation in matrix notation

200 papers

A Deep Neural Network (DNN) is a composite function of vector-valued functions, and in order to train a DNN, it is necessary to calculate the gradient of the loss function with respect to all parameters. This calculation can be a…

Machine Learning · Computer Science 2023-06-02 Saeed Damadi , Golnaz Moharrer , Mostafa Cham

We present a simplified computational rule for the back-propagation formulas for artificial neural networks. In this work, we provide a generic two-step rule for the back-propagation algorithm in matrix notation. Moreover, this rule…

Neural and Evolutionary Computing · Computer Science 2023-05-17 Ahmed Boughammoura

This paper provides a comprehensive and detailed derivation of the backpropagation algorithm for graph convolutional neural networks using matrix calculus. The derivation is extended to include arbitrary element-wise activation functions…

Machine Learning · Computer Science 2024-08-05 Yen-Che Hsiao , Rongting Yue , Abhishek Dutta

Backpropagation (BP) is a core component of the contemporary deep learning incarnation of neural networks. Briefly, BP is an algorithm that exploits the computational architecture of neural networks to efficiently evaluate the gradient of a…

Machine Learning · Statistics 2021-07-21 Dirk Ostwald , Franziska Usée

Using backpropagation to compute gradients of objective functions for optimization has remained a mainstay of machine learning. Backpropagation, or reverse-mode differentiation, is a special case within the general family of automatic…

Machine Learning · Computer Science 2022-02-18 Atılım Güneş Baydin , Barak A. Pearlmutter , Don Syme , Frank Wood , Philip Torr

Backpropagation algorithm is the cornerstone for neural network analysis. Paper extends it for training any derivatives of neural network's output with respect to its input. By the dint of it feedforward networks can be used to solve or…

Neural and Evolutionary Computing · Computer Science 2017-12-13 V. I. Avrutskiy

Backpropagation is a classic automatic differentiation algorithm computing the gradient of functions specified by a certain class of simple, first-order programs, called computational graphs. It is a fundamental tool in several fields, most…

Logic in Computer Science · Computer Science 2019-11-07 Alois Brunel , Damiano Mazza , Michele Pagani

The study of Newton's method in complex-valued neural networks faces many difficulties. In this paper, we derive Newton's method backpropagation algorithms for complex-valued holomorphic multilayer perceptrons, and investigate the…

Complex Variables · Mathematics 2014-06-23 Diana Thomson La Corte , Yi Ming Zou

While backpropagation--reverse-mode automatic differentiation--has been extraordinarily successful in deep learning, it requires two passes (forward and backward) through the neural network and the storage of intermediate activations.…

Machine Learning · Computer Science 2025-11-06 Daniel Wang , Evan Markou , Dylan Campbell

The backpropagation algorithm for neural networks is widely felt hard to understand, despite the existence of some well-written explanations and/or derivations. This paper provides a new derivation of this algorithm based on the concept of…

Machine Learning · Computer Science 2022-05-03 Yiping Cheng

The gradients used to train neural networks are typically computed using backpropagation. While an efficient way to obtain exact gradients, backpropagation is computationally expensive, hinders parallelization, and is biologically…

Machine Learning · Computer Science 2026-01-14 Katharina Flügel , Daniel Coquelin , Marie Weiel , Charlotte Debus , Achim Streit , Markus Götz

In modern neural networks like Transformers, linear layers require significant memory to store activations during backward pass. This study proposes a memory reduction approach to perform backpropagation through linear layers. Since the…

Machine Learning · Computer Science 2022-02-04 Daniel Bershatsky , Aleksandr Mikhalev , Alexandr Katrutsa , Julia Gusak , Daniil Merkulov , Ivan Oseledets

We introduce a new technique for gradient normalization during neural network training. The gradients are rescaled during the backward pass using normalization layers introduced at certain points within the network architecture. These…

Machine Learning · Computer Science 2021-06-18 Alejandro Cabana , Luis F. Lago-Fernández

This work proposes an algorithm for taking advantage of backpropagation gradients to determine feature importance at different stages of training. Additionally, we propose a way to represent the learning process qualitatively. Experiments…

Machine Learning · Computer Science 2020-12-08 Roberto Reyes-Ochoa , Servando Lopez-Aguayo

We propose a diagrammatic notation for matrix differentiation. Our new notation enables us to derive formulas for matrix differentiation more easily than the usual matrix (or index) notation. We demonstrate the effectiveness of our notation…

Signal Processing · Electrical Eng. & Systems 2022-07-12 Kenji Nakahira

Neural networks have been able to achieve groundbreaking accuracy at tasks conventionally considered only doable by humans. Using stochastic gradient descent, optimization in many dimensions is made possible, albeit at a relatively high…

Machine Learning · Computer Science 2017-07-17 Hirsh R. Agarwal , Andrew Huang

In this paper we solve the problem: how to determine maximal allowable errors, possible for signals and parameters of each element of a network proceeding from the condition that the vector of output signals of the network should be…

Disordered Systems and Neural Networks · Physics 2022-05-18 M. Yu. Senashova , A. N. Gorban , D. C. Wunsch

The development of the back-propagation algorithm represents a landmark in neural networks. We provide an approach that conducts the back-propagation again to reverse the traditional back-propagation process to optimize the input loss at…

Machine Learning · Computer Science 2022-02-15 Weiming Xiong , Ruoyu Yang

This course, intended for undergraduates familiar with elementary calculus and linear algebra, introduces the extension of differential calculus to functions on more general vector spaces, such as functions that take as input a matrix and…

History and Overview · Mathematics 2025-01-28 Paige Bright , Alan Edelman , Steven G. Johnson

Many popular feature-attribution methods for interpreting deep neural networks rely on computing the gradients of a model's output with respect to its inputs. While these methods can indicate which input features may be important for the…

Machine Learning · Computer Science 2023-07-11 Kevin Du , Lucas Torroba Hennigen , Niklas Stoehr , Alexander Warstadt , Ryan Cotterell
‹ Prev 1 2 3 10 Next ›