Related papers: The Backpropagation algorithm for a math student

A Theoretical View of Linear Backpropagation and Its Convergence

Backpropagation (BP) is widely used for calculating gradients in deep neural networks (DNNs). Applied often along with stochastic gradient descent (SGD) or its variants, BP is considered as a de-facto choice in a variety of machine learning…

Machine Learning · Computer Science 2024-01-11 Ziang Li , Yiwen Guo , Haodi Liu , Changshui Zhang

An induction proof of the backpropagation algorithm in matrix notation

Backpropagation (BP) is a core component of the contemporary deep learning incarnation of neural networks. Briefly, BP is an algorithm that exploits the computational architecture of neural networks to efficiently evaluate the gradient of a…

Machine Learning · Statistics 2021-07-21 Dirk Ostwald , Franziska Usée

Backpropagation generalized for output derivatives

Backpropagation algorithm is the cornerstone for neural network analysis. Paper extends it for training any derivatives of neural network's output with respect to its input. By the dint of it feedforward networks can be used to solve or…

Neural and Evolutionary Computing · Computer Science 2017-12-13 V. I. Avrutskiy

Deep learning for pedestrians: backpropagation in CNNs

The goal of this document is to provide a pedagogical introduction to the main concepts underpinning the training of deep neural networks using gradient descent; a process known as backpropagation. Although we focus on a very influential…

Machine Learning · Computer Science 2018-11-30 Laurent Boué

A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training

Training deep neural networks (DNNs) efficiently is a challenge due to the associated highly nonconvex optimization. The backpropagation (backprop) algorithm has long been the most widely used algorithm for gradient computation of…

Machine Learning · Statistics 2018-03-28 Tim Tsz-Kit Lau , Jinshan Zeng , Baoyuan Wu , Yuan Yao

Towards Scalable Backpropagation-Free Gradient Estimation

While backpropagation--reverse-mode automatic differentiation--has been extraordinarily successful in deep learning, it requires two passes (forward and backward) through the neural network and the storage of intermediate activations.…

Machine Learning · Computer Science 2025-11-06 Daniel Wang , Evan Markou , Dylan Campbell

Estimating the Jacobian matrix of an unknown multivariate function from sample values by means of a neural network

We describe, implement and test a novel method for training neural networks to estimate the Jacobian matrix $J$ of an unknown multivariate function $F$. The training set is constructed from finitely many pairs $(x,F(x))$ and it contains no…

Machine Learning · Computer Science 2022-04-04 Frédéric Latrémolière , Sadananda Narayanappa , Petr Vojtěchovský

A Novel Representation of Neural Networks

Deep Neural Networks (DNNs) have become very popular for prediction in many areas. Their strength is in representation with a high number of parameters that are commonly learned via gradient descent or similar optimization methods. However,…

Machine Learning · Statistics 2016-10-11 Anthony Caterini , Dong Eui Chang

Backpropagation in the Simply Typed Lambda-calculus with Linear Negation

Backpropagation is a classic automatic differentiation algorithm computing the gradient of functions specified by a certain class of simple, first-order programs, called computational graphs. It is a fundamental tool in several fields, most…

Logic in Computer Science · Computer Science 2019-11-07 Alois Brunel , Damiano Mazza , Michele Pagani

Moonwalk: Inverse-Forward Differentiation

Backpropagation's main limitation is its need to store intermediate activations (residuals) during the forward pass, which restricts the depth of trainable networks. This raises a fundamental question: can we avoid storing these…

Machine Learning · Computer Science 2026-05-25 Dmitrii Krylov , Armin Karamzade , Roy Fox

Efficient Deep Learning with Decorrelated Backpropagation

The backpropagation algorithm remains the dominant and most successful method for training deep neural networks (DNNs). At the same time, training DNNs at scale comes at a significant computational cost and therefore a high carbon…

Machine Learning · Computer Science 2025-11-12 Sander Dalm , Joshua Offergeld , Nasir Ahmad , Marcel van Gerven

Alternating optimization method based on nonnegative matrix factorizations for deep neural networks

The backpropagation algorithm for calculating gradients has been widely used in computation of weights for deep neural networks (DNNs). This method requires derivatives of objective functions and has some difficulties finding appropriate…

Machine Learning · Computer Science 2016-05-17 Tetsuya Sakurai , Akira Imakura , Yuto Inoue , Yasunori Futamura

A comparative study of back propagation and its alternatives on multilayer perceptrons

The de facto algorithm for training the back pass of a feedforward neural network is backpropagation (BP). The use of almost-everywhere differentiable activation functions made it efficient and effective to propagate the gradient backwards…

Neural and Evolutionary Computing · Computer Science 2022-06-14 John Waldo

Integrated photonic deep neural network with end-to-end on-chip backpropagation training

Integrated photonic neural networks (PNNs) have demonstrated significant potential to complement the digital electronic counterparts [1-3]. Nevertheless, robust and repeatable performance of scalable integrated PNNs is directly tied to the…

Optics · Physics 2025-06-18 Farshid Ashtiani , Mohamad Hossein Idjadi , Kwangwoong Kim

The Golden Ratio of Learning and Momentum

Gradient descent has been a central training principle for artificial neural networks from the early beginnings to today's deep learning networks. The most common implementation is the backpropagation algorithm for training feed-forward…

Machine Learning · Computer Science 2020-06-09 Stefan Jaeger

Learning in the Machine: Random Backpropagation and the Deep Learning Channel

Random backpropagation (RBP) is a variant of the backpropagation algorithm for training neural networks, where the transpose of the forward matrices are replaced by fixed random matrices in the calculation of the weight updates. It is…

Machine Learning · Computer Science 2017-12-25 Pierre Baldi , Peter Sadowski , Zhiqin Lu

Newton's Method Backpropagation for Complex-Valued Holomorphic Multilayer Perceptrons

The study of Newton's method in complex-valued neural networks faces many difficulties. In this paper, we derive Newton's method backpropagation algorithms for complex-valued holomorphic multilayer perceptrons, and investigate the…

Complex Variables · Mathematics 2014-06-23 Diana Thomson La Corte , Yi Ming Zou

Training Neural Networks Using Features Replay

Training a neural network using backpropagation algorithm requires passing error gradients sequentially through the network. The backward locking prevents us from updating network layers in parallel and fully leveraging the computing…

Machine Learning · Computer Science 2019-05-30 Zhouyuan Huo , Bin Gu , Heng Huang

A deep learning theory for neural networks grounded in physics

In the last decade, deep learning has become a major component of artificial intelligence. The workhorse of deep learning is the optimization of loss functions by stochastic gradient descent (SGD). Traditionally in deep learning, neural…

Machine Learning · Computer Science 2021-04-27 Benjamin Scellier

Decoupled Parallel Backpropagation with Convergence Guarantee

Backpropagation algorithm is indispensable for the training of feedforward neural networks. It requires propagating error gradients sequentially from the output layer all the way back to the input layer. The backward locking in…

Machine Learning · Computer Science 2018-07-24 Zhouyuan Huo , Bin Gu , Qian Yang , Heng Huang