Related papers: Extraction Propagation

Long Timescale Credit Assignment in NeuralNetworks with External Memory

Credit assignment in traditional recurrent neural networks usually involves back-propagating through a long chain of tied weight matrices. The length of this chain scales linearly with the number of time-steps as the same network is run at…

Artificial Intelligence · Computer Science 2017-01-17 Steven Stenberg Hansen

Formal derivation of Mesh Neural Networks with their Forward-Only gradient Propagation

This paper proposes the Mesh Neural Network (MNN), a novel architecture which allows neurons to be connected in any topology, to efficiently route information. In MNNs, information is propagated between neurons throughout a state transition…

Machine Learning · Computer Science 2021-10-01 Federico A. Galatolo , Mario G. C. A. Cimino , Gigliola Vaglini

Differentiable Architecture Pruning for Transfer Learning

We propose a new gradient-based approach for extracting sub-architectures from a given large model. Contrarily to existing pruning methods, which are unable to disentangle the network architecture and the corresponding weights, our…

Machine Learning · Computer Science 2021-07-08 Nicolo Colombo , Yang Gao

The Forward-Forward Algorithm: Some Preliminary Investigations

The aim of this paper is to introduce a new learning procedure for neural networks and to demonstrate that it works well enough on a few small problems to be worth further investigation. The Forward-Forward algorithm replaces the forward…

Machine Learning · Computer Science 2022-12-29 Geoffrey Hinton

Difference Target Propagation

Back-propagation has been the workhorse of recent successes of deep learning but it relies on infinitesimal effects (partial derivatives) in order to perform credit assignment. This could become a serious issue as one considers deeper and…

Machine Learning · Computer Science 2015-11-26 Dong-Hyun Lee , Saizheng Zhang , Asja Fischer , Yoshua Bengio

Beyond Backpropagation: Optimization with Multi-Tangent Forward Gradients

The gradients used to train neural networks are typically computed using backpropagation. While an efficient way to obtain exact gradients, backpropagation is computationally expensive, hinders parallelization, and is biologically…

Machine Learning · Computer Science 2026-01-14 Katharina Flügel , Daniel Coquelin , Marie Weiel , Charlotte Debus , Achim Streit , Markus Götz

Minimal Effort Back Propagation for Convolutional Neural Networks

As traditional neural network consumes a significant amount of computing resources during back propagation, \citet{Sun2017mePropSB} propose a simple yet effective technique to alleviate this problem. In this technique, only a small subset…

Machine Learning · Computer Science 2017-09-27 Bingzhen Wei , Xu Sun , Xuancheng Ren , Jingjing Xu

Towards Scalable Backpropagation-Free Gradient Estimation

While backpropagation--reverse-mode automatic differentiation--has been extraordinarily successful in deep learning, it requires two passes (forward and backward) through the neural network and the storage of intermediate activations.…

Machine Learning · Computer Science 2025-11-06 Daniel Wang , Evan Markou , Dylan Campbell

On the Two Sides of Redundancy in Graph Neural Networks

Message passing neural networks iteratively generate node embeddings by aggregating information from neighboring nodes. With increasing depth, information from more distant nodes is included. However, node embeddings may be unable to…

Machine Learning · Computer Science 2024-03-29 Franka Bause , Samir Moustafa , Johannes Langguth , Wilfried N. Gansterer , Nils M. Kriege

Backpropagation generalized for output derivatives

Backpropagation algorithm is the cornerstone for neural network analysis. Paper extends it for training any derivatives of neural network's output with respect to its input. By the dint of it feedforward networks can be used to solve or…

Neural and Evolutionary Computing · Computer Science 2017-12-13 V. I. Avrutskiy

Decoupled Neural Interfaces using Synthetic Gradients

Training directed neural networks typically requires forward-propagating data through a computation graph, followed by backpropagating error signal, to produce weight updates. All layers, or more generally, modules, of the network are…

Machine Learning · Computer Science 2017-07-04 Max Jaderberg , Wojciech Marian Czarnecki , Simon Osindero , Oriol Vinyals , Alex Graves , David Silver , Koray Kavukcuoglu

Efficient recurrent architectures through activity sparsity and sparse back-propagation through time

Recurrent neural networks (RNNs) are well suited for solving sequence tasks in resource-constrained systems due to their expressivity and low computational requirements. However, there is still a need to bridge the gap between what RNNs are…

Machine Learning · Computer Science 2023-03-13 Anand Subramoney , Khaleelulla Khan Nazeer , Mark Schöne , Christian Mayr , David Kappel

Neural Message Passing on High Order Paths

Graph neural network have achieved impressive results in predicting molecular properties, but they do not directly account for local and hidden structures in the graph such as functional groups and molecular geometry. At each propagation…

Machine Learning · Computer Science 2020-02-25 Daniel Flam-Shepherd , Tony Wu , Pascal Friederich , Alan Aspuru-Guzik

Convolutional neural networks with fractional order gradient method

This paper proposes a fractional order gradient method for the backward propagation of convolutional neural networks. To overcome the problem that fractional order gradient method cannot converge to real extreme point, a simplified…

Optimization and Control · Mathematics 2020-01-07 Dian Sheng , Yiheng Wei , Yuquan Chen , Yong Wang

Scalable Forward-Forward Algorithm

We propose a scalable Forward-Forward (FF) algorithm that eliminates the need for backpropagation by training each layer separately. Unlike backpropagation, FF avoids backward gradients and can be more modular and memory efficient, making…

Machine Learning · Computer Science 2025-01-07 Andrii Krutsylo

Vector Field Based Neural Networks

A novel Neural Network architecture is proposed using the mathematically and physically rich idea of vector fields as hidden layers to perform nonlinear transformations in the data. The data points are interpreted as particles moving along…

Machine Learning · Computer Science 2018-02-23 Daniel Vieira , Fabio Rangel , Fabricio Firmino , Joao Paixao

Tensor-Based Backpropagation in Neural Networks with Non-Sequential Input

Neural networks have been able to achieve groundbreaking accuracy at tasks conventionally considered only doable by humans. Using stochastic gradient descent, optimization in many dimensions is made possible, albeit at a relatively high…

Machine Learning · Computer Science 2017-07-17 Hirsh R. Agarwal , Andrew Huang

Traversing Between Modes in Function Space for Fast Ensembling

Deep ensemble is a simple yet powerful way to improve the performance of deep neural networks. Under this motivation, recent works on mode connectivity have shown that parameters of ensembles are connected by low-loss subspaces, and one can…

Machine Learning · Computer Science 2023-06-21 EungGu Yun , Hyungi Lee , Giung Nam , Juho Lee

Neural Nets via Forward State Transformation and Backward Loss Transformation

This article studies (multilayer perceptron) neural networks with an emphasis on the transformations involved --- both forward and backward --- in order to develop a semantical/logical perspective that is in line with standard program…

Neural and Evolutionary Computing · Computer Science 2018-03-28 Bart Jacobs , David Sprunger

Neural Random-Access Machines

In this paper, we propose and investigate a new neural network architecture called Neural Random Access Machine. It can manipulate and dereference pointers to an external variable-size random-access memory. The model is trained from pure…

Machine Learning · Computer Science 2016-02-11 Karol Kurach , Marcin Andrychowicz , Ilya Sutskever