Related papers: Persistent Neurons

Advances in Optimizing Recurrent Networks

After a more than decade-long period of relatively little research activity in the area of recurrent neural networks, several new developments will be reviewed here that have allowed substantial progress both in understanding and in…

Machine Learning · Computer Science 2012-12-17 Yoshua Bengio , Nicolas Boulanger-Lewandowski , Razvan Pascanu

Is Learning in Biological Neural Networks based on Stochastic Gradient Descent? An analysis using stochastic processes

In recent years, there has been an intense debate about how learning in biological neural networks (BNNs) differs from learning in artificial neural networks. It is often argued that the updating of connections in the brain relies only on…

Neurons and Cognition · Quantitative Biology 2024-04-11 Sören Christensen , Jan Kallsen

RNNs of RNNs: Recursive Construction of Stable Assemblies of Recurrent Neural Networks

Recurrent neural networks (RNNs) are widely used throughout neuroscience as models of local neural activity. Many properties of single RNNs are well characterized theoretically, but experimental neuroscience has moved in the direction of…

Machine Learning · Computer Science 2023-01-31 Leo Kozachkov , Michaela Ennis , Jean-Jacques Slotine

Training neural networks without backpropagation using particles

Neural networks are a group of neurons stacked together in multiple layers to mimic the biological neurons in a human brain. Neural networks have been trained using the backpropagation algorithm based on gradient descent strategy for…

Neural and Evolutionary Computing · Computer Science 2025-04-22 Deepak Kumar

Structured Pruning of Recurrent Neural Networks through Neuron Selection

Recurrent neural networks (RNNs) have recently achieved remarkable successes in a number of applications. However, the huge sizes and computational burden of these models make it difficult for their deployment on edge devices. A practically…

Machine Learning · Computer Science 2019-12-10 Liangjian Wen , Xuanyang Zhang , Haoli Bai , Zenglin Xu

Recurrent neural networks: vanishing and exploding gradients are not the end of the story

Recurrent neural networks (RNNs) notoriously struggle to learn long-term memories, primarily due to vanishing and exploding gradients. The recent success of state-space models (SSMs), a subclass of RNNs, to overcome such difficulties…

Machine Learning · Computer Science 2024-11-06 Nicolas Zucchet , Antonio Orvieto

Recurrent Neural Networks: An Embedded Computing Perspective

Recurrent Neural Networks (RNNs) are a class of machine learning algorithms used for applications with time-series and sequential data. Recently, there has been a strong interest in executing RNNs on embedded devices. However, difficulties…

Neural and Evolutionary Computing · Computer Science 2020-03-23 Nesma M. Rezk , Madhura Purnaprajna , Tomas Nordström , Zain Ul-Abdin

Training Recurrent Neural Networks via Dynamical Trajectory-Based Optimization

This paper introduces a new method to train recurrent neural networks using dynamical trajectory-based optimization. The optimization method utilizes a projected gradient system (PGS) and a quotient gradient system (QGS) to determine the…

Signal Processing · Electrical Eng. & Systems 2019-10-16 Hamid Khodabandehlou , M. Sami Fadali

Recurrent Neural Networks With Limited Numerical Precision

Recurrent Neural Networks (RNNs) produce state-of-art performance on many machine learning tasks but their demand on resources in terms of memory and computational power are often high. Therefore, there is a great interest in optimizing the…

Neural and Evolutionary Computing · Computer Science 2017-02-28 Joachim Ott , Zhouhan Lin , Ying Zhang , Shih-Chii Liu , Yoshua Bengio

Neural Networks in Evolutionary Dynamic Constrained Optimization: Computational Cost and Benefits

Neural networks (NN) have been recently applied together with evolutionary algorithms (EAs) to solve dynamic optimization problems. The applied NN estimates the position of the next optimum based on the previous time best solutions. After…

Neural and Evolutionary Computing · Computer Science 2020-02-03 Maryam Hasani-Shoreh , Renato Hermoza Aragonés , Frank Neumann

Neurogenesis Deep Learning

Neural machine learning methods, such as deep neural networks (DNN), have achieved remarkable success in a number of complex data processing tasks. These methods have arguably had their strongest impact on tasks such as image and audio…

Neural and Evolutionary Computing · Computer Science 2017-07-11 Timothy J. Draelos , Nadine E. Miner , Christopher C. Lamb , Jonathan A. Cox , Craig M. Vineyard , Kristofor D. Carlson , William M. Severa , Conrad D. James , James B. Aimone

A Meta-Learned Neuron model for Continual Learning

Continual learning is the ability to acquire new knowledge without forgetting the previously learned one, assuming no further access to past training data. Neural network approximators trained with gradient descent are known to fail in this…

Machine Learning · Computer Science 2021-11-05 Rodrigue Siry

Autoencoder Based Residual Deep Networks for Robust Regression Prediction and Spatiotemporal Estimation

To have a superior generalization, a deep learning neural network often involves a large size of training sample. With increase of hidden layers in order to increase learning ability, neural network has potential degradation in accuracy.…

Machine Learning · Computer Science 2019-01-01 Lianfa Li , Ying Fang , Jun Wu , Jinfeng Wang

Long short-term memory and learning-to-learn in networks of spiking neurons

Recurrent networks of spiking neurons (RSNNs) underlie the astounding computing and learning capabilities of the brain. But computing and learning capabilities of RSNN models have remained poor, at least in comparison with artificial neural…

Neural and Evolutionary Computing · Computer Science 2018-12-27 Guillaume Bellec , Darjan Salaj , Anand Subramoney , Robert Legenstein , Wolfgang Maass

Are ResNets Provably Better than Linear Predictors?

A residual network (or ResNet) is a standard deep neural net architecture, with state-of-the-art performance across numerous applications. The main premise of ResNets is that they allow the training of each layer to focus on fitting just…

Machine Learning · Computer Science 2018-09-28 Ohad Shamir

Self-Expanding Neural Networks

The results of training a neural network are heavily dependent on the architecture chosen; and even a modification of only its size, however small, typically involves restarting the training process. In contrast to this, we begin training…

Machine Learning · Computer Science 2024-02-12 Rupert Mitchell , Robin Menzenbach , Kristian Kersting , Martin Mundt

Towards Guided Descent: Optimization Algorithms for Training Neural Networks At Scale

Neural network optimization remains one of the most consequential yet poorly understood challenges in modern AI research, where improvements in training algorithms can lead to enhanced feature learning in foundation models,…

Machine Learning · Computer Science 2025-12-23 Ansh Nagwekar

Optimizing Deep Neural Networks through Neuroevolution with Stochastic Gradient Descent

Deep neural networks (DNNs) have achieved remarkable success in computer vision; however, training DNNs for satisfactory performance remains challenging and suffers from sensitivity to empirical selections of an optimization algorithm for…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Haichao Zhang , Kuangrong Hao , Lei Gao , Bing Wei , Xuesong Tang

Learning Neural Networks by Neuron Pursuit

The first part of this paper studies the evolution of gradient flow for homogeneous neural networks near a class of saddle points exhibiting a sparsity structure. The choice of these saddle points is motivated from previous works on…

Machine Learning · Computer Science 2025-09-16 Akshay Kumar , Jarvis Haupt

Persistency of Excitation for Robustness of Neural Networks

When an online learning algorithm is used to estimate the unknown parameters of a model, the signals interacting with the parameter estimates should not decay too quickly for the optimal values to be discovered correctly. This requirement…

Machine Learning · Computer Science 2019-11-05 Kamil Nar , S. Shankar Sastry