English
Related papers

Related papers: Maelstrom Networks

200 papers

Recursive neural networks (RNN) and their recently proposed extension recursive long short term memory networks (RLSTM) are models that compute representations for sentences, by recursively combining word embeddings according to an…

Artificial Intelligence · Computer Science 2016-03-02 Phong Le , Willem Zuidema

We review several of the most widely used techniques for training recurrent neural networks to approximate dynamical systems, then describe a novel algorithm for this task. The algorithm is based on an earlier theoretical result that…

Neural and Evolutionary Computing · Computer Science 2016-06-09 Adam Trischler , Gabriele MT D'Eleuterio

One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically…

Machine Learning · Computer Science 2022-11-16 Heinke Hihn , Daniel A. Braun

During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention…

Computation and Language · Computer Science 2018-05-25 Juan Pavez , Héctor Allende , Héctor Allende-Cid

In this paper, we study novel neural network structures to better model long term dependency in sequential data. We propose to use more memory units to keep track of more preceding states in recurrent neural networks (RNNs), which are all…

Neural and Evolutionary Computing · Computer Science 2016-05-03 Rohollah Soltani , Hui Jiang

Forgetting is often seen as an unwanted characteristic in both human and machine learning. However, we propose that forgetting can in fact be favorable to learning. We introduce "forget-and-relearn" as a powerful paradigm for shaping the…

Machine Learning · Computer Science 2022-02-02 Hattie Zhou , Ankit Vani , Hugo Larochelle , Aaron Courville

Neural networks powered with external memory simulate computer behaviors. These models, which use the memory to store data for a neural controller, can learn algorithms and other complex tasks. In this paper, we introduce a new memory to…

Neural and Evolutionary Computing · Computer Science 2019-12-30 Hung Le , Truyen Tran , Svetha Venkatesh

Purpose: We propose a novel method for continual learning based on the increasing depth of neural networks. This work explores whether extending neural network depth may be beneficial in a life-long learning setting. Methods: We propose a…

Machine Learning · Computer Science 2023-05-09 Jędrzej Kozal , Michał Woźniak

Ability of deep networks to extract high level features and of recurrent networks to perform time-series inference have been studied. In view of universality of one hidden layer network at approximating functions under weak constraints, the…

Neural and Evolutionary Computing · Computer Science 2014-12-19 Sharat C. Prasad , Piyush Prasad

Deep neural networks are used in many state-of-the-art systems for machine perception. Once a network is trained to do a specific task, e.g., bird classification, it cannot easily be trained to do new tasks, e.g., incrementally learning to…

Artificial Intelligence · Computer Science 2017-11-10 Ronald Kemker , Marc McClure , Angelina Abitino , Tyler Hayes , Christopher Kanan

Artificial neural networks can acquire many aspects of human knowledge from data, making them promising as models of human learning. But what those networks can learn depends upon their inductive biases -- the factors other than the data…

Machine Learning · Computer Science 2025-02-28 Gianluca Bencomo , Max Gupta , Ioana Marinescu , R. Thomas McCoy , Thomas L. Griffiths

Disease progression modeling (DPM) using longitudinal data is a challenging task in machine learning for healthcare that can provide clinicians with better tools for diagnosis and monitoring of disease. Existing DPM algorithms neglect…

Computer Vision and Pattern Recognition · Computer Science 2018-08-17 Mostafa Mehdipour Ghazi , Mads Nielsen , Akshay Pai , M. Jorge Cardoso , Marc Modat , Sebastien Ourselin , Lauge Sørensen

Current training regimes for deep learning usually involve exposure to a single task / dataset at a time. Here we start from the observation that in this context the trained model is not given any knowledge of anything outside its…

Artificial Intelligence · Computer Science 2020-02-11 Giacomo Spigler

Deep artificial neural networks famously struggle to learn from non-stationary streams of data. Without dedicated mitigation strategies, continual learning is associated with continuous forgetting of previous tasks and a progressive loss of…

Neurons and Cognition · Quantitative Biology 2025-12-29 Suzanne van der Veldt , Gido M. van de Ven , Sanne Moorman , Guillaume Etter

Effectively learning from sequential data is a longstanding goal of Artificial Intelligence, especially in the case of long sequences. From the dawn of Machine Learning, several researchers have pursued algorithms and architectures capable…

Machine Learning · Computer Science 2025-08-19 Matteo Tiezzi , Michele Casoni , Alessandro Betti , Marco Gori , Stefano Melacci

In the domain of sequence modelling, Recurrent Neural Networks (RNN) have been capable of achieving impressive results in a variety of application areas including visual question answering, part-of-speech tagging and machine translation.…

Machine Learning · Computer Science 2018-05-22 Tharindu Fernando , Simon Denman , Aaron McFadyen , Sridha Sridharan , Clinton Fookes

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine…

Machine Learning · Computer Science 2018-04-20 Shaojie Bai , J. Zico Kolter , Vladlen Koltun

Artificial neural networks, trained to perform cognitive tasks, have recently been used as models for neural recordings from animals performing these tasks. While some progress has been made in performing such comparisons, the evolution of…

Neurons and Cognition · Quantitative Biology 2019-05-03 Chen Beer , Omri Barak

In the present era of deep learning, continual learning research is mainly focused on mitigating forgetting when training a neural network with stochastic gradient descent on a non-stationary stream of data. On the other hand, in the more…

Machine Learning · Computer Science 2024-05-30 Soochan Lee , Hyeonseong Jeon , Jaehyeon Son , Gunhee Kim

Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn…

Machine Learning · Computer Science 2020-12-09 Timothée Lesort