Related papers: Maelstrom Networks

Recurrent Neural Networks for Learning Long-term Temporal Dependencies with Reanalysis of Time Scale Representation

Recurrent neural networks with a gating mechanism such as an LSTM or GRU are powerful tools to model sequential data. In the mechanism, a forget gate, which was introduced to control information flow in a hidden state in the RNN, has…

Machine Learning · Statistics 2021-11-08 Kentaro Ohno , Atsutoshi Kumagai

Gated Recurrent Neural Tensor Network

Recurrent Neural Networks (RNNs), which are a powerful scheme for modeling temporal and sequential data need to capture long-term dependencies on datasets and represent them in hidden layers with a powerful model to capture more information…

Machine Learning · Computer Science 2017-06-08 Andros Tjandra , Sakriani Sakti , Ruli Manurung , Mirna Adriani , Satoshi Nakamura

Non-local Recurrent Neural Memory for Supervised Sequence Modeling

Typical methods for supervised sequence modeling are built upon the recurrent neural networks to capture temporal dependencies. One potential limitation of these methods is that they only model explicitly information interactions between…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Canmiao Fu , Wenjie Pei , Qiong Cao , Chaopeng Zhang , Yong Zhao , Xiaoyong Shen , Yu-Wing Tai

Automated Architecture Design for Deep Neural Networks

Machine learning has made tremendous progress in recent years and received large amounts of public attention. Though we are still far from designing a full artificially intelligent agent, machine learning has brought us many applications in…

Machine Learning · Computer Science 2019-08-29 Steven Abreu

Nucleus Neural Network: A Data-driven Self-organized Architecture

Artificial neural networks which are inspired from the learning mechanism of brain have achieved great successes in many problems, especially those with deep layers. In this paper, we propose a nucleus neural network (NNN) and corresponding…

Computer Vision and Pattern Recognition · Computer Science 2019-05-15 Jia Liu , Maoguo Gong , Haibo He

Linked Adapters: Linking Past and Future to Present for Effective Continual Learning

Continual learning allows the system to learn and adapt to new tasks while retaining the knowledge acquired from previous tasks. However, deep learning models suffer from catastrophic forgetting of knowledge learned from earlier tasks while…

Machine Learning · Computer Science 2024-12-17 Dupati Srikar Chandra , P. K. Srijith , Dana Rezazadegan , Chris McCarthy

Learning Sequence Attractors in Recurrent Networks with Hidden Neurons

The brain is targeted for processing temporal sequence information. It remains largely unclear how the brain learns to store and retrieve sequence memories. Here, we study how recurrent networks of binary neurons learn sequence attractors…

Neural and Evolutionary Computing · Computer Science 2024-04-04 Yao Lu , Si Wu

Inferring Algorithmic Patterns with Stack-Augmented Recurrent Nets

Despite the recent achievements in machine learning, we are still very far from achieving real artificial intelligence. In this paper, we discuss the limitations of standard deep learning approaches and show that some of these limitations…

Neural and Evolutionary Computing · Computer Science 2015-06-03 Armand Joulin , Tomas Mikolov

Convolutional Bipartite Attractor Networks

In human perception and cognition, a fundamental operation that brains perform is interpretation: constructing coherent neural states from noisy, incomplete, and intrinsically ambiguous evidence. The problem of interpretation is well…

Machine Learning · Computer Science 2019-09-30 Michael Iuzzolino , Yoram Singer , Michael C. Mozer

Learning long-term dependencies for action recognition with a biologically-inspired deep network

Despite a lot of research efforts devoted in recent years, how to efficiently learn long-term dependencies from sequences still remains a pretty challenging task. As one of the key models for sequence learning, recurrent neural network…

Computer Vision and Pattern Recognition · Computer Science 2017-03-21 Yemin Shi , Yonghong Tian , Yaowei Wang , Tiejun Huang

Artificial Neuronal Ensembles with Learned Context Dependent Gating

Biological neural networks are capable of recruiting different sets of neurons to encode different memories. However, when training artificial neural networks on a set of tasks, typically, no mechanism is employed for selectively producing…

Machine Learning · Computer Science 2023-05-17 Matthew J. Tilley , Michelle Miller , David J. Freedman

Learning to reinforcement learn

In recent years deep reinforcement learning (RL) systems have attained superhuman performance in a number of challenging task domains. However, a major limitation of such applications is their demand for massive amounts of training data. A…

Machine Learning · Computer Science 2017-01-24 Jane X Wang , Zeb Kurth-Nelson , Dhruva Tirumala , Hubert Soyer , Joel Z Leibo , Remi Munos , Charles Blundell , Dharshan Kumaran , Matt Botvinick

How Efficient Are Today's Continual Learning Algorithms?

Supervised Continual learning involves updating a deep neural network (DNN) from an ever-growing stream of labeled data. While most work has focused on overcoming catastrophic forgetting, one of the major motivations behind continual…

Computer Vision and Pattern Recognition · Computer Science 2023-04-04 Md Yousuf Harun , Jhair Gallardo , Tyler L. Hayes , Christopher Kanan

Self-Net: Lifelong Learning via Continual Self-Modeling

Learning a set of tasks over time, also known as continual learning (CL), is one of the most challenging problems in artificial intelligence. While recent approaches achieve some degree of CL in deep neural networks, they either (1) grow…

Machine Learning · Computer Science 2019-07-15 Blake Camp , Jaya Krishna Mandivarapu , Rolando Estrada

An Empirical Investigation of the Role of Pre-training in Lifelong Learning

The lifelong learning paradigm in machine learning is an attractive alternative to the more prominent isolated learning scheme not only due to its resemblance to biological learning but also its potential to reduce energy waste by obviating…

Machine Learning · Computer Science 2023-08-30 Sanket Vaibhav Mehta , Darshan Patil , Sarath Chandar , Emma Strubell

Learning to Learn Neural Networks

Meta-learning consists in learning learning algorithms. We use a Long Short Term Memory (LSTM) based network to learn to compute on-line updates of the parameters of another neural network. These parameters are stored in the cell state of…

Machine Learning · Computer Science 2016-10-20 Tom Bosc

Center Loss Regularization for Continual Learning

The ability to learn different tasks sequentially is essential to the development of artificial intelligence. In general, neural networks lack this capability, the major obstacle being catastrophic forgetting. It occurs when the…

Machine Learning · Computer Science 2021-10-22 Kaustubh Olpadkar , Ekta Gavas

Learning What to Remember: Long-term Episodic Memory Networks for Learning from Streaming Data

Current generation of memory-augmented neural networks has limited scalability as they cannot efficiently process data that are too large to fit in the external memory storage. One example of this is lifelong learning scenario where the…

Machine Learning · Computer Science 2018-12-12 Hyunwoo Jung , Moonsu Han , Minki Kang , Sungju Hwang

Learning Recurrent Binary/Ternary Weights

Recurrent neural networks (RNNs) have shown excellent performance in processing sequence data. However, they are both complex and memory intensive due to their recursive nature. These limitations make RNNs difficult to embed on mobile…

Machine Learning · Computer Science 2019-01-28 Arash Ardakani , Zhengyun Ji , Sean C. Smithson , Brett H. Meyer , Warren J. Gross

Understanding the Principles of Recursive Neural networks: A Generative Approach to Tackle Model Complexity

Recursive Neural Networks are non-linear adaptive models that are able to learn deep structured information. However, these models have not yet been broadly accepted. This fact is mainly due to its inherent complexity. In particular, not…

Neural and Evolutionary Computing · Computer Science 2009-11-18 Alejandro Chinea