Related papers: Ordered Memory

Higher Order Recurrent Neural Networks

In this paper, we study novel neural network structures to better model long term dependency in sequential data. We propose to use more memory units to keep track of more preceding states in recurrent neural networks (RNNs), which are all…

Neural and Evolutionary Computing · Computer Science 2016-05-03 Rohollah Soltani, Hui Jiang

Ordered Memory Baselines

Natural language semantics can be modeled using the phrase-structured model, which can be represented using a tree-type architecture. As a result, recent advances in natural language processing have been made utilising recursive neural…

Computation and Language · Computer Science 2023-02-14 Daniel Borisov, Matthew D'Iorio, Jeffrey Hyacinthe

Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks

Natural language is hierarchically structured: smaller units (e.g., phrases) are nested within larger units (e.g., clauses). When a larger constituent ends, all of the smaller constituents that are nested within it must also be closed.…

Computation and Language · Computer Science 2019-05-09 Yikang Shen, Shawn Tan, Alessandro Sordoni, Aaron Courville

Decision Trees That Remember: Gradient-Based Learning of Recurrent Decision Trees with Memory

Neural architectures such as Recurrent Neural Networks (RNNs), Transformers, and State-Space Models have shown great success in handling sequential data by learning temporal dependencies. Decision Trees (DTs), on the other hand, remain a…

Machine Learning · Computer Science 2025-02-07 Sascha Marton, Moritz Schneider

Survey of reasoning using Neural networks

Reason and inference require process as well as memory skills by humans. Neural networks are able to process tasks like image recognition (better than humans) but in memory aspects are still limited (by attention mechanism, size). Recurrent…

Machine Learning · Computer Science 2017-03-03 Amit Sahu

Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks

Deep neural networks have shown superior performance in many regimes to remember familiar patterns with large amounts of data. However, the standard supervised deep learning paradigm is still limited when facing the need to learn new…

Machine Learning · Computer Science 2018-11-16 Jing Shi, Jiaming Xu, Yiqun Yao, Bo Xu

Towards Better Modeling Hierarchical Structure for Self-Attention with Ordered Neurons

Recent studies have shown that a hybrid of self-attention networks (SANs) and recurrent neural networks (RNNs) outperforms both individual architectures, while not much is known about why the hybrid models work. With the belief that…

Computation and Language · Computer Science 2019-11-18 Jie Hao, Xing Wang, Shuming Shi, Jinfeng Zhang, Zhaopeng Tu

Gated Orthogonal Recurrent Units: On Learning to Forget

We present a novel recurrent neural network (RNN) based model that combines the remembering ability of unitary RNNs with the ability of gated RNNs to effectively forget redundant/irrelevant information in its memory. We achieve this by…

Machine Learning · Computer Science 2017-10-26 Li Jing, Caglar Gulcehre, John Peurifoy, Yichen Shen, Max Tegmark, Marin Soljačić, Yoshua Bengio

Structured Memory based Deep Model to Detect as well as Characterize Novel Inputs

While deep learning has pushed the boundaries in various machine learning tasks, the current models are still far away from replicating many functions that a normal human brain can do. Explicit memorization based deep architecture have been…

Computer Vision and Pattern Recognition · Computer Science 2018-01-31 Pratik Prabhanjan Brahma, Qiuyuan Huang, Dapeng Wu

Memory and attention in deep learning

Intelligence necessitates memory. Without memory, humans fail to perform various nontrivial tasks such as reading novels, playing games or solving maths. As the ultimate goal of machine learning is to derive intelligent systems that learn…

Machine Learning · Computer Science 2021-07-06 Hung Le

Gated Recurrent Neural Tensor Network

Recurrent Neural Networks (RNNs), which are a powerful scheme for modeling temporal and sequential data need to capture long-term dependencies on datasets and represent them in hidden layers with a powerful model to capture more information…

Machine Learning · Computer Science 2017-06-08 Andros Tjandra, Sakriani Sakti, Ruli Manurung, Mirna Adriani, Satoshi Nakamura

OrderNet: Ordering by Example

In this paper we introduce a new neural architecture for sorting unordered sequences where the correct sequence order is not easily defined but must rather be inferred from training data. We refer to this architecture as OrderNet and…

Machine Learning · Computer Science 2019-05-29 Robert Porter

Memory and Information Processing in Recurrent Neural Networks

Recurrent neural networks (RNN) are simple dynamical systems whose computational power has been attributed to their short-term memory. Short-term memory of RNNs has been previously studied analytically only for the case of orthogonal…

Neural and Evolutionary Computing · Computer Science 2016-04-26 Alireza Goudarzi, Sarah Marzen, Peter Banda, Guy Feldman, Christof Teuscher, Darko Stefanovic

Multigrid Neural Memory

We introduce a novel approach to endowing neural networks with emergent, long-term, large-scale memory. Distinct from strategies that connect neural networks to external memory banks via intricately crafted controllers and hand-designed…

Machine Learning · Computer Science 2020-08-18 Tri Huynh, Michael Maire, Matthew R. Walter

Recursive Algorithmic Reasoning

Learning models that execute algorithms can enable us to address a key problem in deep learning: generalizing to out-of-distribution data. However, neural networks are currently unable to execute recursive algorithms because they do not…

Machine Learning · Computer Science 2023-11-22 Jonas Jürß, Dulhan Jayalath, Petar Veličković

Linear Memory Networks

Recurrent neural networks can learn complex transduction problems that require maintaining and actively exploiting a memory of their inputs. Such models traditionally consider memory and input-output functionalities indissolubly entangled.…

Machine Learning · Computer Science 2018-11-09 Davide Bacciu, Antonio Carta, Alessandro Sperduti

One-shot Learning with Memory-Augmented Neural Networks

Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through…

Machine Learning · Computer Science 2016-05-20 Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, Timothy Lillicrap

Learning to Remember More with Less Memorization

Memory-augmented neural networks consisting of a neural controller and an external memory have shown potentials in long-term sequential learning. Current RAM-like memory models maintain memory accessing every timesteps, thus they do not…

Machine Learning · Computer Science 2019-03-21 Hung Le, Truyen Tran, Svetha Venkatesh

From Nodes to Networks: Evolving Recurrent Neural Networks

Gated recurrent networks such as those composed of Long Short-Term Memory (LSTM) nodes have recently been used to improve state of the art in many sequential processing tasks such as speech recognition and machine translation. However, the…

Neural and Evolutionary Computing · Computer Science 2018-06-11 Aditya Rawal, Risto Miikkulainen

Neural Architecture Search with Reinforcement Learning

Neural networks are powerful and flexible models that work well for many difficult learning tasks in image, speech and natural language understanding. Despite their success, neural networks are still hard to design. In this paper, we use a…

Machine Learning · Computer Science 2017-02-16 Barret Zoph, Quoc V. Le