Related papers: Multigrid Neural Memory

Memory and attention in deep learning

Intelligence necessitates memory. Without memory, humans fail to perform various nontrivial tasks such as reading novels, playing games or solving maths. As the ultimate goal of machine learning is to derive intelligent systems that learn…

Machine Learning · Computer Science 2021-07-06 Hung Le

Multigrid Neural Architectures

We propose a multigrid extension of convolutional neural networks (CNNs). Rather than manipulating representations living on a single spatial grid, our network layers operate across scale space, on a pyramid of grids. They consume multigrid…

Computer Vision and Pattern Recognition · Computer Science 2017-05-15 Tsung-Wei Ke, Michael Maire, Stella X. Yu

Memory-Augmented Transformers: A Systematic Review from Neuroscience Principles to Enhanced Model Architectures

Memory is fundamental to intelligence, enabling learning, reasoning, and adaptability across biological and artificial systems. While Transformer architectures excel at sequence modeling, they face critical limitations in long-range context…

Machine Learning · Computer Science 2025-08-19 Parsa Omidi, Xingshuai Huang, Axel Laborieux, Bahareh Nikpour, Tianyu Shi, Armaghan Eshaghi

Self-recovery of memory via generative replay

A remarkable capacity of the brain is its ability to autonomously reorganize memories during offline periods. Memory replay, a mechanism hypothesized to underlie biological offline learning, has inspired offline methods for reducing…

Neural and Evolutionary Computing · Computer Science 2023-01-18 Zhenglong Zhou, Geshi Yeung, Anna C. Schapiro

Learning Connectivity of Neural Networks from a Topological Perspective

Seeking effective neural networks is a critical and practical field in deep learning. Besides designing the depth, type of convolution, normalization, and nonlinearities, the topological connectivity of neural networks is also important.…

Computer Vision and Pattern Recognition · Computer Science 2020-08-20 Kun Yuan, Quanquan Li, Jing Shao, Junjie Yan

Linear Memory Networks

Recurrent neural networks can learn complex transduction problems that require maintaining and actively exploiting a memory of their inputs. Such models traditionally consider memory and input-output functionalities indissolubly entangled.…

Machine Learning · Computer Science 2018-11-09 Davide Bacciu, Antonio Carta, Alessandro Sperduti

Memory-Gated Recurrent Networks

The essence of multivariate sequential learning is all about how to extract dependencies in data. These data sets, such as hourly medical records in intensive care units and multi-frequency phonetic time series, oftentimes exhibit not only…

Machine Learning · Computer Science 2021-01-01 Yaquan Zhang, Qi Wu, Nanbo Peng, Min Dai, Jing Zhang, Hu Wang

Ordered Memory

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this…

Machine Learning · Computer Science 2019-11-05 Yikang Shen, Shawn Tan, Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron Courville

Learning to Generate with Memory

Memory units have been widely used to enrich the capabilities of deep networks on capturing long-term dependencies in reasoning and prediction tasks, but little investigation exists on deep generative models (DGMs) which are good at…

Machine Learning · Computer Science 2016-05-31 Chongxuan Li, Jun Zhu, Bo Zhang

Structured Ensembles: an Approach to Reduce the Memory Footprint of Ensemble Methods

In this paper, we propose a novel ensembling technique for deep neural networks, which is able to drastically reduce the required memory compared to alternative approaches. In particular, we propose to extract multiple sub-networks from a…

Machine Learning · Computer Science 2022-10-07 Jary Pomponi, Simone Scardapane, Aurelio Uncini

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Continual acquisition of novel experience without interfering with previously learned knowledge, i.e. continual learning, is critical for artificial neural networks, but limited by catastrophic forgetting. A neural network adjusts its parameters…

Machine Learning · Computer Science 2022-02-15 Liyuan Wang, Bo Lei, Qian Li, Hang Su, Jun Zhu, Yi Zhong

Incremental Training of a Recurrent Neural Network Exploiting a Multi-Scale Dynamic Memory

The effectiveness of recurrent neural networks can be largely influenced by their ability to store into their dynamical memory information extracted from input sequences at different frequencies and timescales. Such a feature can be…

Machine Learning · Computer Science 2020-07-01 Antonio Carta, Alessandro Sperduti, Davide Bacciu

One-shot Learning with Memory-Augmented Neural Networks

Despite recent breakthroughs in the applications of deep neural networks, one setting that presents a persistent challenge is that of "one-shot learning." Traditional gradient-based networks require a lot of data to learn, often through…

Machine Learning · Computer Science 2016-05-20 Adam Santoro, Sergey Bartunov, Matthew Botvinick, Daan Wierstra, Timothy Lillicrap

Hierarchical Memory Networks

Memory networks are neural networks with an explicit memory component that can be both read and written to by the network. The memory is often addressed in a soft way using a softmax function, making end-to-end training with backpropagation…

Machine Learning · Statistics 2016-05-25 Sarath Chandar, Sungjin Ahn, Hugo Larochelle, Pascal Vincent, Gerald Tesauro, Yoshua Bengio

Autonomous and Ubiquitous In-node Learning Algorithms of Active Directed Graphs and Its Storage Behavior

Memory is an important cognitive function for humans. How a brain with such a small power can complete such a complex memory function, the working mechanism behind this is undoubtedly fascinating. Engram theory views memory as the…

Distributed, Parallel, and Cluster Computing · Computer Science 2023-07-13 Hui Wei, Weihua Miao, Fushun Li

The shape of memory in temporal networks

Temporal networks are widely used models for describing the architecture of complex systems. Network memory -- that is the dependence of a temporal network's structure on its past -- has been shown to play a prominent role in diffusion,…

Physics and Society · Physics 2020-04-28 Oliver E. Williams, Lucas Lacasa, Ana P. Millán, Vito Latora

Long memory network time series

Many scientific areas, from computer science to the environmental sciences and finance, give rise to multivariate time series which exhibit long memory, or loosely put, a slow decay in their autocorrelation structure. Efficient modelling…

Methodology · Statistics 2025-12-12 Chiara Boetti, Matthew A. Nunes, Marina I. Knight

Robust High-dimensional Memory-augmented Neural Networks

Traditional neural networks require enormous amounts of data to build their complex mappings during a slow training procedure that hinders their abilities for relearning and adapting to new data. Memory-augmented neural networks enhance…

Emerging Technologies · Computer Science 2021-06-23 Geethan Karunaratne, Manuel Schmuck, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

Short-Term Memory Through Persistent Activity: Evolution of Self-Stopping and Self-Sustaining Activity in Spiking Neural Networks

Memories in the brain are separated in two categories: short-term and long-term memories. Long-term memories remain for a lifetime, while short-term ones exist from a few milliseconds to a few minutes. Within short-term memory studies,…

Neural and Evolutionary Computing · Computer Science 2014-11-26 Julien Hubert, Takashi Ikegami

Learning Deep Matrix Representations

We present a new distributed representation in deep neural nets wherein the information is represented in native form as a matrix. This differs from current neural architectures that rely on vector representations. We consider matrices as…

Machine Learning · Computer Science 2018-02-06 Kien Do, Truyen Tran, Svetha Venkatesh