English
Related papers

Related papers: Self-Attentive Associative Memory

200 papers

Continual learning aims to provide intelligent agents capable of learning multiple tasks sequentially with neural networks. One of its main challenging, catastrophic forgetting, is caused by the neural networks non-optimal ability to learn…

Machine Learning · Computer Science 2021-01-29 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Several guiding principles for thought processes are proposed and a neural-network-type model implementing these principles is presented and studied. We suggest to consider thinking within an associative network built-up of overlapping…

Neurons and Cognition · Quantitative Biology 2007-05-23 Claudius Gros

Long-horizon agentic reasoning requires large language models to act over long interaction histories containing thoughts, tool calls, observations, and partial conclusions. The challenge is not merely that these histories grow long, but…

Artificial Intelligence · Computer Science 2026-05-26 Yuyang Hu , Hongjin Qian , Shuting Wang , Jiongnan Liu , Ziliang Zhao , Jiejun Tan , Zheng Liu , Zhicheng Dou

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in…

Machine Learning · Computer Science 2016-10-31 Jack W Rae , Jonathan J Hunt , Tim Harley , Ivo Danihelka , Andrew Senior , Greg Wayne , Alex Graves , Timothy P Lillicrap

Despite recent progress in memory augmented neural network (MANN) research, associative memory networks with a single external memory still show limited performance on complex relational reasoning tasks. Especially the content-based…

Machine Learning · Computer Science 2021-08-30 Taewon Park , Inchul Choi , Minho Lee

Artificial autonomous agents and robots interacting in complex environments are required to continually acquire and fine-tune knowledge over sustained periods of time. The ability to learn from continuous streams of information is referred…

Artificial Intelligence · Computer Science 2018-12-20 German I. Parisi , Jun Tani , Cornelius Weber , Stefan Wermter

Many important NLP problems can be posed as dual-sequence or sequence-to-sequence modeling tasks. Recent advances in building end-to-end neural architectures have been highly successful in solving such tasks. In this work we propose a new…

Neural and Evolutionary Computing · Computer Science 2016-06-15 Dirk Weissenborn

The Transformer architecture, underpinned by the self-attention mechanism, has become the de facto standard for sequence modeling tasks. However, its core computational primitive scales quadratically with sequence length (O(N^2)), creating…

Computation and Language · Computer Science 2025-09-03 Rishiraj Acharya

Sequential recommendation predicts users' next behaviors with their historical interactions. Recommending with longer sequences improves recommendation accuracy and increases the degree of personalization. As sequences get longer, existing…

Information Retrieval · Computer Science 2022-09-05 Qianying Lin , Wen-Ji Zhou , Yanshi Wang , Qing Da , Qing-Guo Chen , Bing Wang

Sequential memory, the ability to form and accurately recall a sequence of events or stimuli in the correct order, is a fundamental prerequisite for biological and artificial intelligence as it underpins numerous cognitive functions (e.g.,…

Artificial Intelligence · Computer Science 2024-10-04 Ramy Mounir , Sudeep Sarkar

Introduction. Neural network models of autoassociative, distributed memory allow storage and retrieval of many items (vectors) where the number of stored items can exceed the vector dimension (the number of neurons in the network). This…

Neural and Evolutionary Computing · Computer Science 2017-09-05 V. I. Gritsenko , D. A. Rachkovskij , A. A. Frolov , R. Gayler , D. Kleyko , E. Osipov

Human interaction with the external world fundamentally involves the exchange of personal memory, whether with other individuals, websites, applications, or, in the future, AI agents. A significant portion of this interaction is redundant,…

Artificial Intelligence · Computer Science 2025-03-13 Jiale Wei , Xiang Ying , Tao Gao , Fangyi Bao , Felix Tao , Jingbo Shang

We introduce memory association networks(MANs) that memorize and remember any data. This neural network has two memories. One consists of a queue-structured short-term memory to solve the class imbalance problem and long-term memory to…

Artificial Intelligence · Computer Science 2021-12-28 Seokjun Kim , Jaeeun Jang , Yeonju Jang , Seongyune Choi , Hyeoncheol Kim

Recurrent neural networks (RNNs) and self-attention are both widely used sequence-mixing layers that maintain an internal memory. However, this memory is constructed using two orthogonal mechanisms: RNNs compress the entire past into a…

Machine Learning · Computer Science 2026-03-30 Leon Lufkin , Tomás Figliolia , Beren Millidge , Kamesh Krishnamurthy

Natural memories are associative, declarative and distributed. Symbolic computing memories resemble natural memories in their declarative character, and information can be stored and recovered explicitly; however, they lack the associative…

Artificial Intelligence · Computer Science 2020-09-29 Luis A. Pineda , Gibrán Fuentes , Rafael Morales

This paper presents the design of an associative memory with feedback that is capable of on-line temporal sequence learning. A framework for on-line sequence learning has been proposed, and different sequence learning models have been…

Neural and Evolutionary Computing · Computer Science 2007-05-23 J. Bose , S. B. Furber , J. L. Shapiro

Sequential recommendation aims to recommend the next item of users' interest based on their historical interactions. Recently, the self-attention mechanism has been adapted for sequential recommendation, and demonstrated state-of-the-art…

Information Retrieval · Computer Science 2022-09-19 Bo Peng , Srinivasan Parthasarathy , Xia Ning

This paper addresses the challenge of creating a neural architecture for very long sequences that requires constant time for processing new information at each time step. Our approach, Associative Recurrent Memory Transformer (ARMT), is…

Computation and Language · Computer Science 2025-02-17 Ivan Rodkin , Yuri Kuratov , Aydar Bulatov , Mikhail Burtsev

I introduce a novel associative memory model named Correlated Dense Associative Memory (CDAM), which integrates both auto- and hetero-association in a unified framework for continuous-valued memory patterns. Employing an arbitrary graph…

Neural and Evolutionary Computing · Computer Science 2024-06-04 Thomas F Burns

When someone mentions the name of a known person we immediately recall her face and possibly many other traits. This is because we possess the so-called associative memory, that is the ability to correlate different memories to the same…

Neurons and Cognition · Quantitative Biology 2010-08-26 Yuriy V. Pershin , Massimiliano Di Ventra
‹ Prev 1 2 3 10 Next ›