Related papers: Working Memory Graphs

Think Before You Act: Decision Transformers with Working Memory

Decision Transformer-based decision-making agents have shown the ability to generalize across multiple tasks. However, their performance relies on massive data and computation. We argue that this inefficiency stems from the forgetting…

Machine Learning · Computer Science 2024-05-30 Jikun Kang , Romain Laroche , Xingdi Yuan , Adam Trischler , Xue Liu , Jie Fu

Generative Temporal Models with Spatial Memory for Partially Observed Environments

In model-based reinforcement learning, generative and temporal models of environments can be leveraged to boost agent performance, either by tuning the agent's representations during training or via use as part of an explicit planning…

Machine Learning · Statistics 2018-07-20 Marco Fraccaro , Danilo Jimenez Rezende , Yori Zwols , Alexander Pritzel , S. M. Ali Eslami , Fabio Viola

Memory Transformer

Transformer-based models have achieved state-of-the-art results in many natural language processing tasks. The self-attention architecture allows transformer to combine information from all elements of a sequence into context-aware…

Computation and Language · Computer Science 2021-02-17 Mikhail S. Burtsev , Yuri Kuratov , Anton Peganov , Grigory V. Sapunov

Transformers are Meta-Reinforcement Learners

The transformer architecture and variants presented remarkable success across many machine learning tasks in recent years. This success is intrinsically related to the capability of handling long sequences and the presence of…

Machine Learning · Computer Science 2022-06-15 Luckeciano C. Melo

Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning

We present the Multi-Agent Transformer World Model (MATWM), a novel transformer-based world model designed for multi-agent reinforcement learning in both vector- and image-based environments. MATWM combines a decentralized imagination…

Machine Learning · Computer Science 2025-06-24 Azad Deihim , Eduardo Alonso , Dimitra Apostolopoulou

Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning

Humans leverage rich internal models of the world to reason about the future, imagine counterfactuals, and adapt flexibly to new situations. In Reinforcement Learning (RL), world models aim to capture how the environment evolves in response…

Artificial Intelligence · Computer Science 2025-10-29 Léopold Maytié , Roland Bertin Johannet , Rufin VanRullen

Model-based Meta Reinforcement Learning using Graph Structured Surrogate Models

Reinforcement learning is a promising paradigm for solving sequential decision-making problems, but low data efficiency and weak generalization across tasks are bottlenecks in real-world applications. Model-based meta reinforcement learning…

Machine Learning · Computer Science 2021-02-17 Qi Wang , Herke van Hoof

Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning

In multi-agent reinforcement learning, a commonly considered paradigm is centralized training with decentralized execution. However, in this framework, decentralized execution restricts the development of coordinated policies due to the…

Multiagent Systems · Computer Science 2024-12-30 Wenzhe Fan , Zishun Yu , Chengdong Ma , Changye Li , Yaodong Yang , Xinhua Zhang

Relational Forward Models for Multi-Agent Learning

The behavioral dynamics of multi-agent systems have a rich and orderly structure, which can be leveraged to understand these systems, and to improve how artificial agents learn to operate in them. Here we introduce Relational Forward Models…

Machine Learning · Computer Science 2018-10-01 Andrea Tacchetti , H. Francis Song , Pedro A. M. Mediano , Vinicius Zambaldi , Neil C. Rabinowitz , Thore Graepel , Matthew Botvinick , Peter W. Battaglia

Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces

Decision Transformer, a promising approach that applies Transformer architectures to reinforcement learning, relies on causal self-attention to model sequences of states, actions, and rewards. While this method has shown competitive…

Machine Learning · Computer Science 2024-04-01 Toshihiro Ota

From Memories to Maps: Mechanisms of In-Context Reinforcement Learning in Transformers

Humans and animals show remarkable learning efficiency, adapting to new environments with minimal experience. This capability is not well captured by standard reinforcement learning algorithms that rely on incremental value updates. Rapid…

Artificial Intelligence · Computer Science 2025-12-03 Ching Fang , Kanaka Rajan

Better transfer learning with inferred successor maps

Humans and animals show remarkable flexibility in adjusting their behaviour when their goals, or rewards in the environment change. While such flexibility is a hallmark of intelligent behaviour, these multi-task scenarios remain an…

Artificial Intelligence · Computer Science 2020-01-13 Tamas J. Madarasz

Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks

Models based on the Transformer neural network architecture have seen success on a wide variety of tasks that appear to require complex "cognitive branching" -- or the ability to maintain pursuit of one goal while accomplishing others. In…

Artificial Intelligence · Computer Science 2024-02-14 Aaron Traylor , Jack Merullo , Michael J. Frank , Ellie Pavlick

Spatially-Aware Transformer for Embodied Agents

Episodic memory plays a crucial role in various cognitive processes, such as the ability to mentally recall past events. While cognitive science emphasizes the significance of spatial context in the formation and retrieval of episodic…

Machine Learning · Computer Science 2024-03-04 Junmo Cho , Jaesik Yoon , Sungjin Ahn

Transformers and Slot Encoding for Sample Efficient Physical World Modelling

World modelling, i.e. building a representation of the rules that govern the world so as to predict its evolution, is an essential ability for any agent interacting with the physical world. Recent applications of the Transformer…

Machine Learning · Computer Science 2024-05-31 Francesco Petri , Luigi Asprino , Aldo Gangemi

The Effectiveness of World Models for Continual Reinforcement Learning

World models power some of the most efficient reinforcement learning algorithms. In this work, we showcase that they can be harnessed for continual learning - a situation when the agent faces changing environments. World models typically…

Machine Learning · Computer Science 2023-07-14 Samuel Kessler , Mateusz Ostaszewski , Michał Bortkiewicz , Mateusz Żarski , Maciej Wołczyk , Jack Parker-Holder , Stephen J. Roberts , Piotr Miłoś

Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation

Scalable multi-agent driving simulation requires behavior models that are both realistic and computationally efficient. We address this by optimizing the behavior model that controls individual traffic participants. To improve efficiency,…

Robotics · Computer Science 2026-04-15 Fabian Konstantinidis , Moritz Sackmann , Ulrich Hofmann , Christoph Stiller

Local-to-Global Self-Attention in Vision Transformers

Transformers have demonstrated great potential in computer vision tasks. To avoid dense computations of self-attentions in high-resolution visual data, some recent Transformer models adopt a hierarchical design, where self-attentions are…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Jinpeng Li , Yichao Yan , Shengcai Liao , Xiaokang Yang , Ling Shao

Memory-Augmented Transformers: A Systematic Review from Neuroscience Principles to Enhanced Model Architectures

Memory is fundamental to intelligence, enabling learning, reasoning, and adaptability across biological and artificial systems. While Transformer architectures excel at sequence modeling, they face critical limitations in long-range context…

Machine Learning · Computer Science 2025-08-19 Parsa Omidi , Xingshuai Huang , Axel Laborieux , Bahareh Nikpour , Tianyu Shi , Armaghan Eshaghi

Recurrent Memory Transformer

Transformer-based models show their effectiveness across multiple domains and tasks. The self-attention allows to combine information from all sequence elements into context-aware representations. However, global and local information has…

Computation and Language · Computer Science 2022-12-09 Aydar Bulatov , Yuri Kuratov , Mikhail S. Burtsev