Related papers: Graph Attention Memory for Visual Navigation

Gated Associative Memory: A Parallel O(N) Architecture for Efficient Sequence Modeling

The Transformer architecture, underpinned by the self-attention mechanism, has become the de facto standard for sequence modeling tasks. However, its core computational primitive scales quadratically with sequence length (O(N^2)), creating…

Computation and Language · Computer Science 2025-09-03 Rishiraj Acharya

MemoNav: Working Memory Model for Visual Navigation

Image-goal navigation is a challenging task that requires an agent to navigate to a goal indicated by an image in unfamiliar environments. Existing methods utilizing diverse scene memories suffer from inefficient exploration since they use…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Hongxin Li , Zeyu Wang , Xu Yang , Yuran Yang , Shuqi Mei , Zhaoxiang Zhang

Where to Look: A Unified Attention Model for Visual Recognition with Reinforcement Learning

The idea of using the recurrent neural network for visual attention has gained popularity in computer vision community. Although the recurrent attention model (RAM) leverages the glimpses with more large patch size to increasing its scope,…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Gang Chen

GAM: Hierarchical Graph-based Agentic Memory for LLM Agents

To sustain coherent long-term interactions, Large Language Model (LLM) agents must navigate the tension between acquiring new information and retaining prior knowledge. Current unified stream-based memory systems facilitate context updates…

Artificial Intelligence · Computer Science 2026-04-15 Zhaofen Wu , Hanrong Zhang , Fulin Lin , Wujiang Xu , Xinran Xu , Yankai Chen , Henry Peng Zou , Shaowen Chen , Weizhi Zhang , Xue Liu , Philip S. Yu , Hongwei Wang

Memory-Augmented Reinforcement Learning for Image-Goal Navigation

In this work, we present a memory-augmented approach for image-goal navigation. Earlier attempts, including RL-based and SLAM-based approaches have either shown poor generalization performance, or are heavily-reliant on pose/depth sensors.…

Computer Vision and Pattern Recognition · Computer Science 2023-01-06 Lina Mezghani , Sainbayar Sukhbaatar , Thibaut Lavril , Oleksandr Maksymets , Dhruv Batra , Piotr Bojanowski , Karteek Alahari

EGAM: Extended Graph Attention Model for Solving Routing Problems

Neural combinatorial optimization (NCO) solvers, implemented with graph neural networks (GNNs), have introduced new approaches for solving routing problems. Trained with reinforcement learning (RL), the state-of-the-art graph attention…

Machine Learning · Computer Science 2026-01-30 Licheng Wang , Yuzi Yan , Mingtao Huang , Yuan Shen

Joint Graph Learning and Matching for Semantic Feature Correspondence

In recent years, powered by the learned discriminative representation via graph neural network (GNN) models, deep graph matching methods have made great progresses in the task of matching semantic features. However, these methods usually…

Computer Vision and Pattern Recognition · Computer Science 2021-11-18 He Liu , Tao Wang , Yidong Li , Congyan Lang , Yi Jin , Haibin Ling

GAMR: A Guided Attention Model for (visual) Reasoning

Humans continue to outperform modern AI systems in their ability to flexibly parse and understand complex visual scenes. Here, we present a novel module for visual reasoning, the Guided Attention Model for (visual) Reasoning (GAMR), which…

Artificial Intelligence · Computer Science 2023-03-22 Mohit Vaishnav , Thomas Serre

Neural Attention Memory

We propose a novel perspective of the attention mechanism by reinventing it as a memory architecture for neural networks, namely Neural Attention Memory (NAM). NAM is a memory structure that is both readable and writable via differentiable…

Machine Learning · Computer Science 2023-10-17 Hyoungwook Nam , Seung Byum Seo

Topological Semantic Graph Memory for Image-Goal Navigation

A novel framework is proposed to incrementally collect landmark-based graph memory and use the collected memory for image goal navigation. Given a target image to search, an embodied robot utilizes semantic memory to find the target in an…

Robotics · Computer Science 2022-09-20 Nuri Kim , Obin Kwon , Hwiyeon Yoo , Yunho Choi , Jeongho Park , Songhwai Oh

MemoNav: Selecting Informative Memories for Visual Navigation

Image-goal navigation is a challenging task, as it requires the agent to navigate to a target indicated by an image in a previously unseen scene. Current methods introduce diverse memory mechanisms which save navigation history to solve…

Computer Vision and Pattern Recognition · Computer Science 2022-08-23 Hongxin Li , Xu Yang , Yuran Yang , Shuqi Mei , Zhaoxiang Zhang

Temporal Attention-Augmented Graph Convolutional Network for Efficient Skeleton-Based Human Action Recognition

Graph convolutional networks (GCNs) have been very successful in modeling non-Euclidean data structures, like sequences of body skeletons forming actions modeled as spatio-temporal graphs. Most GCN-based action recognition methods use deep…

Computer Vision and Pattern Recognition · Computer Science 2021-04-26 Negar Heidari , Alexandros Iosifidis

Emergence of Fixational and Saccadic Movements in a Multi-Level Recurrent Attention Model for Vision

Inspired by foveal vision, hard attention models promise interpretability and parameter economy. However, existing models like the Recurrent Model of Visual Attention (RAM) and Deep Recurrent Attention Model (DRAM) failed to model the…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Pengcheng Pan , Yonekura Shogo , Yasuo Kuniyoshi

GAMA: A Neural Neighborhood Search Method with Graph-aware Multi-modal Attention for Vehicle Routing Problem

Recent advances in neural neighborhood search methods have shown potential in tackling Vehicle Routing Problems (VRPs). However, most existing approaches rely on simplistic state representations and fuse heterogeneous information via naive…

Artificial Intelligence · Computer Science 2025-12-04 Xiangling Chen , Yi Mei , Mengjie Zhang

GaAN: Gated Attention Networks for Learning on Large and Spatiotemporal Graphs

We propose a new network architecture, Gated Attention Networks (GaAN), for learning on graphs. Unlike the traditional multi-head attention mechanism, which equally consumes all attention heads, GaAN uses a convolutional sub-network to…

Machine Learning · Computer Science 2018-03-21 Jiani Zhang , Xingjian Shi , Junyuan Xie , Hao Ma , Irwin King , Dit-Yan Yeung

BAM: Bottleneck Attention Module

Recent advances in deep neural networks have been developed via architecture search for stronger representational power. In this work, we focus on the effect of attention in general deep neural networks. We propose a simple and effective…

Computer Vision and Pattern Recognition · Computer Science 2018-07-19 Jongchan Park , Sanghyun Woo , Joon-Young Lee , In So Kweon

Deep Graph Attention Model

Graph classification is a problem with practical applications in many different domains. Most of the existing methods take the entire graph into account when calculating graph features. In a graphlet-based approach, for instance, the entire…

Machine Learning · Computer Science 2017-09-20 John Boaz Lee , Ryan Rossi , Xiangnan Kong

GAM: Explainable Visual Similarity and Classification via Gradient Activation Maps

We present Gradient Activation Maps (GAM) - a machinery for explaining predictions made by visual similarity and classification models. By gleaning localized gradient and activation information from multiple network layers, GAM offers…

Computer Vision and Pattern Recognition · Computer Science 2021-09-03 Oren Barkan , Omri Armstrong , Amir Hertz , Avi Caciularu , Ori Katz , Itzik Malkiel , Noam Koenigstein

RATM: Recurrent Attentive Tracking Model

We present an attention-based modular neural framework for computer vision. The framework uses a soft attention mechanism allowing models to be trained with gradient descent. It consists of three modules: a recurrent attention module…

Machine Learning · Computer Science 2016-04-29 Samira Ebrahimi Kahou , Vincent Michalski , Roland Memisevic

MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this…

Computer Vision and Pattern Recognition · Computer Science 2021-03-23 Zachary Seymour , Kowshik Thopalli , Niluthpol Mithun , Han-Pang Chiu , Supun Samarasekera , Rakesh Kumar