Related papers: Learning Generative Models with Visual Attention

Progressive Attention Networks for Visual Attribute Prediction

We propose a novel attention model that can accurately attends to target objects of various scales and shapes in images. The model is trained to gradually suppress irrelevant regions in an input image via a progressive attentive process…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 Paul Hongsuck Seo , Zhe Lin , Scott Cohen , Xiaohui Shen , Bohyung Han

A Neural Network Model of Spatial and Feature-Based Attention

Visual attention is a mechanism closely intertwined with vision and memory. Top-down information influences visual processing through attention. We designed a neural network model inspired by aspects of human visual attention. This model…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Ruoyang Hu , Robert A. Jacobs

Visual Attention driven by Convolutional Features

The understanding of where humans look in a scene is a problem of great interest in visual perception and computer vision. When eye-tracking devices are not a viable option, models of human attention can be used to predict fixations. In…

Computer Vision and Pattern Recognition · Computer Science 2018-07-30 Dario Zanca , Marco Gori

Learning to See What You Need: Gaze Attention for Multimodal Large Language Models

When humans describe a visual scene, they do not process the entire image uniformly; instead, they selectively fixate on regions relevant to their intended description. In contrast, current multimodal large language models (MLLMs) attend to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-14 Junha Song , Byeongho Heo , Geonmo Gu , Jaegul Choo , Dongyoon Han , Sangdoo Yun

Object Based Attention Through Internal Gating

Object-based attention is a key component of the visual system, relevant for perception, learning, and memory. Neurons tuned to features of attended objects tend to be more active than those associated with non-attended objects. There is a…

Neurons and Cognition · Quantitative Biology 2021-06-09 Jordan Lei , Ari S. Benjamin , Konrad P. Kording

Pre-training Attention Mechanisms

Recurrent neural networks with differentiable attention mechanisms have had success in generative and classification tasks. We show that the classification performance of such models can be enhanced by guiding a randomly initialized model…

Machine Learning · Computer Science 2017-12-18 Jack Lindsey

Age and Gender Prediction From Face Images Using Attentional Convolutional Network

Automatic prediction of age and gender from face images has drawn a lot of attention recently, due it is wide applications in various facial analysis problems. However, due to the large intra-class variation of face images (such as…

Computer Vision and Pattern Recognition · Computer Science 2020-12-09 Amirali Abdolrashidi , Mehdi Minaei , Elham Azimi , Shervin Minaee

Graph Convolutional Neural Networks via Motif-based Attention

Many real-world problems can be represented as graph-based learning problems. In this paper, we propose a novel framework for learning spatial and attentional convolution neural networks on arbitrary graphs. Different from previous…

Machine Learning · Computer Science 2019-02-26 Hao Peng , Jianxin Li , Qiran Gong , Senzhang Wang , Yuanxing Ning , Philip S. Yu

Generative Flows with Invertible Attentions

Flow-based generative models have shown an excellent ability to explicitly learn the probability density function of data via a sequence of invertible transformations. Yet, learning attentions in generative flows remains understudied, while…

Machine Learning · Computer Science 2022-04-01 Rhea Sanjay Sukthanker , Zhiwu Huang , Suryansh Kumar , Radu Timofte , Luc Van Gool

Recurrent Models of Visual Attention

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of…

Machine Learning · Computer Science 2014-06-25 Volodymyr Mnih , Nicolas Heess , Alex Graves , Koray Kavukcuoglu

Visual Attention Methods in Deep Learning: An In-Depth Survey

Inspired by the human cognitive system, attention is a mechanism that imitates the human cognitive awareness about specific information, amplifying critical details to focus more on the essential aspects of data. Deep learning has employed…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Mohammed Hassanin , Saeed Anwar , Ibrahim Radwan , Fahad S Khan , Ajmal Mian

Subject2Vec: Generative-Discriminative Approach from a Set of Image Patches to a Vector

We propose an attention-based method that aggregates local image features to a subject-level representation for predicting disease severity. In contrast to classical deep learning that requires a fixed dimensional input, our method operates…

Computer Vision and Pattern Recognition · Computer Science 2018-07-02 Sumedha Singla , Mingming Gong , Siamak Ravanbakhsh , Frank Sciurba , Barnabas Poczos , Kayhan N. Batmanghelich

Towards Visually Explaining Similarity Models

We consider the problem of visually explaining similarity models, i.e., explaining why a model predicts two images to be similar in addition to producing a scalar score. While much recent work in visual model interpretability has focused on…

Computer Vision and Pattern Recognition · Computer Science 2020-10-15 Meng Zheng , Srikrishna Karanam , Terrence Chen , Richard J. Radke , Ziyan Wu

Attend and Guide (AG-Net): A Keypoints-driven Attention-based Deep Network for Image Recognition

This paper presents a novel keypoints-based attention mechanism for visual recognition in still images. Deep Convolutional Neural Networks (CNNs) for recognizing images with distinctive classes have shown great success, but their…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Asish Bera , Zachary Wharton , Yonghuai Liu , Nik Bessis , Ardhendu Behera

AttentionRNN: A Structured Spatial Attention Mechanism

Visual attention mechanisms have proven to be integrally important constituent components of many modern deep neural architectures. They provide an efficient and effective way to utilize visual information selectively, which has shown to be…

Computer Vision and Pattern Recognition · Computer Science 2019-05-24 Siddhesh Khandelwal , Leonid Sigal

On Understanding Attention-Based In-Context Learning for Categorical Data

In-context learning based on attention models is examined for data with categorical outcomes, with inference in such models viewed from the perspective of functional gradient descent (GD). We develop a network composed of attention blocks,…

Machine Learning · Statistics 2025-05-08 Aaron T. Wang , William Convertino , Xiang Cheng , Ricardo Henao , Lawrence Carin

Attentional Network for Visual Object Detection

We propose augmenting deep neural networks with an attention mechanism for the visual object detection task. As perceiving a scene, humans have the capability of multiple fixation points, each attended to scene content at different…

Computer Vision and Pattern Recognition · Computer Science 2017-02-07 Kota Hara , Ming-Yu Liu , Oncel Tuzel , Amir-massoud Farahmand

Show, Attend and Tell: Neural Image Caption Generation with Visual Attention

Inspired by recent work in machine translation and object detection, we introduce an attention based model that automatically learns to describe the content of images. We describe how we can train this model in a deterministic manner using…

Machine Learning · Computer Science 2016-04-20 Kelvin Xu , Jimmy Ba , Ryan Kiros , Kyunghyun Cho , Aaron Courville , Ruslan Salakhutdinov , Richard Zemel , Yoshua Bengio

Deep Attentive Tracking via Reciprocative Learning

Visual attention, derived from cognitive neuroscience, facilitates human perception on the most pertinent subset of the sensory data. Recently, significant efforts have been made to exploit attention schemes to advance computer vision…

Computer Vision and Pattern Recognition · Computer Science 2018-10-16 Shi Pu , Yibing Song , Chao Ma , Honggang Zhang , Ming-Hsuan Yang

Ripple Attention for Visual Perception with Sub-quadratic Complexity

Transformer architectures are now central to sequence modeling tasks. At its heart is the attention mechanism, which enables effective modeling of long-term dependencies in a sequence. Recently, transformers have been successfully applied…

Computer Vision and Pattern Recognition · Computer Science 2022-06-16 Lin Zheng , Huijie Pan , Lingpeng Kong