Related papers: Learning Visual Context by Comparison

Co-Attention for Conditioned Image Matching

We propose a new approach to determine correspondences between image pairs in the wild under large changes in illumination, viewpoint, context, and material. While other approaches find correspondences between pairs of images by treating…

Computer Vision and Pattern Recognition · Computer Science 2021-03-29 Olivia Wiles , Sebastien Ehrhardt , Andrew Zisserman

AMC: Attention guided Multi-modal Correlation Learning for Image Search

Given a user's query, traditional image search systems rank images according to its relevance to a single modality (e.g., image content or surrounding text). Nowadays, an increasing number of images on the Internet are available with…

Computer Vision and Pattern Recognition · Computer Science 2017-04-05 Kan Chen , Trung Bui , Fang Chen , Zhaowen Wang , Ram Nevatia

Attention-guided Context Feature Pyramid Network for Object Detection

For object detection, how to address the contradictory requirement between feature map resolution and receptive field on high-resolution inputs still remains an open question. In this paper, to tackle this issue, we build a novel…

Computer Vision and Pattern Recognition · Computer Science 2020-05-26 Junxu Cao , Qi Chen , Jun Guo , Ruichao Shi

Position-Aware Contrastive Alignment for Referring Image Segmentation

Referring image segmentation aims to segment the target object described by a given natural language expression. Typically, referring expressions contain complex relationships between the target and its surrounding objects. The main…

Computer Vision and Pattern Recognition · Computer Science 2022-12-29 Bo Chen , Zhiwei Hu , Zhilong Ji , Jinfeng Bai , Wangmeng Zuo

Regional Active Contours based on Variational level sets and Machine Learning for Image Segmentation

Image segmentation is the problem of partitioning an image into different subsets, where each subset may have a different characterization in terms of color, intensity, texture, and/or other features. Segmentation is a fundamental component…

Computer Vision and Pattern Recognition · Computer Science 2015-11-03 M. Abdelsamea

Location-Aware Pretraining for Medical Difference Visual Question Answering

Differential medical VQA models compare multiple images to identify clinically meaningful changes and rely on vision encoders to capture fine-grained visual differences that reflect radiologists' comparative diagnostic workflows. However,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Denis Musinguzi , Caren Han , Prasenjit Mitra

Context Encoding Chest X-rays

Chest X-rays are one of the most commonly used technologies for medical diagnosis. Many deep learning models have been proposed to improve and automate the abnormality detection task on this type of data. In this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2019-04-10 Davide Belli , Shi Hu , Ecem Sogancioglu , Bram van Ginneken

Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing

Most of computer vision focuses on what is in an image. We propose to train a standalone object-centric context representation to perform the opposite task: seeing what is not there. Given an image, our context model can predict where…

Computer Vision and Pattern Recognition · Computer Science 2017-02-28 Jin Sun , David W. Jacobs

Deep Contextual Attention for Human-Object Interaction Detection

Human-object interaction detection is an important and relatively new class of visual relationship detection tasks, essential for deeper scene understanding. Most existing approaches decompose the problem into object localization and…

Computer Vision and Pattern Recognition · Computer Science 2019-10-18 Tiancai Wang , Rao Muhammad Anwer , Muhammad Haris Khan , Fahad Shahbaz Khan , Yanwei Pang , Ling Shao , Jorma Laaksonen

Attend what matters: Leveraging vision foundational models for breast cancer classification using mammograms

Vision Transformers $(\texttt{ViT})$ have become the architecture of choice for many computer vision tasks, yet their performance in computer-aided diagnostics remains limited. Focusing on breast cancer detection from mammograms, we…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Samyak Sanghvi , Piyush Miglani , Sarvesh Shashikumar , Kaustubh R Borgavi , Veenu Singla , Chetan Arora

Permutohedral Attention Module for Efficient Non-Local Neural Networks

Medical image processing tasks such as segmentation often require capturing non-local information. As organs, bones, and tissues share common characteristics such as intensity, shape, and texture, the contextual information plays a critical…

Computer Vision and Pattern Recognition · Computer Science 2019-10-22 Samuel Joutard , Reuben Dorent , Amanda Isaac , Sebastien Ourselin , Tom Vercauteren , Marc Modat

Learning to Agree on Vision Attention for Visual Commonsense Reasoning

Visual Commonsense Reasoning (VCR) remains a significant yet challenging research problem in the realm of visual reasoning. A VCR model generally aims at answering a textual question regarding an image, followed by the rationale prediction…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Zhenyang Li , Yangyang Guo , Kejie Wang , Fan Liu , Liqiang Nie , Mohan Kankanhalli

Merging Context Clustering with Visual State Space Models for Medical Image Segmentation

Medical image segmentation demands the aggregation of global and local feature representations, posing a challenge for current methodologies in handling both long-range and short-range feature interactions. Recently, vision mamba (ViM)…

Computer Vision and Pattern Recognition · Computer Science 2025-01-06 Yun Zhu , Dong Zhang , Yi Lin , Yifei Feng , Jinhui Tang

Image similarity has been extensively studied in computer vision. In recent years, machine-learned models have shown their ability to encode more semantics than traditional multivariate metrics. However, in labelling semantic similarity,…

Computer Vision and Pattern Recognition · Computer Science 2024-09-11 Zukang Liao , Min Chen

Attentive Contexts for Object Detection

Modern deep neural network based object detection methods typically classify candidate proposals using their interior features. However, global and local surrounding contexts that are believed to be valuable for object detection are not…

Computer Vision and Pattern Recognition · Computer Science 2016-03-25 Jianan Li , Yunchao Wei , Xiaodan Liang , Jian Dong , Tingfa Xu , Jiashi Feng , Shuicheng Yan

Attend to the Difference: Cross-Modality Person Re-identification via Contrastive Correlation

The problem of cross-modality person re-identification has been receiving increasing attention recently, due to its practical significance. Motivated by the fact that human usually attend to the difference when they compare two similar…

Computer Vision and Pattern Recognition · Computer Science 2021-12-02 Shizhou Zhang , Yifei Yang , Peng Wang , Guoqiang Liang , Xiuwei Zhang , Yanning Zhang

Contrastive Attention for Automatic Chest X-ray Report Generation

Recently, chest X-ray report generation, which aims to automatically generate descriptions of given chest X-ray images, has received growing research interests. The key challenge of chest X-ray report generation is to accurately capture and…

Computer Vision and Pattern Recognition · Computer Science 2023-04-12 Fenglin Liu , Changchang Yin , Xian Wu , Shen Ge , Yuexian Zou , Ping Zhang , Yuexian Zou , Xu Sun

Correlational Image Modeling for Self-Supervised Visual Pre-Training

We introduce Correlational Image Modeling (CIM), a novel and surprisingly effective approach to self-supervised visual pre-training. Our CIM performs a simple pretext task: we randomly crop image regions (exemplars) from an input image…

Computer Vision and Pattern Recognition · Computer Science 2023-03-31 Wei Li , Jiahao Xie , Chen Change Loy

Exploring Person Context and Local Scene Context for Object Detection

In this paper we explore two ways of using context for object detection. The first model focusses on people and the objects they commonly interact with, such as fashion and sports accessories. The second model considers more general object…

Computer Vision and Pattern Recognition · Computer Science 2015-11-26 Saurabh Gupta , Bharath Hariharan , Jitendra Malik

AnaXNet: Anatomy Aware Multi-label Finding Classification in Chest X-ray

Radiologists usually observe anatomical regions of chest X-ray images as well as the overall image before making a decision. However, most existing deep learning models only look at the entire X-ray image for classification, failing to…

Computer Vision and Pattern Recognition · Computer Science 2021-05-21 Nkechinyere N. Agu , Joy T. Wu , Hanqing Chao , Ismini Lourentzou , Arjun Sharma , Mehdi Moradi , Pingkun Yan , James Hendler