Related papers: Structured Visual Search via Composition-aware Lea…

Learning Structured Representations of Visual Scenes

As the intermediate-level representations bridging the two levels, structured representations of visual scenes, such as visual relationships between pairwise objects, have been shown to not only benefit compositional models in learning to…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Meng-Jiun Chiou

Structured Query-Based Image Retrieval Using Scene Graphs

A structured query can capture the complexity of object interactions (e.g. 'woman rides motorcycle') unlike single objects (e.g. 'woman' or 'motorcycle'). Retrieval using structured queries therefore is much more useful than single object…

Computer Vision and Pattern Recognition · Computer Science 2020-05-15 Brigit Schroeder , Subarna Tripathi

Structure-aware Visualization Retrieval

With the wide usage of data visualizations, a huge number of Scalable Vector Graphic (SVG)-based visualizations have been created and shared online. Accordingly, there has been an increasing interest in exploring how to retrieve…

Human-Computer Interaction · Computer Science 2022-02-15 Haotian Li , Yong Wang , Aoyu Wu , Huan Wei , Huamin Qu

Flexible Compositional Learning of Structured Visual Concepts

Humans are highly efficient learners, with the ability to grasp the meaning of a new concept from just a few examples. Unlike popular computer vision systems, humans can flexibly leverage the compositional structure of the visual world,…

Computer Vision and Pattern Recognition · Computer Science 2021-05-21 Yanli Zhou , Brenden M. Lake

A Guide to Structureless Visual Localization

Visual localization algorithms, i.e., methods that estimate the camera pose of a query image in a known scene, are core components of many applications, including self-driving cars and augmented / mixed reality systems. State-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2025-04-25 Vojtech Panek , Qunjie Zhou , Yaqing Ding , Sérgio Agostinho , Zuzana Kukelova , Torsten Sattler , Laura Leal-Taixé

Combining Absolute and Semi-Generalized Relative Poses for Visual Localization

Visual localization is the problem of estimating the camera pose of a given query image within a known scene. Most state-of-the-art localization approaches follow the structure-based paradigm and use 2D-3D matches between pixels in a query…

Computer Vision and Pattern Recognition · Computer Science 2024-09-24 Vojtech Panek , Torsten Sattler , Zuzana Kukelova

Modeling Rational Adaptation of Visual Search to Hierarchical Structures

Efficient attention deployment in visual search is limited by human visual memory, yet this limitation can be offset by exploiting the environment's structure. This paper introduces a computational cognitive model that simulates how the…

Human-Computer Interaction · Computer Science 2024-09-16 Saku Sourulahti , Christian P Janssen , Jussi PP Jokinen

Structured Analysis Dictionary Learning for Image Classification

We propose a computationally efficient and high-performance classification algorithm by incorporating class structural information in analysis dictionary learning. To achieve more consistent classification, we associate a class…

Computer Vision and Pattern Recognition · Computer Science 2018-05-03 Wen Tang , Ashkan Panahi , Hamid Krim , Liyi Dai

Compositional Sketch Search

We present an algorithm for searching image collections using free-hand sketches that describe the appearance and relative positions of multiple objects. Sketch based image retrieval (SBIR) methods predominantly match queries containing a…

Computer Vision and Pattern Recognition · Computer Science 2021-06-16 Alexander Black , Tu Bui , Long Mai , Hailin Jin , John Collomosse

Does Data Scaling Lead to Visual Compositional Generalization?

Compositional understanding is crucial for human intelligence, yet it remains unclear whether contemporary vision models exhibit it. The dominant machine learning paradigm is built on the premise that scaling data and model sizes will…

Machine Learning · Computer Science 2025-07-10 Arnas Uselis , Andrea Dittadi , Seong Joon Oh

Learning Structured Semantic Embeddings for Visual Recognition

Numerous embedding models have been recently explored to incorporate semantic knowledge into visual recognition. Existing methods typically focus on minimizing the distance between the corresponding images and texts in the embedding space…

Computer Vision and Pattern Recognition · Computer Science 2017-06-06 Dong Li , Hsin-Ying Lee , Jia-Bin Huang , Shengjin Wang , Ming-Hsuan Yang

Learning a Hierarchical Compositional Shape Vocabulary for Multi-class Object Representation

Hierarchies allow feature sharing between objects at multiple levels of representation, can code exponential variability in a very compact way and enable fast inference. This makes them potentially suitable for learning and recognizing a…

Computer Vision and Pattern Recognition · Computer Science 2014-08-26 Sanja Fidler , Marko Boben , Ales Leonardis

Searching for Objects using Structure in Indoor Scenes

To identify the location of objects of a particular class, a passive computer vision system generally processes all the regions in an image to finally output few regions. However, we can use structure in the scene to search for objects…

Computer Vision and Pattern Recognition · Computer Science 2016-08-09 Varun K. Nagaraja , Vlad I. Morariu , Larry S. Davis

A causal view of compositional zero-shot recognition

People easily recognize new visual categories that are new combinations of known components. This compositional generalization capacity is critical for learning in real-world domains like vision and language because the long tail of new…

Computer Vision and Pattern Recognition · Computer Science 2020-11-03 Yuval Atzmon , Felix Kreuk , Uri Shalit , Gal Chechik

Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations

Recognizing visual entities in a natural language sentence and arranging them in a 2D spatial layout require a compositional understanding of language and space. This task of layout prediction is valuable in text-to-image synthesis as it…

Computation and Language · Computer Science 2024-12-03 Wolf Nuyts , Ruben Cartuyvels , Marie-Francine Moens

Learning to Compose: Improving Object Centric Learning by Injecting Compositionality

Learning compositional representation is a key aspect of object-centric learning as it enables flexible systematic generalization and supports complex visual reasoning. However, most of the existing approaches rely on auto-encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Whie Jung , Jaehoon Yoo , Sungjin Ahn , Seunghoon Hong

Linear Spaces of Meanings: Compositional Structures in Vision-Language Models

We investigate compositional structures in data embeddings from pre-trained vision-language models (VLMs). Traditionally, compositionality has been associated with algebraic operations on embeddings of words from a pre-existing vocabulary.…

Machine Learning · Computer Science 2024-01-12 Matthew Trager , Pramuditha Perera , Luca Zancato , Alessandro Achille , Parminder Bhatia , Stefano Soatto

Visual-Semantic Embedding Model Informed by Structured Knowledge

We propose a novel approach to improve a visual-semantic embedding model by incorporating concept representations captured from an external structured knowledge base. We investigate its performance on image classification under both…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Mirantha Jayathilaka , Tingting Mu , Uli Sattler

MIC: Mining Interclass Characteristics for Improved Metric Learning

Metric learning seeks to embed images of objects suchthat class-defined relations are captured by the embeddingspace. However, variability in images is not just due to different depicted object classes, but also depends on other latent…

Computer Vision and Pattern Recognition · Computer Science 2019-09-26 Karsten Roth , Biagio Brattoli , Björn Ommer

In this vision paper, we propose a shift in perspective for improving the effectiveness of similarity search. Rather than focusing solely on enhancing the data quality, particularly machine learning-generated embeddings, we advocate for a…

Databases · Computer Science 2023-08-03 Renzhi Wu , Jingfan Meng , Jie Jeff Xu , Huayi Wang , Kexin Rong