Related papers: Towards Visual Semantics

Classifying concepts via visual properties

We assume that substances in the world are represented by two types of concepts, namely substance concepts and classification concepts, the former instrumental to (visual) perception, the latter to (language based) classification. Based on…

Artificial Intelligence · Computer Science 2021-05-21 Fausto Giunchiglia , Mayukh Bagchi

Egocentric Hierarchical Visual Semantics

We are interested in aligning how people think about objects and what machines perceive, meaning by this the fact that object recognition, as performed by a machine, should follow a process which resembles that followed by humans when…

Artificial Intelligence · Computer Science 2023-05-10 Luca Erculiani , Andrea Bontempelli , Andrea Passerini , Fausto Giunchiglia

Learning Multi-Modal Word Representation Grounded in Visual Context

Representing the semantics of words is a long-standing problem for the natural language processing community. Most methods compute word semantics given their textual context in large corpora. More recently, researchers attempted to…

Computation and Language · Computer Science 2017-11-10 Éloi Zablocki , Benjamin Piwowarski , Laure Soulier , Patrick Gallinari

Object Recognition as Classification via Visual Properties

We base our work on the teleosemantic modelling of concepts as abilities implementing the distinct functions of recognition and classification. Accordingly, we model two types of concepts - substance concepts suited for object recognition…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Fausto Giunchiglia , Mayukh Bagchi

Aligning Visual and Lexical Semantics

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Fausto Giunchiglia , Mayukh Bagchi , Xiaolei Diao

Learning Object Semantic Similarity with Self-Supervision

Humans judge the similarity of two objects not just based on their visual appearance but also based on their semantic relatedness. However, it remains unclear how humans learn about semantic relationships between objects and categories. One…

Computer Vision and Pattern Recognition · Computer Science 2024-05-09 Arthur Aubret , Timothy Schaumlöffel , Gemma Roig , Jochen Triesch

Semantic projection: recovering human knowledge of multiple, distinct object features from word embeddings

The words of a language reflect the structure of the human mind, allowing us to transmit thoughts between individuals. However, language can represent only a subset of our rich and detailed cognitive architecture. Here, we ask what kinds of…

Computation and Language · Computer Science 2018-03-07 Gabriel Grand , Idan Asher Blank , Francisco Pereira , Evelina Fedorenko

Language-Informed Visual Concept Learning

Our understanding of the visual world is centered around various concept axes, characterizing different aspects of visual entities. While different concept axes can be easily specified by language, e.g. color, the exact visual nuances along…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Sharon Lee , Yunzhi Zhang , Shangzhe Wu , Jiajun Wu

Interacting Conceptual Spaces I : Grammatical Composition of Concepts

The categorical compositional approach to meaning has been successfully applied in natural language processing, outperforming other models in mainstream empirical language processing tasks. We show how this approach can be generalized to…

Logic in Computer Science · Computer Science 2017-10-02 Joe Bolt , Bob Coecke , Fabrizio Genovese , Martha Lewis , Dan Marsden , Robin Piedeleu

The Elements of Visual Art Recommendation: Learning Latent Semantic Representations of Paintings

Artwork recommendation is challenging because it requires understanding how users interact with highly subjective content, the complexity of the concepts embedded within the artwork, and the emotional and cognitive reflections they may…

Information Retrieval · Computer Science 2023-03-21 Bereket A. Yilma , Luis A. Leiva

Building a visual semantics aware object hierarchy

The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Xiaolei Diao

Human-like compositional learning of visually-grounded concepts using synthetic environments

The compositional structure of language enables humans to decompose complex phrases and map them to novel visual concepts, showcasing flexible intelligence. While several algorithms exhibit compositionality, they fail to elucidate how…

Computer Vision and Pattern Recognition · Computer Science 2025-04-10 Zijun Lin , M Ganesh Kumar , Cheston Tan

Semantic Representations of Word Senses and Concepts

Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most…

Computation and Language · Computer Science 2016-08-04 José Camacho-Collados , Ignacio Iacobacci , Roberto Navigli , Mohammad Taher Pilehvar

Automatic Concept Discovery from Parallel Text and Visual Corpora

Humans connect language and vision to perceive the world. How to build a similar connection for computers? One possible way is via visual concepts, which are text terms that relate to visually discriminative entities. We propose an…

Computer Vision and Pattern Recognition · Computer Science 2015-09-25 Chen Sun , Chuang Gan , Ram Nevatia

Toward a Visual Concept Vocabulary for GAN Latent Space

A large body of recent work has identified transformations in the latent spaces of generative adversarial networks (GANs) that consistently and interpretably transform generated images. But existing techniques for identifying these…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Sarah Schwettmann , Evan Hernandez , David Bau , Samuel Klein , Jacob Andreas , Antonio Torralba

Learning Functional Distributional Semantics with Visual Data

Functional Distributional Semantics is a recently proposed framework for learning distributional semantics that provides linguistic interpretability. It models the meaning of a word as a binary classifier rather than a numerical vector. In…

Computation and Language · Computer Science 2022-04-25 Yinhong Liu , Guy Emerson

The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision

We propose the Neuro-Symbolic Concept Learner (NS-CL), a model that learns visual concepts, words, and semantic parsing of sentences without explicit supervision on any of them; instead, our model learns by simply looking at images and…

Computer Vision and Pattern Recognition · Computer Science 2019-04-30 Jiayuan Mao , Chuang Gan , Pushmeet Kohli , Joshua B. Tenenbaum , Jiajun Wu

Context Matters: Learning Global Semantics via Object-Centric Representation

Recent advances in language modeling have witnessed the rise of highly desirable emergent capabilities, such as reasoning and in-context learning. However, vision models have yet to exhibit comparable progress in these areas. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Jike Zhong , Yuxiang Lai , Xiaofeng Yang , Konstantinos Psounis

Deriving Visual Semantics from Spatial Context: An Adaptation of LSA and Word2Vec to generate Object and Scene Embeddings from Images

Embeddings are an important tool for the representation of word meaning. Their effectiveness rests on the distributional hypothesis: words that occur in the same context carry similar semantic information. Here, we adapt this approach to…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Matthias S. Treder , Juan Mayor-Torres , Christoph Teufel

Visual Conceptual Blending with Large-scale Language and Vision Models

We ask the question: to what extent can recent large-scale language and image generation models blend visual concepts? Given an arbitrary object, we identify a relevant object and generate a single-sentence description of the blend of the…

Computation and Language · Computer Science 2021-06-29 Songwei Ge , Devi Parikh