Related papers: Encoding Spatial Relations from Natural Language

From Spatial Relations to Spatial Configurations

Spatial Reasoning from language is essential for natural language understanding. Supporting it requires a representation scheme that can capture spatial phenomena encountered in language as well as in images and videos. Existing spatial…

Computation and Language · Computer Science 2020-07-21 Soham Dan , Parisa Kordjamshidi , Julia Bonn , Archna Bhatia , Jon Cai , Martha Palmer , Dan Roth

Understanding Spatial Relations through Multiple Modalities

Recognizing spatial relations and reasoning about them is essential in multiple applications including navigation, direction giving and human-computer interaction in general. Spatial relations between objects can either be explicit --…

Computation and Language · Computer Science 2020-07-21 Soham Dan , Hangfeng He , Dan Roth

Linearity of Relation Decoding in Transformer Language Models

Much of the knowledge encoded in transformer language models (LMs) may be expressed in terms of relations: relations between words and their synonyms, entities and their attributes, etc. We show that, for a subset of relations, this…

Computation and Language · Computer Science 2024-02-19 Evan Hernandez , Arnab Sen Sharma , Tal Haklay , Kevin Meng , Martin Wattenberg , Jacob Andreas , Yonatan Belinkov , David Bau

Invariant Features in Language Models: Geometric Characterization and Model Attribution

Language models exhibit strong robustness to paraphrasing, suggesting that semantic information may be encoded through stable internal representations, yet the structure and origin of such invariance remain unclear. We propose a local…

Machine Learning · Computer Science 2026-05-08 Agnibh Dasgupta , Abdullah Tanvir , Xin Zhong

Learning Word Embeddings from Intrinsic and Extrinsic Views

While word embeddings are currently predominant for natural language processing, most of existing models learn them solely from their contexts. However, these context-based word embeddings are limited since not all words' meaning can be…

Computation and Language · Computer Science 2016-08-23 Jifan Chen , Kan Chen , Xipeng Qiu , Qi Zhang , Xuanjing Huang , Zheng Zhang

Natural Language and Spatial Rules

We develop a system that formally represents spatial semantics concepts within natural language descriptions of spatial arrangements. The system builds on a model of spatial semantics representation according to which words in a sentence…

Computation and Language · Computer Science 2021-11-30 Alexandros Haridis , Stella Rossikopoulou Pappa

The Acquisition of Semantic Relationships between words

The study of semantic relationships has revealed a close connection between these relationships and the morphological characteristics of a language. Morphology, as a subfield of linguistics, investigates the internal structure and formation…

Computation and Language · Computer Science 2023-07-14 Mohamed Naamane

Visual Reasoning with Natural Language

Natural language provides a widely accessible and expressive interface for robotic agents. To understand language in complex environments, agents must reason about the full range of language inputs and their correspondence to the world.…

Computation and Language · Computer Science 2017-10-03 Stephanie Zhou , Alane Suhr , Yoav Artzi

Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning

In natural language processing, most models try to learn semantic representations merely from texts. The learned representations encode the distributional semantics but fail to connect to any knowledge about the physical world. In contrast,…

Computation and Language · Computer Science 2021-11-16 Yizhen Zhang , Minkyu Choi , Kuan Han , Zhongming Liu

Representations in vision and language converge in a shared, multidimensional space of perceived similarities

Humans can effortlessly describe what they see, yet establishing a shared representational format between vision and language remains a significant challenge. Emerging evidence suggests that human brain representations in both vision and…

Neurons and Cognition · Quantitative Biology 2025-07-30 Katerina Marie Simkova , Adrien Doerig , Clayton Hickey , Ian Charest

Do Trajectories Encode Verb Meaning?

Distributional models learn representations of words from text, but are criticized for their lack of grounding, or the linking of text to the non-linguistic world. Grounded language models have had success in learning to connect concrete…

Computation and Language · Computer Science 2022-06-27 Dylan Ebert , Chen Sun , Ellie Pavlick

Incorporating Visual Semantics into Sentence Representations within a Grounded Space

Language grounding is an active field aiming at enriching textual representations with visual information. Generally, textual and visual elements are embedded in the same representation space, which implicitly assumes a one-to-one…

Computation and Language · Computer Science 2020-02-10 Patrick Bordes , Eloi Zablocki , Laure Soulier , Benjamin Piwowarski , Patrick Gallinari

Structured Relational Representations

Invariant representations are core to representation learning, yet a central challenge remains: uncovering invariants that are stable and transferable without suppressing task-relevant signals. This raises fundamental questions, requiring…

Machine Learning · Computer Science 2025-09-29 Arun Kumar , Paul Schrater

Universal Multimodal Representation for Language Understanding

Representation learning is the foundation of natural language processing (NLP). This work presents new methods to employ visual information as assistant signals to general NLP tasks. For each sentence, we first retrieve a flexible number of…

Computation and Language · Computer Science 2023-01-10 Zhuosheng Zhang , Kehai Chen , Rui Wang , Masao Utiyama , Eiichiro Sumita , Zuchao Li , Hai Zhao

Deep Visual-Semantic Alignments for Generating Image Descriptions

We present a model that generates natural language descriptions of images and their regions. Our approach leverages datasets of images and their sentence descriptions to learn about the inter-modal correspondences between language and…

Computer Vision and Pattern Recognition · Computer Science 2015-04-15 Andrej Karpathy , Li Fei-Fei

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions

Natural language definitions possess a recursive, self-explanatory semantic structure that can support representation learning methods able to preserve explicit conceptual relations and constraints in the latent space. This paper presents a…

Computation and Language · Computer Science 2024-02-19 Marco Valentino , Danilo S. Carvalho , André Freitas

Temporal Grounding Graphs for Language Understanding with Accrued Visual-Linguistic Context

A robot's ability to understand or ground natural language instructions is fundamentally tied to its knowledge about the surrounding world. We present an approach to grounding natural language utterances in the context of factual…

Robotics · Computer Science 2018-11-19 Rohan Paul , Andrei Barbu , Sue Felshin , Boris Katz , Nicholas Roy

Explicitly Representing Syntax Improves Sentence-to-layout Prediction of Unexpected Situations

Recognizing visual entities in a natural language sentence and arranging them in a 2D spatial layout require a compositional understanding of language and space. This task of layout prediction is valuable in text-to-image synthesis as it…

Computation and Language · Computer Science 2024-12-03 Wolf Nuyts , Ruben Cartuyvels , Marie-Francine Moens

Visual Spatial Reasoning

Spatial relations are a basic part of human cognition. However, they are expressed in natural language in a variety of ways, and previous work has suggested that current vision-and-language models (VLMs) struggle to capture relational…

Computation and Language · Computer Science 2023-03-23 Fangyu Liu , Guy Emerson , Nigel Collier

Semantic Representations of Word Senses and Concepts

Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most…

Computation and Language · Computer Science 2016-08-04 José Camacho-Collados , Ignacio Iacobacci , Roberto Navigli , Mohammad Taher Pilehvar