Related papers: Visual Translation Embedding Network for Visual Re…

Contextual Translation Embedding for Visual Relationship Detection and Scene Graph Generation

Relations amongst entities play a central role in image understanding. Due to the complexity of modeling (subject, predicate, object) relation triplets, it is crucial to develop a method that can not only recognize seen relations, but also…

Computer Vision and Pattern Recognition · Computer Science · 2020-11-19 · Zih-Siou Hung, Arun Mallya, Svetlana Lazebnik

Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection

Detecting visual relationships, i.e. <Subject, Predicate, Object> triplets, is a challenging Scene Understanding task approached in the past via linguistic priors or spatial information in a single feature branch. We introduce a new deeply…

Computer Vision and Pattern Recognition · Computer Science · 2019-02-18 · Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Koutras, Athanasia Zlatintsi, Petros Maragos

STransE: a novel embedding model of entities and relationships in knowledge bases

Knowledge bases of real-world facts about entities and their relationships are useful resources for a variety of natural language processing tasks. However, because knowledge bases are typically incomplete, it is useful to be able to…

Computation and Language · Computer Science · 2017-03-09 · Dat Quoc Nguyen, Kairit Sirts, Lizhen Qu, Mark Johnson

Visual Relationship Detection with Language Priors

Visual relationships capture a wide variety of interactions between pairs of objects in images (e.g. "man riding bicycle" and "man pushing bicycle"). Consequently, the set of possible relationships is extremely large and it is difficult to…

Computer Vision and Pattern Recognition · Computer Science · 2016-08-02 · Cewu Lu, Ranjay Krishna, Michael Bernstein, Li Fei-Fei

Visual Relationship Detection with Visual-Linguistic Knowledge from Multimodal Representations

Visual relationship detection aims to reason over relationships among salient objects in images, which has drawn increasing attention over the past few years. Inspired by human reasoning mechanisms, it is believed that external visual…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-06 · Meng-Jiun Chiou, Roger Zimmermann, Jiashi Feng

RatE: Relation-Adaptive Translating Embedding for Knowledge Graph Completion

Many graph embedding approaches have been proposed for knowledge graph completion via link prediction. Among those, translating embedding approaches enjoy the advantages of light-weight structure, high efficiency and great interpretability.…

Computation and Language · Computer Science · 2020-10-13 · Hao Huang, Guodong Long, Tao Shen, Jing Jiang, Chengqi Zhang

Detecting Visual Relationships with Deep Relational Networks

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task.…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-13 · Bo Dai, Yuqi Zhang, Dahua Lin

Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment

Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs. Inasmuch as related knowledge bases are built in several different languages, achieving cross-lingual knowledge…

Artificial Intelligence · Computer Science · 2017-05-19 · Muhao Chen, Yingtao Tian, Mohan Yang, Carlo Zaniolo

VReBERT: A Simple and Flexible Transformer for Visual Relationship Detection

Visual Relationship Detection (VRD) impels a computer vision model to 'see' beyond an individual object instance and 'understand' how different objects in a scene are related. The traditional way of VRD is first to detect objects in an…

Computer Vision and Pattern Recognition · Computer Science · 2022-06-22 · Yu Cui, Moshiur Farazi

Visual Relationship Detection with Relative Location Mining

Visual relationship detection, as a challenging task used to find and distinguish the interactions between object pairs in one image, has received much attention recently. In this work, we propose a novel visual relationship detection…

Computer Vision and Pattern Recognition · Computer Science · 2019-11-05 · Hao Zhou, Chongyang Zhang, Chuanping Hu

CREPE: Learnable Prompting With CLIP Improves Visual Relationship Prediction

In this paper, we explore the potential of Vision-Language Models (VLMs), specifically CLIP, in predicting visual object relationships, which involves interpreting visual features from images into language-based relations. Current…

Computer Vision and Pattern Recognition · Computer Science · 2023-07-20 · Rakshith Subramanyam, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan

Relation Networks for Object Detection

Although it is well believed for years that modeling relations between objects would help object recognition, there has not been evidence that the idea is working in the deep learning era. All state-of-the-art object detection systems still…

Computer Vision and Pattern Recognition · Computer Science · 2018-06-15 · Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei

Visual Relationship Detection with Language prior and Softmax

Visual relationship detection is an intermediate image understanding task that detects two objects and classifies a predicate that explains the relationship between two objects in an image. The three components are linguistically and…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-17 · Jaewon Jung, Jongyoul Park

VICE: Variational Interpretable Concept Embeddings

A central goal in the cognitive sciences is the development of numerical models for mental representations of object concepts. This paper introduces Variational Interpretable Concept Embeddings (VICE), an approximate Bayesian method for…

Machine Learning · Computer Science · 2022-10-07 · Lukas Muttenthaler, Charles Y. Zheng, Patrick McClure, Robert A. Vandermeulen, Martin N. Hebart, Francisco Pereira

Visual Grounding with Transformers

In this paper, we propose a transformer based approach for visual grounding. Unlike previous proposal-and-rank frameworks that rely heavily on pretrained object detectors or proposal-free frameworks that upgrade an off-the-shelf one-stage…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-15 · Ye Du, Zehua Fu, Qingjie Liu, Yunhong Wang

Large-Scale Visual Relationship Understanding

Large scale visual understanding is challenging, as it requires a model to handle the widely-spread and imbalanced distribution of <subject, relation, object> triples. In real-world scenarios with large numbers of objects and relations,…

Computer Vision and Pattern Recognition · Computer Science · 2019-08-20 · Ji Zhang, Yannis Kalantidis, Marcus Rohrbach, Manohar Paluri, Ahmed Elgammal, Mohamed Elhoseiny

Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

Visual relationship detection is fundamental for holistic image understanding. However, the localization and classification of (subject, predicate, object) triplets remain challenging tasks, due to the combinatorial explosion of possible…

Computer Vision and Pattern Recognition · Computer Science · 2020-07-21 · Federico Baldassarre, Kevin Smith, Josephine Sullivan, Hossein Azizpour

Natural Language Guided Visual Relationship Detection

Reasoning about the relationships between object pairs in images is a crucial task for holistic scene understanding. Most of the existing works treat this task as a pure visual classification task: each type of relationship or phrase is…

Computer Vision and Pattern Recognition · Computer Science · 2017-11-22 · Wentong Liao, Lin Shuai, Bodo Rosenhahn, Michael Ying Yang

Generalized Visual Relation Detection with Diffusion Models

Visual relation detection (VRD) aims to identify relationships (or interactions) between object pairs in an image. Although recent VRD models have achieved impressive performance, they are all restricted to pre-defined relation categories,…

Computer Vision and Pattern Recognition · Computer Science · 2025-04-17 · Kaifeng Gao, Siqi Chen, Hanwang Zhang, Jun Xiao, Yueting Zhuang, Qianru Sun

ViP-CNN: Visual Phrase Guided Convolutional Neural Network

As the intermediate level task connecting image captioning and object detection, visual relationship detection started to catch researchers' attention because of its descriptive power and clear structure. It detects the objects and captures…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-11 · Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiao'ou Tang