Related papers: Improving Visual Relation Detection using Depth Ma…

Visual Relationship Detection with Relative Location Mining

Visual relationship detection, as a challenging task used to find and distinguish the interactions between object pairs in one image, has received much attention recently. In this work, we propose a novel visual relationship detection…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Hao Zhou , Chongyang Zhang , Chuanping Hu

Visual Relationship Detection Based on Guided Proposals and Semantic Knowledge Distillation

A thorough comprehension of image content demands a complex grasp of the interactions that may occur in the natural world. One of the key issues is to describe the visual relationships between objects. When dealing with real world data,…

Computer Vision and Pattern Recognition · Computer Science 2018-05-29 François Plesse , Alexandru Ginsca , Bertrand Delezoide , Françoise Prêteux

VrR-VG: Refocusing Visually-Relevant Relationships

Relationships encode the interactions among individual instances, and play a critical role in deep visual scene understanding. Suffering from the high predictability with non-visual information, existing methods tend to fit the statistical…

Computer Vision and Pattern Recognition · Computer Science 2019-08-27 Yuanzhi Liang , Yalong Bai , Wei Zhang , Xueming Qian , Li Zhu , Tao Mei

Knowledge-augmented Few-shot Visual Relation Detection

Visual Relation Detection (VRD) aims to detect relationships between objects for image understanding. Most existing VRD methods rely on thousands of training samples of each relationship to achieve satisfactory performance. Some recent…

Computer Vision and Pattern Recognition · Computer Science 2023-03-10 Tianyu Yu , Yangning Li , Jiaoyan Chen , Yinghui Li , Hai-Tao Zheng , Xi Chen , Qingbin Liu , Wenqiang Liu , Dongxiao Huang , Bei Wu , Yexin Wang

Predicting Relative Depth between Objects from Semantic Features

Vision and language tasks such as Visual Relation Detection and Visual Question Answering benefit from semantic features that afford proper grounding of language. The 3D depth of objects depicted in 2D images is one such feature. However it…

Computer Vision and Pattern Recognition · Computer Science 2021-01-13 Stefan Cassar , Adrian Muscat , Dylan Seychell

Visual Relationship Detection using Scene Graphs: A Survey

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal , Ayush Mangal , Vipul

Natural Language Guided Visual Relationship Detection

Reasoning about the relationships between object pairs in images is a crucial task for holistic scene understanding. Most of the existing works treat this task as a pure visual classification task: each type of relationship or phrase is…

Computer Vision and Pattern Recognition · Computer Science 2017-11-22 Wentong Liao , Lin Shuai , Bodo Rosenhahn , Michael Ying Yang

Objects Matter: Learning Object Relation Graph for Robust Camera Relocalization

Visual relocalization aims to estimate the pose of a camera from one or more images. In recent years deep learning based pose regression methods have attracted many attentions. They feature predicting the absolute poses without relying on…

Computer Vision and Pattern Recognition · Computer Science 2022-05-27 Chengyu Qiao , Zhiyu Xiang , Xinglu Wang

Detecting Visual Relationships with Deep Relational Networks

Relationships among objects play a crucial role in image understanding. Despite the great success of deep learning techniques in recognizing individual objects, reasoning about the relationships among objects remains a challenging task.…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Bo Dai , Yuqi Zhang , Dahua Lin

Leveraging Auxiliary Text for Deep Recognition of Unseen Visual Relationships

One of the most difficult tasks in scene understanding is recognizing interactions between objects in an image. This task is often called visual relationship detection (VRD). We consider the question of whether, given auxiliary textual data…

Computer Vision and Pattern Recognition · Computer Science 2019-10-29 Gal Sadeh Kenigsfield , Ran El-Yaniv

Using Depth for Improving Referring Expression Comprehension in Real-World Environments

In a human-robot collaborative task where a robot helps its partner by finding described objects, the depth dimension plays a critical role in successful task completion. Existing studies have mostly focused on comprehending the object…

Robotics · Computer Science 2021-07-13 Fethiye Irmak Dogan , Iolanda Leite

Perceptual deep depth super-resolution

RGBD images, combining high-resolution color and lower-resolution depth from various types of depth sensors, are increasingly common. One can significantly improve the resolution of depth maps by taking advantage of color information; deep…

Computer Vision and Pattern Recognition · Computer Science 2019-09-10 Oleg Voynov , Alexey Artemov , Vage Egiazarian , Alexander Notchenko , Gleb Bobrovskikh , Denis Zorin , Evgeny Burnaev

An Empirical Analysis of Visual Features for Multiple Object Tracking in Urban Scenes

This paper addresses the problem of selecting appearance features for multiple object tracking (MOT) in urban scenes. Over the years, a large number of features has been used for MOT. However, it is not clear whether some of them are better…

Computer Vision and Pattern Recognition · Computer Science 2020-10-16 Mehdi Miah , Justine Pepin , Nicolas Saunier , Guillaume-Alexandre Bilodeau

DisPlacing Objects: Improving Dynamic Vehicle Detection via Visual Place Recognition under Adverse Conditions

Can knowing where you are assist in perceiving objects in your surroundings, especially under adverse weather and lighting conditions? In this work we investigate whether a prior map can be leveraged to aid in the detection of dynamic…

Computer Vision and Pattern Recognition · Computer Science 2023-07-03 Stephen Hausler , Sourav Garg , Punarjay Chakravarty , Shubham Shrivastava , Ankit Vora , Michael Milford

Visual Relationship Prediction via Label Clustering and Incorporation of Depth Information

In this paper, we investigate the use of an unsupervised label clustering technique and demonstrate that it enables substantial improvements in visual relationship prediction accuracy on the Person in Context (PIC) dataset. We propose to…

Computer Vision and Pattern Recognition · Computer Science 2018-09-11 Hsuan-Kung Yang , An-Chieh Cheng , Kuan-Wei Ho , Tsu-Jui Fu , Chun-Yi Lee

Generalized Visual Relation Detection with Diffusion Models

Visual relation detection (VRD) aims to identify relationships (or interactions) between object pairs in an image. Although recent VRD models have achieved impressive performance, they are all restricted to pre-defined relation categories,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Kaifeng Gao , Siqi Chen , Hanwang Zhang , Jun Xiao , Yueting Zhuang , Qianru Sun

Large-Scale Visual Relationship Understanding

Large scale visual understanding is challenging, as it requires a model to handle the widely-spread and imbalanced distribution of <subject, relation, object> triples. In real-world scenarios with large numbers of objects and relations,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-20 Ji Zhang , Yannis Kalantidis , Marcus Rohrbach , Manohar Paluri , Ahmed Elgammal , Mohamed Elhoseiny

Support Relation Analysis for Objects in Multiple View RGB-D Images

Understanding physical relations between objects, especially their support relations, is crucial for robotic manipulation. There has been work on reasoning about support relations and structural stability of simple configurations in RGB-D…

Computer Vision and Pattern Recognition · Computer Science 2019-05-13 Peng Zhang , Xiaoyu Ge , Jochen Renz

A Problem Reduction Approach for Visual Relationships Detection

Identifying different objects (man and cup) is an important problem on its own, but identifying the relationship between them (holding) is critical for many real world use cases. This paper describes an approach to reduce a visual…

Computer Vision and Pattern Recognition · Computer Science 2018-09-27 Toshiyuki Fukuzawa

Window-Object Relationship Guided Representation Learning for Generic Object Detections

In existing works that learn representation for object detection, the relationship between a candidate window and the ground truth bounding box of an object is simplified by thresholding their overlap. This paper shows information loss in…

Computer Vision and Pattern Recognition · Computer Science 2015-12-10 Xingyu Zeng , Wanli Ouyang , Xiaogang Wang