Related papers: Agent-Centric Relation Graph for Object Visual Nav…

Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation

Given an object of interest, visual navigation aims to reach the object's location based on a sequence of partial observations. To this end, an agent needs to 1) learn a piece of certain knowledge about the relations of object categories in…

Computer Vision and Pattern Recognition · Computer Science 2023-12-07 Xiaobo Hu , Youfang Lin , HeHe Fan , Shuo Wang , Zhihao Wu , Kai Lv

Learning Object Relation Graph and Tentative Policy for Visual Navigation

Target-driven visual navigation aims at navigating an agent towards a given target based on the observation of the agent. In this task, it is critical to learn informative visual representation and robust navigation policy. Aiming to…

Computer Vision and Pattern Recognition · Computer Science 2020-07-23 Heming Du , Xin Yu , Liang Zheng

Aligning Knowledge Graph with Visual Perception for Object-goal Navigation

Object-goal navigation is a challenging task that requires guiding an agent to specific objects based on first-person visual observations. The ability of agent to comprehend its surroundings plays a crucial role in achieving successful…

Computer Vision and Pattern Recognition · Computer Science 2024-04-29 Nuo Xu , Wen Wang , Rong Yang , Mengjie Qin , Zheyuan Lin , Wei Song , Chunlong Zhang , Jason Gu , Chao Li

Spatial Relation Graph and Graph Convolutional Network for Object Goal Navigation

This paper describes a framework for the object-goal navigation task, which requires a robot to find and move to the closest instance of a target object class from a random starting position. The framework uses a history of robot…

Robotics · Computer Science 2022-08-30 D. A. Sasi Kiran , Kritika Anand , Chaitanya Kharyal , Gulshan Kumar , Nandiraju Gireesh , Snehasis Banerjee , Ruddra dev Roychoudhury , Mohan Sridharan , Brojeshwar Bhowmick , Madhava Krishna

Language and Visual Entity Relationship Graph for Agent Navigation

Vision-and-Language Navigation (VLN) requires an agent to navigate in a real-world environment following natural language instructions. From both the textual and visual perspectives, we find that the relationships among the scene, its…

Computer Vision and Pattern Recognition · Computer Science 2020-12-29 Yicong Hong , Cristian Rodriguez-Opazo , Yuankai Qi , Qi Wu , Stephen Gould

Hierarchical Object-to-Zone Graph for Object Navigation

The goal of object navigation is to reach the expected objects according to visual information in the unseen environments. Previous works usually implement deep models to train an agent to predict actions in real-time. However, in the…

Computer Vision and Pattern Recognition · Computer Science 2021-09-10 Sixian Zhang , Xinhang Song , Yubing Bai , Weijie Li , Yakui Chu , Shuqiang Jiang

Zero-Shot Object Goal Visual Navigation With Class-Independent Relationship Network

This paper investigates the zero-shot object goal visual navigation problem. In the object goal visual navigation task, the agent needs to locate navigation targets from its egocentric visual input. "Zero-shot" means that the target the…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 Xinting Li , Shiguang Zhang , Yue LU , Kerry Dang , Lingyan Ran

DRG: Dual Relation Graph for Human-Object Interaction Detection

We tackle the challenging problem of human-object interaction (HOI) detection. Existing methods either recognize the interaction of each human-object pair in isolation or perform joint inference based on complex appearance-based features.…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Chen Gao , Jiarui Xu , Yuliang Zou , Jia-Bin Huang

AVR: Attention based Salient Visual Relationship Detection

Visual relationship detection aims to locate objects in images and recognize the relationships between objects. Traditional methods treat all observed relationships in an image equally, which causes a relatively poor performance in the…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Jianming Lv , Qinzhe Xiao , Jiajie Zhong

Graph based Environment Representation for Vision-and-Language Navigation in Continuous Environments

Vision-and-Language Navigation in Continuous Environments (VLN-CE) is a navigation task that requires an agent to follow a language instruction in a realistic environment. The understanding of environments is a crucial part of the VLN-CE…

Computer Vision and Pattern Recognition · Computer Science 2023-01-12 Ting Wang , Zongkai Wu , Feiyu Yao , Donglin Wang

Relationship Oriented Affordance Learning through Manipulation Graph Construction

In this paper, we propose Manipulation Relationship Graph (MRG), a novel affordance representation which captures the underlying manipulation relationships of an arbitrary scene. To construct such a graph from raw visual observations, a…

Robotics · Computer Science 2021-11-02 Chao Tang , Jingwen Yu , Weinan Chen , Hong Zhang

VTNet: Visual Transformer Network for Object Goal Navigation

Object goal navigation aims to steer an agent towards a target object based on observations of the agent. It is of pivotal importance to design effective visual representations of the observed scene in determining navigation actions. In…

Computer Vision and Pattern Recognition · Computer Science 2021-05-21 Heming Du , Xin Yu , Liang Zheng

Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph

We present a novel two-layer hierarchical reinforcement learning approach equipped with a Goals Relational Graph (GRG) for tackling the partially observable goal-driven task, such as goal-driven visual navigation. Our GRG captures the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Xin Ye , Yezhou Yang

RSG-Net: Towards Rich Sematic Relationship Prediction for Intelligent Vehicle in Complex Environments

Behavioral and semantic relationships play a vital role on intelligent self-driving vehicles and ADAS systems. Different from other research focused on trajectory, position, and bounding boxes, relationship data provides a human…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Yafu Tian , Alexander Carballo , Ruifeng Li , Kazuya Takeda

Exploiting Temporal Relations on Radar Perception for Autonomous Driving

We consider the object recognition problem in autonomous driving using automotive radar sensors. Comparing to Lidar sensors, radar is cost-effective and robust in all-weather conditions for perception in autonomous driving. However, radar…

Computer Vision and Pattern Recognition · Computer Science 2022-04-05 Peizhao Li , Pu Wang , Karl Berntorp , Hongfu Liu

Navigating to Objects in Unseen Environments by Distance Prediction

Object Goal Navigation (ObjectNav) task is to navigate an agent to an object category in unseen environments without a pre-built map. In this paper, we solve this task by predicting the distance to the target using semantically-related…

Robotics · Computer Science 2022-07-14 Minzhao Zhu , Binglei Zhao , Tao Kong

Heterogeneous Trajectory Forecasting via Risk and Scene Graph Learning

Heterogeneous trajectory forecasting is critical for intelligent transportation systems, but it is challenging because of the difficulty of modeling the complex interaction relations among the heterogeneous road agents as well as their…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Jianwu Fang , Chen Zhu , Pu Zhang , Hongkai Yu , Jianru Xue

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner , Poulami Sinhamahapatra , Volker Tresp

Task-Driven Graph Attention for Hierarchical Relational Object Navigation

Embodied AI agents in large scenes often need to navigate to find objects. In this work, we study a naturally emerging variant of the object navigation task, hierarchical relational object navigation (HRON), where the goal is to find…

Artificial Intelligence · Computer Science 2023-06-27 Michael Lingelbach , Chengshu Li , Minjune Hwang , Andrey Kurenkov , Alan Lou , Roberto Martín-Martín , Ruohan Zhang , Li Fei-Fei , Jiajun Wu

Learning Actor Relation Graphs for Group Activity Recognition

Modeling relation between actors is important for recognizing group activity in a multi-person scene. This paper aims at learning discriminative relation between actors efficiently using deep models. To this end, we propose to build a…

Computer Vision and Pattern Recognition · Computer Science 2019-04-24 Jianchao Wu , Limin Wang , Li Wang , Jie Guo , Gangshan Wu