English
Related papers

Related papers: Learning Visual Commonsense for Robust Scene Graph…

200 papers

Visual Commonsense Reasoning, which is regarded as one challenging task to pursue advanced visual scene comprehension, has been used to diagnose the reasoning ability of AI systems. However, reliable reasoning requires a good grasp of the…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Fan Yuan , Xiaoyuan Fang , Rong Quan , Jing Li , Wei Bi , Xiaogang Xu , Piji Li

This work introduces an enhanced approach to generating scene graphs by incorporating both a relationship hierarchy and commonsense knowledge. Specifically, we begin by proposing a hierarchical relation head that exploits an informative…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Bowen Jiang , Zhijun Zhuang , Shreyas S. Shivakumar , Camillo J. Taylor

Scene graph generation has received growing attention with the advancements in image understanding tasks such as object detection, attributes and relationship prediction,~\etc. However, existing datasets are biased in terms of object and…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Jiuxiang Gu , Handong Zhao , Zhe Lin , Sheng Li , Jianfei Cai , Mingyang Ling

Text-based games are becoming commonly used in reinforcement learning as real-world simulation environments. They are usually imperfect information games, and their interactions are only in the textual modality. To challenge these games, it…

Computer Vision and Pattern Recognition · Computer Science 2022-10-26 Tsunehiko Tanaka , Daiki Kimura , Michiaki Tatsubori

Scene graphs are powerful representations that parse images into their abstract semantic elements, i.e., objects and their interactions, which facilitates visual comprehension and explainable reasoning. On the other hand, commonsense…

Computer Vision and Pattern Recognition · Computer Science 2020-07-21 Alireza Zareian , Svebor Karaman , Shih-Fu Chang

This work establishes the concept of commonsense scene composition, with a focus on extending Belief Scene Graphs by estimating the spatial distribution of unseen objects. Specifically, the commonsense scene composition capability refers to…

Controllable scene synthesis aims to create interactive environments for various industrial use cases. Scene graphs provide a highly suitable interface to facilitate these applications by abstracting the scene context in a compact manner.…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Guangyao Zhai , Evin Pınar Örnek , Shun-Cheng Wu , Yan Di , Federico Tombari , Nassir Navab , Benjamin Busam

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang , Justin Dauwels , Yilun Du

Humans use natural language to compose common concepts from their environment into plausible, day-to-day scene descriptions. However, such generative commonsense reasoning (GCSR) skills are lacking in state-of-the-art text generation…

Computation and Language · Computer Science 2022-03-09 PeiFeng Wang , Jonathan Zamora , Junfeng Liu , Filip Ilievski , Muhao Chen , Xiang Ren

Answering complex questions about images is an ambitious goal for machine intelligence, which requires a joint understanding of images, text, and commonsense knowledge, as well as a strong reasoning ability. Recently, multimodal…

Computer Vision and Pattern Recognition · Computer Science 2021-12-17 Zhecan Wang , Haoxuan You , Liunian Harold Li , Alireza Zareian , Suji Park , Yiqing Liang , Kai-Wei Chang , Shih-Fu Chang

As a structured representation of the image content, the visual scene graph (visual relationship) acts as a bridge between computer vision and natural language processing. Existing models on the scene graph generation task notoriously…

Computer Vision and Pattern Recognition · Computer Science 2022-03-01 Yuyu Guo , Jingkuan Song , Lianli Gao , Heng Tao Shen

Generating realistic images from scene graphs asks neural networks to be able to reason about object relationships and compositionality. As a relatively new task, how to properly ensure the generated images comply with scene graphs or how…

Computer Vision and Pattern Recognition · Computer Science 2019-01-17 Subarna Tripathi , Anahita Bhiwandiwalla , Alexei Bastidas , Hanlin Tang

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu , Yuke Zhu , Christopher B. Choy , Li Fei-Fei

A major challenge in scene graph classification is that the appearance of objects and relations can be significantly different from one image to another. Previous works have addressed this by relational reasoning over all objects in an…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Sahand Sharifzadeh , Sina Moayed Baharlou , Volker Tresp

We propose an efficient and interpretable scene graph generator. We consider three types of features: visual, spatial and semantic, and we use a late fusion strategy such that each feature's contribution can be explicitly investigated. We…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Ji Zhang , Kevin Shih , Andrew Tao , Bryan Catanzaro , Ahmed Elgammal

3D scene graph prediction is a task that aims to concurrently predict object classes and their relationships within a 3D environment. As these environments are primarily designed by and for humans, incorporating commonsense knowledge…

Robotics · Computer Science 2023-08-15 Yiding Qiu , Henrik I. Christensen

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner , Poulami Sinhamahapatra , Volker Tresp

Research in scene graph generation has quickly gained traction in the past few years because of its potential to help in downstream tasks like visual question answering, image captioning, etc. Many interesting approaches have been proposed…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Sandeep Inuganti , Vineeth N Balasubramanian

Scene graph generation (SGG) endeavors to predict visual relationships between pairs of objects within an image. Prevailing SGG methods traditionally assume a one-off learning process for SGG. This conventional paradigm may necessitate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Tao He , Tongtong Wu , Dongyang Zhang , Guiduo Duan , Ke Qin , Yuan-Fang Li
‹ Prev 1 2 3 10 Next ›