English
Related papers

Related papers: Compact Scene Graphs for Layout Composition and Pa…

200 papers

Generating realistic images from scene graphs asks neural networks to be able to reason about object relationships and compositionality. As a relatively new task, how to properly ensure the generated images comply with scene graphs or how…

Computer Vision and Pattern Recognition · Computer Science 2019-01-17 Subarna Tripathi , Anahita Bhiwandiwalla , Alexei Bastidas , Hanlin Tang

Scene graphs are a powerful structured representation of the underlying content of images, and embeddings derived from them have been shown to be useful in multiple downstream tasks. In this work, we employ a graph convolutional network to…

Computer Vision and Pattern Recognition · Computer Science 2021-04-07 Paridhi Maheshwari , Ritwick Chaudhry , Vishwa Vinay

Scene graphs are a compact and explicit representation successfully used in a variety of 2D scene understanding tasks. This work proposes a method to incrementally build up semantic scene graphs from a 3D environment given a sequence of…

Computer Vision and Pattern Recognition · Computer Science 2021-04-01 Shun-Cheng Wu , Johanna Wald , Keisuke Tateno , Nassir Navab , Federico Tombari

Scene graph is a structured representation of a scene that can clearly express the objects, attributes, and relationships between objects in the scene. As computer vision technology continues to develop, people are no longer satisfied with…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Xiaojun Chang , Pengzhen Ren , Pengfei Xu , Zhihui Li , Xiaojiang Chen , Alex Hauptmann

As a scene graph compactly summarizes the high-level content of an image in a structured and symbolic manner, the similarity between scene graphs of two images reflects the relevance of their contents. Based on this idea, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2021-01-01 Sangwoong Yoon , Woo Young Kang , Sungwook Jeon , SeongEun Lee , Changjin Han , Jonghun Park , Eun-Sol Kim

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal , Ayush Mangal , Vipul

Recent advances in computer vision facilitate fully automatic extraction of object-centric relational representations from visual-inertial data. These state representations, dubbed 3D scene graphs, are a hierarchical decomposition of…

Robotics · Computer Science 2026-03-31 Christopher Agia

Training scene graph classification models requires a large amount of annotated image data. Meanwhile, scene graphs represent relational knowledge that can be modeled with symbolic data from texts or knowledge graphs. While image annotation…

Computer Vision and Pattern Recognition · Computer Science 2021-10-11 Sahand Sharifzadeh , Sina Moayed Baharlou , Martin Schmitt , Hinrich Schütze , Volker Tresp

Generating scene graph to describe all the relations inside an image gains increasing interests these years. However, most of the previous methods use complicated structures with slow inference speed or rely on the external data, which…

Computer Vision and Pattern Recognition · Computer Science 2018-08-28 Yikang Li , Wanli Ouyang , Bolei Zhou , Jianping Shi , Chao Zhang , Xiaogang Wang

In this paper, we study the problem of parsing structured knowledge graphs from textual descriptions. In particular, we consider the scene graph representation that considers objects together with their attributes and relations: this…

Computation and Language · Computer Science 2018-03-28 Yu-Siang Wang , Chenxi Liu , Xiaohui Zeng , Alan Yuille

Generating realistic images of complex visual scenes becomes challenging when one wishes to control the structure of the generated images. Previous approaches showed that scenes with few entities can be controlled using scene graphs, but…

Computer Vision and Pattern Recognition · Computer Science 2020-08-25 Roei Herzig , Amir Bar , Huijuan Xu , Gal Chechik , Trevor Darrell , Amir Globerson

Contrastively trained vision-language models have achieved remarkable progress in vision and language representation learning, leading to state-of-the-art models for various downstream multimodal tasks. However, recent research has…

Computation and Language · Computer Science 2023-10-26 Harman Singh , Pengchuan Zhang , Qifan Wang , Mengjiao Wang , Wenhan Xiong , Jingfei Du , Yu Chen

A structured query can capture the complexity of object interactions (e.g. 'woman rides motorcycle') unlike single objects (e.g. 'woman' or 'motorcycle'). Retrieval using structured queries therefore is much more useful than single object…

Computer Vision and Pattern Recognition · Computer Science 2020-05-15 Brigit Schroeder , Subarna Tripathi

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Inferring objects and their relationships from an image in the form of a scene graph is useful in many applications at the intersection of vision and language. We consider a challenging problem of compositional generalization that emerges…

Computer Vision and Pattern Recognition · Computer Science 2021-10-04 Boris Knyazev , Harm de Vries , Cătălina Cangea , Graham W. Taylor , Aaron Courville , Eugene Belilovsky

Scene understanding is a popular and challenging topic in both computer vision and photogrammetry. Scene graph provides rich information for such scene understanding. This paper presents a novel approach to infer such relations and then to…

Computer Vision and Pattern Recognition · Computer Science 2017-11-17 Michael Ying Yang , Wentong Liao , Hanno Ackermann , Bodo Rosenhahn

Visual scenes are composed of visual concepts and have the property of combinatorial explosion. An important reason for humans to efficiently learn from diverse visual scenes is the ability of compositional perception, and it is desirable…

Machine Learning · Computer Science 2023-06-16 Jinyang Yuan , Tonglin Chen , Bin Li , Xiangyang Xue

Many top-performing image captioning models rely solely on object features computed with an object detection model to generate image descriptions. However, recent studies propose to directly use scene graphs to introduce information about…

Computer Vision and Pattern Recognition · Computer Science 2020-10-28 Victor Milewski , Marie-Francine Moens , Iacer Calixto

Scene graphs provide valuable information to many downstream tasks. Many scene graph generation (SGG) models solely use the limited annotated relation triples for training, leading to their underperformance on low-shot (few and zero)…

Computer Vision and Pattern Recognition · Computer Science 2021-08-20 Tao He , Lianli Gao , Jingkuan Song , Jianfei Cai , Yuan-Fang Li

The ability for computational agents to reason about the high-level content of real world scene images is important for many applications. Existing attempts at addressing the problem of complex scene understanding lack representational…

Computer Vision and Pattern Recognition · Computer Science 2018-02-20 Zachary A. Daniels , Dimitris N. Metaxas
‹ Prev 1 2 3 10 Next ›