Related papers: Structured Generative Models for Scene Understandi…

200 papers

Generative models have demonstrated remarkable abilities in generating high-fidelity visual content. In this work, we explore how generative models can further be used not only to synthesize visual content but also to understand the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Yanbo Wang, Justin Dauwels, Yilun Du

Scene graph is a structured representation of a scene that can clearly express the objects, attributes, and relationships between objects in the scene. As computer vision technology continues to develop, people are no longer satisfied with…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Xiaojun Chang, Pengzhen Ren, Pengfei Xu, Zhihui Li, Xiaojiang Chen, Alex Hauptmann

A scene graph is a semantic representation that expresses the objects, attributes, and relationships between objects in a scene. Scene graphs play an important role in many cross modality tasks, as they are able to capture the interactions…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Xuming Hu, Zhijiang Guo, Yu Fu, Lijie Wen, Philip S. Yu

Understanding a visual scene incorporates objects, relationships, and context. Traditional methods working on an image mostly focus on object detection and fail to capture the relationship between the objects. Relationships can give rich…

Computer Vision and Pattern Recognition · Computer Science 2019-12-03 Himangi Mittal, Ajith Abraham, Anuja Arora

A comprehensive semantic understanding of a scene is important for many applications - but in what space should diverse semantic information (e.g., objects, scene categories, material types, texture, etc.) be grounded and what should be its…

Computer Vision and Pattern Recognition · Computer Science 2019-10-08 Iro Armeni, Zhi-Yang He, JunYoung Gwak, Amir R. Zamir, Martin Fischer, Jitendra Malik, Silvio Savarese

Recent approaches on visual scene understanding attempt to build a scene graph -- a computational representation of objects and their pairwise relationships. Such rich semantic representation is very appealing, yet difficult to obtain from…

Computer Vision and Pattern Recognition · Computer Science 2018-11-08 Paul Gay, Stuart James, Alessio Del Bue

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer, Andreas Geiger

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg, Helisa Dhamo, Azade Farshad, Sabrina Musatian, Nassir Navab, Federico Tombari

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu, Sathyanarayanan N. Aakur

Scene graph generation (SGG) endeavors to predict visual relationships between pairs of objects within an image. Prevailing SGG methods traditionally assume a one-off learning process for SGG. This conventional paradigm may necessitate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Tao He, Tongtong Wu, Dongyang Zhang, Guiduo Duan, Ke Qin, Yuan-Fang Li

We present a deep generative scene modeling technique for indoor environments. Our goal is to train a generative model using a feed-forward neural network that maps a prior distribution (e.g., a normal distribution) to the distribution of…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 Zaiwei Zhang, Zhenpei Yang, Chongyang Ma, Linjie Luo, Alexander Huth, Etienne Vouga, Qixing Huang

We propose an efficient and interpretable scene graph generator. We consider three types of features: visual, spatial and semantic, and we use a late fusion strategy such that each feature's contribution can be explicitly investigated. We…

Computer Vision and Pattern Recognition · Computer Science 2018-11-26 Ji Zhang, Kevin Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal

Despite the great success object detection and segmentation models have achieved in recognizing individual objects in images, performance on cognitive tasks such as image caption, semantic image retrieval, and visual QA is far from…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Weilin Cong, William Wang, Wang-Chien Lee

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu, Liang Zhang, Youliang Jiang, Yixuan Dang, Haoran Hou, Peiyi Shen, Mingtao Feng, Xia Zhao, Qiguang Miao, Syed Afaq Ali Shah, Mohammed Bennamoun

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Juexiao Zhang, Gao Zhu, Sihang Li, Xinhao Liu, Haorui Song, Xinran Tang, Chen Feng

As the intermediate-level representations bridging the two levels, structured representations of visual scenes, such as visual relationships between pairwise objects, have been shown to not only benefit compositional models in learning to…

Computer Vision and Pattern Recognition · Computer Science 2022-07-12 Meng-Jiun Chiou

Manipulating images of complex scenes to reconstruct, insert and/or remove specific object instances is a challenging task. Complex scenes contain multiple semantics and objects, which are frequently cluttered or ambiguous, thus hampering…

Computer Vision and Pattern Recognition · Computer Science 2020-10-20 Pierfrancesco Ardino, Yahui Liu, Elisa Ricci, Bruno Lepri, Marco De Nadai

In the field of action recognition, video clips are always treated as ordered frames for subsequent processing. To achieve spatio-temporal perception, existing approaches propose to embed adjacent temporal interaction in the convolutional…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Rongchang Li, Xiao-Jun Wu, Tianyang Xu

Machine understanding of complex images is a key goal of artificial intelligence. One challenge underlying this task is that visual scenes contain multiple inter-related objects, and that global context plays an important role in…

Machine Learning · Statistics 2018-11-05 Roei Herzig, Moshiko Raboh, Gal Chechik, Jonathan Berant, Amir Globerson