English
Related papers

Related papers: Table-Top Scene Analysis Using Knowledge-Supervise…

200 papers

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise…

Computer Vision and Pattern Recognition · Computer Science 2017-09-18 Yikang Li , Wanli Ouyang , Bolei Zhou , Kun Wang , Xiaogang Wang

Scene understanding is a popular and challenging topic in both computer vision and photogrammetry. Scene graph provides rich information for such scene understanding. This paper presents a novel approach to infer such relations and then to…

Computer Vision and Pattern Recognition · Computer Science 2017-11-17 Michael Ying Yang , Wentong Liao , Hanno Ackermann , Bodo Rosenhahn

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Juexiao Zhang , Gao Zhu , Sihang Li , Xinhao Liu , Haorui Song , Xinran Tang , Chen Feng

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg , Helisa Dhamo , Azade Farshad , Sabrina Musatian , Nassir Navab , Federico Tombari

This paper investigates a general framework to discover categories of unlabeled scene images according to their appearances (i.e., textures and structures). We jointly solve the two coupled tasks in an unsupervised manner: (i) classifying…

Computer Vision and Pattern Recognition · Computer Science 2015-02-03 Liang Lin , Ruimao Zhang , Xiaohua Duan

Scene understanding has been of high interest in computer vision. It encompasses not only identifying objects in a scene, but also their relationships within the given context. With this goal, a recent line of works tackles 3D semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Johanna Wald , Helisa Dhamo , Nassir Navab , Federico Tombari

Visual Commonsense Reasoning, which is regarded as one challenging task to pursue advanced visual scene comprehension, has been used to diagnose the reasoning ability of AI systems. However, reliable reasoning requires a good grasp of the…

Computer Vision and Pattern Recognition · Computer Science 2025-01-17 Fan Yuan , Xiaoyuan Fang , Rong Quan , Jing Li , Wei Bi , Xiaogang Xu , Piji Li

Learning from image-text data has demonstrated recent success for many recognition tasks, yet is currently limited to visual features or individual visual concepts such as objects. In this paper, we propose one of the first methods that…

Computer Vision and Pattern Recognition · Computer Science 2021-09-07 Yiwu Zhong , Jing Shi , Jianwei Yang , Chenliang Xu , Yin Li

We introduce the Contextual Graph Markov Model, an approach combining ideas from generative models and neural networks for the processing of graph data. It founds on a constructive methodology to build a deep architecture comprising layers…

Machine Learning · Computer Science 2019-11-26 Davide Bacciu , Federico Errica , Alessio Micheli

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu , Yuke Zhu , Christopher B. Choy , Li Fei-Fei

A major challenge in scene graph classification is that the appearance of objects and relations can be significantly different from one image to another. Previous works have addressed this by relational reasoning over all objects in an…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Sahand Sharifzadeh , Sina Moayed Baharlou , Volker Tresp

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner , Poulami Sinhamahapatra , Volker Tresp

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Scene understanding is a fundamental capability needed in many domains, ranging from question-answering to robotics. Unlike recent end-to-end approaches that must explicitly learn varying compositions of the same scene, our method reasons…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 FNU Aryan , Simon Stepputtis , Sarthak Bhagat , Joseph Campbell , Kwonjoon Lee , Hossein Nourkhiz Mahjoub , Katia Sycara

This work establishes the concept of commonsense scene composition, with a focus on extending Belief Scene Graphs by estimating the spatial distribution of unseen objects. Specifically, the commonsense scene composition capability refers to…

Generating realistic images from scene graphs asks neural networks to be able to reason about object relationships and compositionality. As a relatively new task, how to properly ensure the generated images comply with scene graphs or how…

Computer Vision and Pattern Recognition · Computer Science 2019-01-17 Subarna Tripathi , Anahita Bhiwandiwalla , Alexei Bastidas , Hanlin Tang

Scene graph representations, which form a graph of visual object nodes together with their attributes and relations, have proved useful across a variety of vision and language applications. Recent work in the area has used Natural Language…

Computation and Language · Computer Science 2019-09-16 Martin Andrews , Yew Ken Chia , Sam Witteveen

This paper introduces key machine learning operations that allow the realization of robust, joint 6D pose estimation of multiple instances of objects either densely packed or in unstructured piles from RGB-D data. The first objective is to…

Robotics · Computer Science 2019-10-14 Chaitanya Mitash , Bowen Wen , Kostas Bekris , Abdeslam Boularias

In this paper, we propose a generalizable method that systematically combines data driven MCMC samplingand inference using rule-based context knowledge for data abstraction. In particular, we demonstrate the usefulness of our method in the…

Computer Vision and Pattern Recognition · Computer Science 2020-02-26 Ziyuan Liu , Georg von Wichert

Generating scene graph to describe all the relations inside an image gains increasing interests these years. However, most of the previous methods use complicated structures with slow inference speed or rely on the external data, which…

Computer Vision and Pattern Recognition · Computer Science 2018-08-28 Yikang Li , Wanli Ouyang , Bolei Zhou , Jianping Shi , Chao Zhang , Xiaogang Wang
‹ Prev 1 2 3 10 Next ›