Related papers: Segmentation-grounded Scene Graph Generation

200 papers

Object detection, scene graph generation and region captioning, which are three scene understanding tasks at different semantic levels, are tied together: scene graphs are generated on top of objects detected in an image with their pairwise…

Computer Vision and Pattern Recognition · Computer Science 2017-09-18 Yikang Li , Wanli Ouyang , Bolei Zhou , Kun Wang , Xiaogang Wang

We introduce a novel framework to build a model that can learn how to segment objects from a collection of images without any human annotation. Our method builds on the observation that the location of object segments can be perturbed…

Computer Vision and Pattern Recognition · Computer Science 2019-11-05 Adam Bielski , Paolo Favaro

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Scene Graph Generation has gained much attention in computer vision research with the growing demand in image understanding projects like visual question answering, image captioning, self-driving cars, crowd behavior analysis, activity…

Computer Vision and Pattern Recognition · Computer Science 2021-11-29 Vishal Kumar , Albert Mundu , Satish Kumar Singh

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg , Helisa Dhamo , Azade Farshad , Sabrina Musatian , Nassir Navab , Federico Tombari

The visual world we sense, interpret and interact with every day is a complex composition of interleaved physical entities. Therefore, it is a very challenging task to generate vivid scenes of similar complexity using computers. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2019-02-05 Mehmet Ozgur Turkoglu , William Thong , Luuk Spreeuwers , Berkay Kicanaoglu

Understanding a scene by decoding the visual relationships depicted in an image has been a long studied problem. While the recent advances in deep learning and the usage of deep neural networks have achieved near human accuracy on many…

Computer Vision and Pattern Recognition · Computer Science 2020-05-19 Aniket Agarwal , Ayush Mangal , Vipul

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu , Yuke Zhu , Christopher B. Choy , Li Fei-Fei

Scene graph generation is a sophisticated task because there is no specific recognition pattern (e.g., "looking at" and "near" have no conspicuous difference concerning vision, whereas "near" could occur between entities with different…

Computer Vision and Pattern Recognition · Computer Science 2022-03-18 Xiaoguang Chang , Teng Wang , Changyin Sun , Wenzhe Cai

The task of scene graph generation entails identifying object entities and their corresponding interaction predicates in a given image (or video). Due to the combinatorially large solution space, existing approaches to scene graph…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Siddhesh Khandelwal , Leonid Sigal

Research in scene graph generation has quickly gained traction in the past few years because of its potential to help in downstream tasks like visual question answering, image captioning, etc. Many interesting approaches have been proposed…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Sandeep Inuganti , Vineeth N Balasubramanian

3D point cloud segmentation aims to assign semantic labels to individual points in a scene for fine-grained spatial understanding. Existing methods typically adopt data augmentation to alleviate the burden of large-scale annotation.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Hongbin Lin , Yifan Jiang , Juangui Xu , Jesse Jiaxi Xu , Yi Lu , Zhengyu Hu , Ying-Cong Chen , Hao Wang

We propose an end-to-end learning framework for segmenting generic objects in both images and videos. Given a novel image or video, our approach produces a pixel-level mask for all "object-like" regions, even for object categories never…

Computer Vision and Pattern Recognition · Computer Science 2018-12-19 Bo Xiong , Suyog Dutt Jain , Kristen Grauman

There is a surge of interest in image scene graph generation (object, attribute and relationship detection) due to the need of building fine-grained image understanding models that go beyond object detection. Due to the lack of a good…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Xiaotian Han , Jianwei Yang , Houdong Hu , Lei Zhang , Jianfeng Gao , Pengchuan Zhang

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their interaction relationships. Despite the recent success in object…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Mengshi Qi , Weijian Li , Zhengyuan Yang , Yunhong Wang , Jiebo Luo

Panoptic Scene Graph has recently been proposed for comprehensive scene understanding. However, previous works adopt a fully-supervised learning manner, requiring large amounts of pixel-wise densely-annotated data, which is always tedious…

Computer Vision and Pattern Recognition · Computer Science 2025-04-11 Chengyang Zhao , Yikang Shen , Zhenfang Chen , Mingyu Ding , Chuang Gan

Scene graph generation has received growing attention with the advancements in image understanding tasks such as object detection, attributes and relationship prediction, etc. However, existing datasets are biased in terms of object and…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Jiuxiang Gu , Handong Zhao , Zhe Lin , Sheng Li , Jianfei Cai , Mingyang Ling

Scene parsing, or semantic segmentation, consists in labeling each pixel in an image with the category of the object it belongs to. It is a challenging task that involves the simultaneous detection, segmentation and recognition of all the…

Computer Vision and Pattern Recognition · Computer Science 2015-06-09 Clément Farabet , Camille Couprie , Laurent Najman , Yann LeCun

Driven by successes in deep learning, computer vision research has begun to move beyond object detection and image classification to more sophisticated tasks like image captioning or visual question answering. Motivating such endeavors is…

Computer Vision and Pattern Recognition · Computer Science 2018-02-09 Matthew Klawonn , Eric Heim

We propose an end-to-end learning framework for generating foreground object segmentations. Given a single novel image, our approach produces pixel-level masks for all "object-like" regions, even for object categories never seen during…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Suyog Dutt Jain , Bo Xiong , Kristen Grauman