Related papers: Universal Scene Graph Generation

200 papers

Deep learning techniques have led to remarkable breakthroughs in the field of generic object detection and have spawned a lot of scene-understanding tasks in recent years. Scene graph has been the focus of research because of its powerful…

Computer Vision and Pattern Recognition · Computer Science 2022-06-23 Guangming Zhu , Liang Zhang , Youliang Jiang , Yixuan Dang , Haoran Hou , Peiyi Shen , Mingtao Feng , Xia Zhao , Qiguang Miao , Syed Afaq Ali Shah , Mohammed Bennamoun

From a computer science viewpoint, a surgical domain model needs to be a conceptual one incorporating both behavior and data. It should therefore model actors, devices, tools, their complex interactions and data flow. To capture and model…

Computer Vision and Pattern Recognition · Computer Science 2021-06-30 Ege Özsoy , Evin Pınar Örnek , Ulrich Eck , Federico Tombari , Nassir Navab

Existing Unbiased Scene Graph Generation (USGG) methods only focus on addressing the predicate-level imbalance that high-frequency classes dominate predictions of rare ones, while overlooking the concept-level imbalance. Actually, even if…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Xinyu Lyu , Lianli Gao , Junlin Xie , Pengpeng Zeng , Yulu Tian , Jie Shao , Heng Tao Shen

Scene graph generation (SGG) endeavors to predict visual relationships between pairs of objects within an image. Prevailing SGG methods traditionally assume a one-off learning process for SGG. This conventional paradigm may necessitate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Tao He , Tongtong Wu , Dongyang Zhang , Guiduo Duan , Ke Qin , Yuan-Fang Li

Training Scene Graph Generation (SGG) models with natural language captions has become increasingly popular due to the abundant, cost-effective, and open-world generalization supervision signals that natural language offers. However, such…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Zuyao Chen , Jinlin Wu , Zhen Lei , Zhaoxiang Zhang , Changwen Chen

Despite recent advancements in single-domain or single-object image generation, it is still challenging to generate complex scenes containing diverse, multiple objects and their interactions. Scene graphs, composed of nodes as objects and…

Computer Vision and Pattern Recognition · Computer Science 2021-08-13 Sarthak Garg , Helisa Dhamo , Azade Farshad , Sabrina Musatian , Nassir Navab , Federico Tombari

A proper scene representation is central to the pursuit of spatial intelligence where agents can robustly reconstruct and efficiently understand 3D scenes. A scene representation is either metric, such as landmark maps in 3D reconstruction,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Juexiao Zhang , Gao Zhu , Sihang Li , Xinhao Liu , Haorui Song , Xinran Tang , Chen Feng

Graph based representation has been widely used in modelling spatio-temporal relationships in video understanding. Although effective, existing graph-based approaches focus on capturing the human-object relationships while ignoring…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Chinthani Sugandhika , Chen Li , Deepu Rajan , Basura Fernando

Scene Graph Generation (SGG) unifies object localization and visual relationship reasoning by predicting boxes and subject-predicate-object triples. Yet most pipelines treat SGG as a one-shot, deterministic classification problem rather…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Xin Hu , Ke Qin , Wen Yin , Yuan-Fang Li , Ming Li , Tao He

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder , Xuming He

Along with generative AI, interest in scene graph generation (SGG), which comprehensively captures the relationships and interactions between objects in an image and creates a structured graph-based representation, has significantly…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Hyeongjin Kim , Sangwon Kim , Jong Taek Lee , Byoung Chul Ko

Image-text retrieval of natural scenes has been a popular research topic. Since image and text are heterogeneous cross-modal data, one of the key challenges is how to learn comprehensive yet unified representations to express the…

Computer Vision and Pattern Recognition · Computer Science 2019-10-14 Sijin Wang , Ruiping Wang , Ziwei Yao , Shiguang Shan , Xilin Chen

Scene graphs provide a rich, structured representation of a scene by encoding the entities (objects) and their spatial relationships in a graphical format. This representation has proven useful in several tasks, such as question answering,…

Computer Vision and Pattern Recognition · Computer Science 2022-12-01 Sanjoy Kundu , Sathyanarayanan N. Aakur

Recent advancements in text-to-image generation have been propelled by the development of diffusion models and multi-modality learning. However, since text is typically represented sequentially in these models, it often falls short in…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Guibao Shen , Luozhou Wang , Jiantao Lin , Wenhang Ge , Chaozhe Zhang , Xin Tao , Yuan Zhang , Pengfei Wan , Zhongyuan Wang , Guangyong Chen , Yijun Li , Ying-Cong Chen

Scene Graph Generation (SGG) serves as a comprehensive representation of images for human understanding as well as visual understanding tasks. Due to the long-tail bias problem of the object and predicate labels in the available annotated…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Anh Duc Bui , Soyeon Caren Han , Josiah Poon

Despite the impressive progress on understanding and generating images shown by the recent unified architectures, the integration of 3D tasks remains challenging and largely unexplored. In this paper, we introduce UniUGG, the first unified…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Yueming Xu , Jiahui Zhang , Ze Huang , Yurui Chen , Yanpeng Zhou , Zhenyu Chen , Yu-Jie Yuan , Pengxiang Xia , Guowei Huang , Xinyue Cai , Zhongang Qi , Xingyue Quan , Jianye Hao , Hang Xu , Li Zhang

Recent advancements in 3D Gaussian Splatting (3DGS) have significantly improved semantic scene understanding, enabling natural language queries to localize objects within a scene. However, existing methods primarily focus on embedding…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Xihan Wang , Dianyi Yang , Yu Gao , Yufeng Yue , Yi Yang , Mengyin Fu

Scene graph generation (SGG) analyzes images to extract meaningful information about objects and their relationships. In the dynamic visual world, it is crucial for AI systems to continuously detect new objects and establish their…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Naitik Khandelwal , Xiao Liu , Mengmi Zhang

Scene graph is a structured representation of a scene that can clearly express the objects, attributes, and relationships between objects in the scene. As computer vision technology continues to develop, people are no longer satisfied with…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Xiaojun Chang , Pengzhen Ren , Pengfei Xu , Zhihui Li , Xiaojiang Chen , Alex Hauptmann

3D visual grounding aims to localize the unique target described by natural languages in 3D scenes. The significant gap between 3D and language modalities makes it a notable challenge to distinguish multiple similar objects through the…

Computer Vision and Pattern Recognition · Computer Science 2025-08-18 Feng Xiao , Hongbin Xu , Guocan Zhao , Wenxiong Kang