Related papers: Relation Transformer Network

200 papers

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-13 · Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

Objects and their relationships are critical contents for image understanding. A scene graph provides a structured description that captures these properties of an image. However, reasoning about the relationships between objects is very…

Computer Vision and Pattern Recognition · Computer Science · 2018-11-16 · Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon

Different objects in the same scene are more or less related to each other, but only a limited number of these relationships are noteworthy. Inspired by DETR, which excels in object detection, we view scene graph generation as a set…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-18 · Yuren Cong, Michael Ying Yang, Bodo Rosenhahn

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their interaction relationships. Despite the recent success in object…

Computer Vision and Pattern Recognition · Computer Science · 2019-04-09 · Mengshi Qi, Weijian Li, Zhengyuan Yang, Yunhong Wang, Jiebo Luo

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science · 2017-04-13 · Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Network embedding, which aims to learn low-dimensional representations of nodes, has been used for various graph related tasks including visualization, link prediction and node classification. Most existing embedding methods rely solely on…

Social and Information Networks · Computer Science · 2019-08-22 · Palash Goyal, Homa Hosseinmardi, Emilio Ferrara, Aram Galstyan

We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with…

Computer Vision and Pattern Recognition · Computer Science · 2018-08-02 · Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

Graph neural networks (GNNs) have been widely used in representation learning on graphs and achieved state-of-the-art performance in tasks such as node classification and link prediction. However, most existing GNNs are designed to learn…

Machine Learning · Computer Science · 2020-02-06 · Seongjun Yun, Minbyul Jeong, Raehyun Kim, Jaewoo Kang, Hyunwoo J. Kim

Graph Neural Networks (GNNs) have emerged as promising solutions for collaborative filtering (CF) through the modeling of user-item interaction graphs. The nucleus of existing GNN-based recommender systems involves recursive message passing…

Machine Learning · Computer Science · 2024-05-21 · Peiyan Zhang, Yuchen Yan, Xi Zhang, Chaozhuo Li, Senzhang Wang, Feiran Huang, Sunghun Kim

Text detection in scenes based on deep neural networks has shown promising results. Instead of using word bounding box regression, recent state-of-the-art methods have started focusing on character bounding box and pixel-level prediction.…

Machine Learning · Computer Science · 2020-05-26 · Mayank Kumar Singh, Sayan Banerjee, Shubhasis Chaudhuri

In this work, we have proposed an approach for improving the GCN for predicting ratings in social networks. Our model is expanded from the standard model with several layers of transformer architecture. The main focus of the paper is on the…

Machine Learning · Computer Science · 2024-01-15 · Thi Linh Hoang, Tuan Dung Pham, Viet Cuong Ta

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-25 · Zeeshan Hayder, Xuming He

Image captioning is shown to be able to achieve a better performance by using scene graphs to represent the relations of objects in the image. The current captioning encoders generally use a Graph Convolutional Net (GCN) to represent the…

Computer Vision and Pattern Recognition · Computer Science · 2022-07-18 · Xuewen Yang, Yingru Liu, Xin Wang

Scene graphs are nodes and edges consisting of objects and object-object relationships, respectively. Scene graph generation (SGG) aims to identify the objects and their relationships. We propose a bidirectional GRU (BiGRU) transformer…

Computer Vision and Pattern Recognition · Computer Science · 2021-09-14 · Naina Dhingra, Florian Ritter, Andreas Kunz

Understanding a scene in depth involves not only locating/recognizing individual objects but also inferring the relationships and interactions among them. However, since the distribution of real-world relationships is seriously…

Computer Vision and Pattern Recognition · Computer Science · 2019-03-11 · Tianshui Chen, Weihao Yu, Riquan Chen, Liang Lin

A comprehensive representation of an image requires understanding objects and their mutual relationship, especially in image-to-graph generation, e.g., road network extraction, blood-vessel network extraction, or scene graph generation.…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-22 · Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern Menze

Despite decades of research, understanding human manipulation activities is, and has always been, one of the most attractive and challenging research topics in computer vision and robotics. Recognition and prediction of observed human…

Computer Vision and Pattern Recognition · Computer Science · 2021-10-27 · Gamze Akyol, Sanem Sariel, Eren Erdal Aksoy

Recent years have seen a growing interest in Scene Graph Generation (SGG), a comprehensive visual scene understanding task that aims to predict entity relationships using a relation encoder-decoder pipeline stacked on top of an object…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-21 · Gopika Sudhakaran, Devendra Singh Dhami, Kristian Kersting, Stefan Roth

Transformer architectures, capable of capturing sequential dependencies in the history of user interactions, have become the dominant approach in sequential recommender systems. Despite their success, such models consider sequence elements…

Information Retrieval · Computer Science · 2026-03-02 · Artur Gimranov, Viacheslav Yusupov, Elfat Sabitov, Tatyana Matveeva, Anton Lysenko, Ruslan Israfilov, Evgeny Frolov

This paper presents a novel graph-based deep learning model for tasks involving relations between two nodes (edge-centric tasks), where the focus lies on predicting relationships and interactions between pairs of nodes rather than node…

Machine Learning · Computer Science · 2025-07-08 · Eugenio Borzone, Leandro Di Persia, Matias Gerard