Related papers: Relation Transformer Network

Scenes and Surroundings: Scene Graph Generation using Relation Transformer

Identifying objects in an image and their mutual relationships as a scene graph leads to a deep understanding of image content. Despite the recent advancement in deep learning, the detection and labeling of visual object relationships…

Computer Vision and Pattern Recognition · Computer Science 2021-07-13 Rajat Koner, Poulami Sinhamahapatra, Volker Tresp

LinkNet: Relational Embedding for Scene Graph

Objects and their relationships are critical contents for image understanding. A scene graph provides a structured description that captures these properties of an image. However, reasoning about the relationships between objects is very…

Computer Vision and Pattern Recognition · Computer Science 2018-11-16 Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon

RelTR: Relation Transformer for Scene Graph Generation

Different objects in the same scene are more or less related to each other, but only a limited number of these relationships are noteworthy. Inspired by DETR, which excels in object detection, we view scene graph generation as a set…

Computer Vision and Pattern Recognition · Computer Science 2023-04-18 Yuren Cong, Michael Ying Yang, Bodo Rosenhahn

Attentive Relational Networks for Mapping Images to Scene Graphs

Scene graph generation refers to the task of automatically mapping an image into a semantic structural graph, which requires correctly labeling each extracted object and their interaction relationships. Despite the recent success in object…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Mengshi Qi, Weijian Li, Zhengyuan Yang, Yunhong Wang, Jiebo Luo

Scene Graph Generation by Iterative Message Passing

Understanding a visual scene goes beyond recognizing individual objects in isolation. Relationships between objects also constitute rich semantic information about the scene. In this work, we explicitly model the objects and their…

Computer Vision and Pattern Recognition · Computer Science 2017-04-13 Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei

Capturing Edge Attributes via Network Embedding

Network embedding, which aims to learn low-dimensional representations of nodes, has been used for various graph related tasks including visualization, link prediction and node classification. Most existing embedding methods rely solely on…

Social and Information Networks · Computer Science 2019-08-22 Palash Goyal, Homa Hosseinmardi, Emilio Ferrara, Aram Galstyan

Graph R-CNN for Scene Graph Generation

We propose a novel scene graph generation model called Graph R-CNN, that is both effective and efficient at detecting objects and their relations in images. Our model contains a Relation Proposal Network (RePN) that efficiently deals with…

Computer Vision and Pattern Recognition · Computer Science 2018-08-02 Jianwei Yang, Jiasen Lu, Stefan Lee, Dhruv Batra, Devi Parikh

Graph Transformer Networks

Graph neural networks (GNNs) have been widely used in representation learning on graphs and achieved state-of-the-art performance in tasks such as node classification and link prediction. However, most existing GNNs are designed to learn…

Machine Learning · Computer Science 2020-02-06 Seongjun Yun, Minbyul Jeong, Raehyun Kim, Jaewoo Kang, Hyunwoo J. Kim

TransGNN: Harnessing the Collaborative Power of Transformers and Graph Neural Networks for Recommender Systems

Graph Neural Networks (GNNs) have emerged as promising solutions for collaborative filtering (CF) through the modeling of user-item interaction graphs. The nucleus of existing GNN-based recommender systems involves recursive message passing…

Machine Learning · Computer Science 2024-05-21 Peiyan Zhang, Yuchen Yan, Xi Zhang, Chaozhuo Li, Senzhang Wang, Feiran Huang, Sunghun Kim

NENET: An Edge Learnable Network for Link Prediction in Scene Text

Text detection in scenes based on deep neural networks has shown promising results. Instead of using word bounding box regression, recent state-of-the-art methods have started focusing on character bounding box and pixel-level prediction.…

Machine Learning · Computer Science 2020-05-26 Mayank Kumar Singh, Sayan Banerjee, Shubhasis Chaudhuri

Improving Graph Convolutional Networks with Transformer Layer in social-based items recommendation

In this work, we have proposed an approach for improving the GCN for predicting ratings in social networks. Our model is expanded from the standard model with several layers of transformer architecture. The main focus of the paper is on the…

Machine Learning · Computer Science 2024-01-15 Thi Linh Hoang, Tuan Dung Pham, Viet Cuong Ta

DSGG: Dense Relation Transformer for an End-to-end Scene Graph Generation

Scene graph generation aims to capture detailed spatial and semantic relationships between objects in an image, which is challenging due to incomplete labelling, long-tailed relationship categories, and relational semantic overlap. Existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Zeeshan Hayder, Xuming He

ReFormer: The Relational Transformer for Image Captioning

Image captioning is shown to be able to achieve a better performance by using scene graphs to represent the relations of objects in the image. The current captioning encoders generally use a Graph Convolutional Net (GCN) to represent the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-18 Xuewen Yang, Yingru Liu, Xin Wang

BGT-Net: Bidirectional GRU Transformer Network for Scene Graph Generation

Scene graphs are nodes and edges consisting of objects and object-object relationships, respectively. Scene graph generation (SGG) aims to identify the objects and their relationships. We propose a bidirectional GRU (BiGRU) transformer…

Computer Vision and Pattern Recognition · Computer Science 2021-09-14 Naina Dhingra, Florian Ritter, Andreas Kunz

Knowledge-Embedded Routing Network for Scene Graph Generation

Understanding a scene in depth involves not only locating/recognizing individual objects, but also inferring the relationships and interactions among them. However, since the distribution of real-world relationships is seriously…

Computer Vision and Pattern Recognition · Computer Science 2019-03-11 Tianshui Chen, Weihao Yu, Riquan Chen, Liang Lin

Relationformer: A Unified Framework for Image-to-Graph Generation

A comprehensive representation of an image requires understanding objects and their mutual relationship, especially in image-to-graph generation, e.g., road network extraction, blood-vessel network extraction, or scene graph generation.…

Computer Vision and Pattern Recognition · Computer Science 2022-03-22 Suprosanna Shit, Rajat Koner, Bastian Wittmann, Johannes Paetzold, Ivan Ezhov, Hongwei Li, Jiazhen Pan, Sahand Sharifzadeh, Georgios Kaissis, Volker Tresp, Bjoern Menze

A Variational Graph Autoencoder for Manipulation Action Recognition and Prediction

Despite decades of research, understanding human manipulation activities is, and has always been, one of the most attractive and challenging research topics in computer vision and robotics. Recognition and prediction of observed human…

Computer Vision and Pattern Recognition · Computer Science 2021-10-27 Gamze Akyol, Sanem Sariel, Eren Erdal Aksoy

Vision Relation Transformer for Unbiased Scene Graph Generation

Recent years have seen a growing interest in Scene Graph Generation (SGG), a comprehensive visual scene understanding task that aims to predict entity relationships using a relation encoder-decoder pipeline stacked on top of an object…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Gopika Sudhakaran, Devendra Singh Dhami, Kristian Kersting, Stefan Roth

Cross-Representation Knowledge Transfer for Improved Sequential Recommendations

Transformer architectures, capable of capturing sequential dependencies in the history of user interactions, have become the dominant approach in sequential recommender systems. Despite their success, such models consider sequence elements…

Information Retrieval · Computer Science 2026-03-02 Artur Gimranov, Viacheslav Yusupov, Elfat Sabitov, Tatyana Matveeva, Anton Lysenko, Ruslan Israfilov, Evgeny Frolov

A Hybrid Supervised and Self-Supervised Graph Neural Network for Edge-Centric Applications

This paper presents a novel graph-based deep learning model for tasks involving relations between two nodes (edge-centric tasks), where the focus lies on predicting relationships and interactions between pairs of nodes rather than node…

Machine Learning · Computer Science 2025-07-08 Eugenio Borzone, Leandro Di Persia, Matias Gerard