English
Related papers

Related papers: GRPE: Relative Positional Encoding for Graph Trans…

200 papers

Recent advances in Transformer models allow for unprecedented sequence lengths, due to linear space and time complexity. In the meantime, relative positional encoding (RPE) was proposed as beneficial for classical Transformers and consists…

Machine Learning · Computer Science 2021-06-11 Antoine Liutkus , Ondřej Cífka , Shih-Lun Wu , Umut Şimşekli , Yi-Hsuan Yang , Gaël Richard

Relative positional encoding is widely used in vanilla and linear transformers to represent positional information. However, existing encoding methods of a vanilla transformer are not always directly applicable to a linear transformer,…

Computation and Language · Computer Science 2023-07-19 Zhen Qin , Weixuan Sun , Kaiyue Lu , Hui Deng , Dongxu Li , Xiaodong Han , Yuchao Dai , Lingpeng Kong , Yiran Zhong

Relative position encoding (RPE) is important for transformer to capture sequence ordering of input tokens. General efficacy has been proven in natural language processing. However, in computer vision, its efficacy is not well studied and…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Kan Wu , Houwen Peng , Minghao Chen , Jianlong Fu , Hongyang Chao

Graph Transformers (GTs) facilitate the comprehension of graph-structured data by calculating the self-attention of node pairs without considering node position information. To address this limitation, we introduce an innovative and…

Machine Learning · Computer Science 2023-12-12 Kushal Bose , Swagatam Das

The distinguishing power of graph transformers is closely tied to the choice of positional encoding: features used to augment the base transformer with information about the graph. There are two primary types of positional encoding:…

Machine Learning · Computer Science 2024-08-26 Mitchell Black , Zhengchao Wan , Gal Mishne , Amir Nayyeri , Yusu Wang

Graph neural networks (GNNs) have become the standard learning architectures for graphs. GNNs have been applied to numerous domains ranging from quantum chemistry, recommender systems to knowledge graphs and natural language processing. A…

Machine Learning · Computer Science 2022-02-11 Vijay Prakash Dwivedi , Anh Tuan Luu , Thomas Laurent , Yoshua Bengio , Xavier Bresson

A current goal in the graph neural network literature is to enable transformers to operate on graph-structured data, given their success on language and vision tasks. Since the transformer's original sinusoidal positional encodings (PEs)…

Machine Learning · Computer Science 2023-04-11 Patrick Soga , David Chiang

Several recent works use positional encodings to extend the receptive fields of graph neural network (GNN) layers equipped with attention mechanisms. These techniques, however, extend receptive fields to the complete graph, at substantial…

Machine Learning · Computer Science 2023-12-14 Rickard Brüel-Gabrielsson , Mikhail Yurochkin , Justin Solomon

We study the extent to which rotary position encodings (RoPE), a recent transformer position encoding algorithm broadly adopted in large language models (LLMs) and vision transformers (ViTs), can be applied to graph-structured data. We find…

Transformer architectures rely on position encodings to model the spatial structure of input data. Rotary Position Encoding (RoPE) is a widely used method in language models that encodes relative positions through fixed, block-diagonal,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Sophie Ostmeier , Brian Axelrod , Maya Varma , Michael E. Moseley , Akshay Chaudhari , Curtis Langlotz

Positional and structural encodings (PSE) enable better identifiability of nodes within a graph, rendering them essential tools for empowering modern GNNs, and in particular graph Transformers. However, designing PSEs that work optimally…

In this work, we theoretically demonstrate that current graph positional encodings (PEs) are not beneficial and could potentially hurt performance in tasks involving heterophilous graphs, where nodes that are close tend to have different…

Machine Learning · Computer Science 2025-04-30 Michael Ito , Jiong Zhu , Dexiong Chen , Danai Koutra , Jenna Wiens

Graph neural networks (GNNs) largely rely on the message-passing paradigm, where nodes iteratively aggregate information from their neighbors. Yet, standard message passing neural networks (MPNNs) face well-documented theoretical and…

Machine Learning · Computer Science 2026-05-15 Juan Amboage , Ernst Röell , Patrick Schnider , Bastian Rieck

Natural reading orders of words are crucial for information extraction from form-like documents. Despite recent advances in Graph Convolutional Networks (GCNs) on modeling spatial layout patterns of documents, they have limited ability to…

Computation and Language · Computer Science 2021-06-22 Chen-Yu Lee , Chun-Liang Li , Chu Wang , Renshen Wang , Yasuhisa Fujii , Siyang Qin , Ashok Popat , Tomas Pfister

Recent studies have demonstrated the effectiveness of position encoding in transformer architectures. By incorporating positional information, this approach provides essential guidance for modeling dependencies between elements across…

Machine Learning · Computer Science 2025-08-27 Avinash Amballa

We present GRAPE (Group Representational Position Encoding), a unified framework for positional encoding based on group actions. GRAPE unifies two families of mechanisms: (i) multiplicative rotations (Multiplicative GRAPE) in…

Machine Learning · Computer Science 2026-05-15 Yifan Zhang , Zixiang Chen , Yifeng Liu , Zhen Qin , Huizhuo Yuan , Kangping Xu , Yang Yuan , Quanquan Gu , Andrew Chi-Chih Yao

Unsupervised pre-training on vast amounts of graph data is critical in real-world applications wherein labeled data is limited, such as molecule properties prediction or materials science. Existing approaches pre-train models for specific…

Machine Learning · Computer Science 2024-10-01 Viet Anh Nguyen , Nhat Khang Ngo , Truong Son Hy

Transformers are increasingly prevalent for multi-view computer vision tasks, where geometric relationships between viewpoints are critical for 3D perception. To leverage these relationships, multi-view transformers must use camera geometry…

Computer Vision and Pattern Recognition · Computer Science 2025-11-14 Ruilong Li , Brent Yi , Junchen Liu , Hang Gao , Yi Ma , Angjoo Kanazawa

Without positional information, attention-based Transformer neural networks are permutation-invariant. Absolute or relative positional embeddings are the most popular ways to feed Transformer models with positional information. Absolute…

Machine Learning · Computer Science 2021-11-10 Tatiana Likhomanenko , Qiantong Xu , Gabriel Synnaeve , Ronan Collobert , Alex Rogozhnikov

Graph neural networks based on iterative one-hop message passing have been shown to struggle in harnessing the information from distant nodes effectively. Conversely, graph transformers allow each node to attend to all other nodes directly,…

Machine Learning · Computer Science 2024-06-06 Yuhui Ding , Antonio Orvieto , Bobby He , Thomas Hofmann
‹ Prev 1 2 3 10 Next ›