English
Related papers

Related papers: Group Representational Position Encoding

200 papers

We propose a novel positional encoding for learning graph on Transformer architecture. Existing approaches either linearize a graph to encode absolute position in the sequence of nodes, or encode relative position with another node using…

Machine Learning · Computer Science 2022-10-17 Wonpyo Park , Woonggi Chang , Donggeon Lee , Juntae Kim , Seung-won Hwang

Rotary Position Embedding (RoPE) is widely adopted in large language models (LLMs) due to its efficient encoding of relative positions with strong extrapolation capabilities. However, while its application in higher-dimensional input…

Machine Learning · Computer Science 2025-07-16 Haiping Liu , Lijing Lin , Jingyuan Sun , Zhegong Shangguan , Mauricio A. Alvarez , Hongpeng Zhou

Rotary Position Embedding (RoPE) is the de facto positional encoding in large language models due to its ability to encode relative positions and support length extrapolation. When adapted to vision transformers, the standard axial…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Haoyu Liu , Sucheng Ren , Tingyu Zhu , Peng Wang , Cihang Xie , Alan Yuille , Zeyu Zheng , Feng Wang

We study the extent to which rotary position encodings (RoPE), a recent transformer position encoding algorithm broadly adopted in large language models (LLMs) and vision transformers (ViTs), can be applied to graph-structured data. We find…

Natural reading orders of words are crucial for information extraction from form-like documents. Despite recent advances in Graph Convolutional Networks (GCNs) on modeling spatial layout patterns of documents, they have limited ability to…

Computation and Language · Computer Science 2021-06-22 Chen-Yu Lee , Chun-Liang Li , Chu Wang , Renshen Wang , Yasuhisa Fujii , Siyang Qin , Ashok Popat , Tomas Pfister

Transformer architectures rely on position encodings to model the spatial structure of input data. Rotary Position Encoding (RoPE) is a widely used method in language models that encodes relative positions through fixed, block-diagonal,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Sophie Ostmeier , Brian Axelrod , Maya Varma , Michael E. Moseley , Akshay Chaudhari , Curtis Langlotz

The distinguishing power of graph transformers is closely tied to the choice of positional encoding: features used to augment the base transformer with information about the graph. There are two primary types of positional encoding:…

Machine Learning · Computer Science 2024-08-26 Mitchell Black , Zhengchao Wan , Gal Mishne , Amir Nayyeri , Yusu Wang

Standard Vision Transformers flatten 2D images into 1D sequences, disrupting the natural spatial topology. While Rotary Positional Embedding (RoPE) excels in 1D, it inherits this limitation, often treating spatially distant patches (e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-05 Yupu Yao , Bowen Yang

Rotary Positional Encodings (RoPE) have emerged as a highly effective technique for one-dimensional sequences in Natural Language Processing spurring recent progress towards generalizing RoPE to higher-dimensional data such as images and…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Chase van de Geijn , Timo Lüddecke , Polina Turishcheva , Alexander S. Ecker

Graph Representation Learning (GRL) methods opened new avenues for addressing complex, real-world problems represented by graphs. However, many graphs used in these applications comprise millions of nodes and billions of edges and are…

Positional encoding is essential for supplementing transformer with positional information of tokens. Existing positional encoding methods demand predefined token/feature order, rendering them unsuitable for real-world data with…

Machine Learning · Computer Science 2025-09-25 Kaichen Xu , Yihang Du , Mianpeng Liu , Zimu Yu , Xiaobo Sun

Understanding spatial location and relationships is a fundamental capability for modern artificial intelligence systems. Insights from human spatial cognition provide valuable guidance in this domain. Neuroscientific discoveries have…

Neural and Evolutionary Computing · Computer Science 2024-09-17 Boyang Li , Yulin Wu , Nuoxian Huang , Wenjia Zhang

The Transformer architecture has revolutionized various regions since it was proposed, and its effectiveness largely depends on the ability to encode positional information. Traditional position encoding methods exhibit significant…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Hao Yu , Tangyu Jiang , Shuning Jia , Shannan Yan , Shunning Liu , Haolong Qian , Guanghao Li , Shuting Dong , Huaisong Zhang , Chun Yuan

Transformers rely on explicit positional encoding to model structure in data. While Rotary Position Embedding (RoPE) excels in 1D domains, its application to image generation reveals significant limitations such as fine-grained spatial…

Computer Vision and Pattern Recognition · Computer Science 2026-03-13 Jiaye Li , Baoyou Chen , Hui Li , Zilong Dong , Jingdong Wang , Siyu Zhu

Positional encodings (PEs) are essential for building powerful and expressive graph neural networks and graph transformers, as they effectively capture the relative spatial relationships between nodes. Although extensive research has been…

Machine Learning · Computer Science 2026-03-16 Yinan Huang , Haoyu Wang , Pan Li

A current goal in the graph neural network literature is to enable transformers to operate on graph-structured data, given their success on language and vision tasks. Since the transformer's original sinusoidal positional encodings (PEs)…

Machine Learning · Computer Science 2023-04-11 Patrick Soga , David Chiang

Accurate and efficient modeling of agent interactions is essential for trajectory generation, the core of autonomous driving systems. Existing methods, scene-centric, agent-centric, and query-centric frameworks, each present distinct…

Robotics · Computer Science 2025-03-20 Jianbo Zhao , Taiyu Ban , Zhihao Liu , Hangning Zhou , Xiyang Wang , Qibin Zhou , Hailong Qin , Mu Yang , Lei Liu , Bin Li

Positional encoding (PE) underpins how permutation-invariant Transformers represent sequence order, yet how positional information is processed and stored remains poorly understood. Modern PE methods such as RoPE still struggle on tasks…

Computation and Language · Computer Science 2026-05-29 Pierre-Antoine Lequeu , Camille Barboule , Benjamin Piwowarski

Position information is essential for language modeling. In softmax transformers, Rotary Position Embeddings (\textit{RoPE}) encode positions through \textit{fixed-angle} rotations, while in linear transformers, order is handled via…

Computation and Language · Computer Science 2026-04-27 Sajad Movahedi , Timur Carstensen , Arshia Afzal , Frank Hutter , Antonio Orvieto , Volkan Cevher

Automatic Robotic Assembly Sequence Planning (RASP) can significantly improve productivity and resilience in modern manufacturing along with the growing need for greater product customization. One of the main challenges in realizing such…

Robotics · Computer Science 2023-07-28 Matan Atad , Jianxiang Feng , Ismael Rodríguez , Maximilian Durner , Rudolph Triebel
‹ Prev 1 2 3 10 Next ›