Related papers: A Contextual-Aware Position Encoding for Sequentia…

Context-aware Rotary Position Embedding

Positional encoding is a vital component of Transformer architectures, enabling models to incorporate sequence order into self-attention mechanisms. Rotary Positional Embeddings (RoPE) have become a widely adopted solution due to their…

Computation and Language · Computer Science 2025-08-01 Ali Veisi, Delaram Fartoot, Hamidreza Amirzadeh

Positional Encoding via Token-Aware Phase Attention

We prove under practical assumptions that Rotary Positional Embedding (RoPE) introduces an intrinsic distance-dependent bias in attention scores that limits RoPE's ability to model long-context. RoPE extension methods may alleviate this…

Computation and Language · Computer Science 2026-05-12 Yu Wang, Sheng Shen, Rémi Munos, Hongyuan Zhan, Yuandong Tian

Contextual Position Encoding: Learning to Count What's Important

The attention mechanism is a critical component of Large Language Models (LLMs) that allows tokens in a sequence to interact with each other, but is order-invariant. Incorporating position encoding (PE) makes it possible to address by…

Computation and Language · Computer Science 2024-05-31 Olga Golovneva, Tianlu Wang, Jason Weston, Sainbayar Sukhbaatar

Exploiting Positional Information for Session-based Recommendation

For present e-commerce platforms, session-based recommender systems are developed to predict users' preference for next-item recommendation. Although a session can usually reflect a user's current preference, a local shift of the user's…

Information Retrieval · Computer Science 2021-07-12 Ruihong Qiu, Zi Huang, Tong Chen, Hongzhi Yin

Causality-Induced Positional Encoding for Transformer-Based Representation Learning of Non-Sequential Features

Positional encoding is essential for supplementing transformer with positional information of tokens. Existing positional encoding methods demand predefined token/feature order, rendering them unsuitable for real-world data with…

Machine Learning · Computer Science 2025-09-25 Kaichen Xu, Yihang Du, Mianpeng Liu, Zimu Yu, Xiaobo Sun

CoPE: A Lightweight Complex Positional Encoding

Recent studies have demonstrated the effectiveness of position encoding in transformer architectures. By incorporating positional information, this approach provides essential guidance for modeling dependencies between elements across…

Machine Learning · Computer Science 2025-08-27 Avinash Amballa

DAPE: Data-Adaptive Positional Encoding for Length Extrapolation

Positional encoding plays a crucial role in transformers, significantly impacting model performance and length generalization. Prior research has introduced absolute positional encoding (APE) and relative positional encoding (RPE) to…

Computation and Language · Computer Science 2024-11-06 Chuanyang Zheng, Yihang Gao, Han Shi, Minbin Huang, Jingyao Li, Jing Xiong, Xiaozhe Ren, Michael Ng, Xin Jiang, Zhenguo Li, Yu Li

Rethinking Addressing in Language Models via Contexualized Equivariant Positional Encoding

Transformers rely on both content-based and position-based addressing mechanisms to make predictions, but existing positional encoding techniques often diminish the effectiveness of position-based addressing. Many current methods enforce…

Computation and Language · Computer Science 2025-08-22 Jiajun Zhu, Peihao Wang, Ruisi Cai, Jason D. Lee, Pan Li, Zhangyang Wang

Context-Aware Sequential Model for Multi-Behaviour Recommendation

Sequential recommendation models are crucial for next-item recommendations in online platforms, capturing complex patterns in user interactions. However, many focus on a single behavior, overlooking valuable implicit interactions like…

Information Retrieval · Computer Science 2023-12-18 Shereen Elsayed, Ahmed Rashed, Lars Schmidt-Thieme

Remember to Forget: Gated Adaptive Positional Encoding

Rotary Positional Encoding (RoPE) is widely used in modern large language models. However, when sequences are extended beyond the range seen during training, rotary phases can enter out-of-distribution regimes, leading to spurious…

Machine Learning · Computer Science 2026-05-12 Riccardo Ali, Alessio Borgi, Christopher Irwin, Mario Severino, Pietro Liò

Conditional Positional Encodings for Vision Transformers

We propose a conditional positional encoding (CPE) scheme for vision Transformers. Unlike previous fixed or learnable positional encodings, which are pre-defined and independent of input tokens, CPE is dynamically generated and conditioned…

Computer Vision and Pattern Recognition · Computer Science 2023-02-14 Xiangxiang Chu, Zhi Tian, Bo Zhang, Xinlong Wang, Chunhua Shen

Equipping Sketch Patches with Context-Aware Positional Encoding for Graphic Sketch Representation

When benefiting graphic sketch representation with sketch drawing orders, recent studies have linked sketch patches as graph edges by drawing orders in accordance with a temporal-based nearest neighboring strategy. However, such constructed…

Computer Vision and Pattern Recognition · Computer Science 2025-05-16 Sicong Zang, Zhijun Fang

A 2D Semantic-Aware Position Encoding for Vision Transformers

Vision transformers have demonstrated significant advantages in computer vision tasks due to their ability to capture long-range dependencies and contextual relationships through self-attention. However, existing position encoding…

Computer Vision and Pattern Recognition · Computer Science 2025-05-15 Xi Chen, Shiyang Zhou, Muqi Huang, Jiaxu Feng, Yun Xiong, Kun Zhou, Biao Yang, Yuhui Zhang, Huishuai Bao, Sijia Peng, Chuan Li, Feng Shi

Set-to-Sequence Ranking-based Concept-aware Learning Path Recommendation

With the development of the online education system, personalized education recommendation has played an essential role. In this paper, we focus on developing path recommendation systems that aim to generate and recommend an entire…

Information Retrieval · Computer Science 2023-06-08 Xianyu Chen, Jian Shen, Wei Xia, Jiarui Jin, Yakun Song, Weinan Zhang, Weiwen Liu, Menghui Zhu, Ruiming Tang, Kai Dong, Dingyin Xia, Yong Yu

Positional encoding is not the same as context: A study on positional encoding for sequential recommendation

The rapid growth of streaming media and e-commerce has driven advancements in recommendation systems, particularly Sequential Recommendation Systems (SRS). These systems employ users' interaction histories to predict future preferences.…

Information Retrieval · Computer Science 2025-01-22 Alejo Lopez-Avila, Jinhua Du, Abbas Shimary, Ze Li

Deep Context-Aware Recommender System Utilizing Sequential Latent Context

Context-aware recommender systems (CARSs) apply sensing and analysis of user context in order to provide personalized services. Adding context to a recommendation model is challenging, since the addition of context may increase both the…

Machine Learning · Computer Science 2020-08-07 Amit Livne, Moshe Unger, Bracha Shapira, Lior Rokach

Context-aware Sequential Recommendation

Since sequential information plays an important role in modeling user behaviors, various sequential recommendation methods have been proposed. Methods based on the Markov assumption are widely used, but independently combine several most recent…

Information Retrieval · Computer Science 2016-09-20 Qiang Liu, Shu Wu, Diyi Wang, Zhaokang Li, Liang Wang

Facet-Aware Multi-Head Mixture-of-Experts Model with Text-Enhanced Pre-training for Sequential Recommendation

Sequential recommendation (SR) systems excel at capturing users' dynamic preferences by leveraging their interaction histories. Most existing SR systems assign a single embedding vector to each item to represent its features, adopting…

Information Retrieval · Computer Science 2026-01-21 Mingrui Liu, Sixiao Zhang, Cheng Long

Learning Positional Attention for Sequential Recommendation

Self-attention-based networks have achieved remarkable performance in sequential recommendation tasks. A crucial component of these models is positional encoding. In this study, we delve into the learned positional embedding, demonstrating…

Information Retrieval · Computer Science 2024-11-27 Fan Luo, Haibo He, Juan Zhang, Shenghui Xu

Relative Positional Encoding for Transformers with Linear Complexity

Recent advances in Transformer models allow for unprecedented sequence lengths, due to linear space and time complexity. In the meantime, relative positional encoding (RPE) was proposed as beneficial for classical Transformers and consists…

Machine Learning · Computer Science 2021-06-11 Antoine Liutkus, Ondřej Cífka, Shih-Lun Wu, Umut Şimşekli, Yi-Hsuan Yang, Gaël Richard