English
Related papers

Related papers: Learning Positional Attention for Sequential Recom…

200 papers

Sequential self-attention models usually rely on additive positional embeddings, which inject positional information into item representations at the input. In the absence of positional signals, the attention block is…

Information Retrieval · Computer Science 2026-02-25 Timur Nabiev , Evgeny Frolov

The self-attention mechanism, which equips with a strong capability of modeling long-range dependencies, is one of the extensively used techniques in the sequential recommendation field. However, many recent studies represent that current…

Information Retrieval · Computer Science 2023-05-18 Xinyu Du , Huanhuan Yuan , Pengpeng Zhao , Jianfeng Qu , Fuzhen Zhuang , Guanfeng Liu , Victor S. Sheng

Recently, self-attention based models have achieved state-of-the-art performance in sequential recommendation task. Following the custom from language processing, most of these models rely on a simple positional embedding to exploit the…

Machine Learning · Computer Science 2020-08-24 Sung Min Cho , Eunhyeok Park , Sungjoo Yoo

Attentional mechanisms are order-invariant. Positional encoding is a crucial component to allow attention-based deep model architectures such as Transformer to address sequences or images where the position of information matters. In this…

Machine Learning · Computer Science 2021-11-10 Yang Li , Si Si , Gang Li , Cho-Jui Hsieh , Samy Bengio

Sequential recommendation models have achieved state-of-the-art performance using self-attention mechanism. It has since been found that moving beyond only using item ID and positional embeddings leads to a significant accuracy boost when…

Information Retrieval · Computer Science 2024-09-10 Linsey Pang , Amir Hossein Raffiee , Wei Liu , Keld Lundgaard

Constituting highly informative network embeddings is an important tool for network analysis. It encodes network topology, along with other useful side information, into low-dimensional node-based feature representations that can be…

Computation and Language · Computer Science 2019-06-06 Liqun Chen , Guoyin Wang , Chenyang Tao , Dinghan Shen , Pengyu Cheng , Xinyuan Zhang , Wenlin Wang , Yizhe Zhang , Lawrence Carin

User interests are usually dynamic in the real world, which poses both theoretical and practical challenges for learning accurate preferences from rich behavior data. Among existing user behavior modeling solutions, attention networks are…

Information Retrieval · Computer Science 2022-04-14 Chao Chen , Haoyu Geng , Nianzu Yang , Junchi Yan , Daiyue Xue , Jianping Yu , Xiaokang Yang

Positional encodings enable Transformers to incorporate sequential information, yet their theoretical understanding remains limited to two properties: distance attenuation and translation invariance. Because natural language lacks purely…

Machine Learning · Computer Science 2026-02-11 Zihan Gu , Ruoyu Chen , Han Zhang , Hua Zhang , Yue Hu

Sequential modelling with self-attention has achieved cutting edge performances in natural language processing. With advantages in model flexibility, computation complexity and interpretability, self-attention is gradually becoming a key…

Machine Learning · Computer Science 2019-12-02 Da Xu , Chuanwei Ruan , Sushant Kumar , Evren Korpeoglu , Kannan Achan

This paper describes how to apply self-attention with relative positional encodings to the task of relation extraction. We propose to use the self-attention encoder layer together with an additional position-aware attention layer that takes…

Computation and Language · Computer Science 2018-07-10 Ivan Bilan , Benjamin Roth

We prove under practical assumptions that Rotary Positional Embedding (RoPE) introduces an intrinsic distance-dependent bias in attention scores that limits RoPE's ability to model long-context. RoPE extension methods may alleviate this…

Computation and Language · Computer Science 2026-05-12 Yu Wang , Sheng Shen , Rémi Munos , Hongyuan Zhan , Yuandong Tian

In recent years, pre-trained Transformers have dominated the majority of NLP benchmark tasks. Many variants of pre-trained Transformers have kept breaking out, and most focus on designing different pre-training objectives or variants of…

Computation and Language · Computer Science 2020-10-13 Yu-An Wang , Yun-Nung Chen

Forecasting the future traffic flow distribution in an area is an important issue for traffic management in an intelligent transportation system. The key challenge of traffic prediction is to capture spatial and temporal relations between…

Machine Learning · Computer Science 2019-04-15 Shiheng Ma , Jingcai Guo , Song Guo , Minyi Guo

Neural sequence-to-sequence networks with attention have achieved remarkable performance for machine translation. One of the reasons for their effectiveness is their ability to capture relevant source-side contextual information at each…

Computation and Language · Computer Science 2018-10-02 Lesly Miculicich Werlen , Nikolaos Pappas , Dhananjay Ram , Andrei Popescu-Belis

Self-attentive transformer models have recently been shown to solve the next item recommendation task very efficiently. The learned attention weights capture sequential dynamics in user behavior and generalize well. Motivated by the special…

Machine Learning · Computer Science 2022-12-13 Evgeny Frolov , Ivan Oseledets

Token representation strategies within large-scale neural architectures often rely on contextually refined embeddings, yet conventional approaches seldom encode structured relationships explicitly within token interactions. Self-attention…

Computation and Language · Computer Science 2025-03-27 James Blades , Frederick Somerfield , William Langley , Susan Everingham , Maurice Witherington

Lattices are an efficient and effective method to encode ambiguity of upstream systems in natural language processing tasks, for example to compactly capture multiple speech recognition hypotheses, or to represent multiple linguistic…

Computation and Language · Computer Science 2019-06-05 Matthias Sperber , Graham Neubig , Ngoc-Quan Pham , Alex Waibel

In information retrieval, learning to rank constructs a machine-based ranking model which given a query, sorts the search results by their degree of relevance or importance to the query. Neural networks have been successfully applied to…

Machine Learning · Computer Science 2017-12-12 Baiyang Wang , Diego Klabjan

In recommender systems, models mostly use a combination of embedding layers and multilayer feedforward neural networks. The high-dimensional sparse original features are downscaled in the embedding layer and then fed into the fully…

Information Retrieval · Computer Science 2022-05-19 Mohan Hasama , Jing Li

Position encoding (PE), an essential part of self-attention networks (SANs), is used to preserve the word order information for natural language processing tasks, generating fixed position indices for input sequences. However, in…

Computation and Language · Computer Science 2020-11-24 Liang Ding , Longyue Wang , Dacheng Tao
‹ Prev 1 2 3 10 Next ›