Related papers: Sequential Recommendation with Relation-Aware Kern…

Sequential Recommendation via Stochastic Self-Attention

Sequential recommendation models the dynamics of a user's previous behaviors in order to forecast the next item, and has drawn a lot of attention. Transformer-based approaches, which embed items as vectors and use dot-product self-attention…

Information Retrieval · Computer Science 2022-03-08 Ziwei Fan , Zhiwei Liu , Alice Wang , Zahra Nazari , Lei Zheng , Hao Peng , Philip S. Yu

Next Item Recommendation with Self-Attention

In this paper, we propose a novel sequence-aware recommendation model. Our model utilizes self-attention mechanism to infer the item-item relationship from user's historical interactions. With self-attention, it is able to estimate the…

Information Retrieval · Computer Science 2018-08-28 Shuai Zhang , Yi Tay , Lina Yao , Aixin Sun

Attention Calibration for Transformer-based Sequential Recommendation

Transformer-based sequential recommendation (SR) has been booming in recent years, with the self-attention mechanism as its key component. Self-attention has been widely believed to be able to effectively select those informative and…

Information Retrieval · Computer Science 2024-03-19 Peilin Zhou , Qichen Ye , Yueqi Xie , Jingqi Gao , Shoujin Wang , Jae Boum Kim , Chenyu You , Sunghun Kim

Sequential Recommendation via Adaptive Robust Attention with Multi-dimensional Embeddings

Sequential recommendation models have achieved state-of-the-art performance using self-attention mechanism. It has since been found that moving beyond only using item ID and positional embeddings leads to a significant accuracy boost when…

Information Retrieval · Computer Science 2024-09-10 Linsey Pang , Amir Hossein Raffiee , Wei Liu , Keld Lundgaard

Gated Rotary-Enhanced Linear Attention with Rank Modulation for Long-term Sequential Recommendation

In Sequential Recommendation Systems (SRSs), Transformer models have demonstrated remarkable performance but face computational and memory cost challenges, especially when modeling long-term user behavior sequences. Due to its quadratic…

Information Retrieval · Computer Science 2026-03-25 Juntao Hu , Wei Zhou , Haini Cai , Xiao Du , Huayi Shen , Junhao Wen

Tensor-based Sequential Learning via Hankel Matrix Representation for Next Item Recommendations

Self-attentive transformer models have recently been shown to solve the next item recommendation task very efficiently. The learned attention weights capture sequential dynamics in user behavior and generalize well. Motivated by the special…

Machine Learning · Computer Science 2022-12-13 Evgeny Frolov , Ivan Oseledets

Pay Attention to Attention for Sequential Recommendation

Transformer-based approaches have demonstrated remarkable success in various sequence-based tasks. However, traditional self-attention models may not sufficiently capture the intricate dependencies within items in sequential recommendation…

Information Retrieval · Computer Science 2024-10-29 Yuli Liu , Min Liu , Xiaojing Liu

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Self-attention (SA), which encodes vector sequences according to their pairwise similarity, is widely used in speech recognition due to its strong context modeling ability. However, when applied to long sequence data, its accuracy is…

Sound · Computer Science 2021-10-11 Chengdong Liang , Menglong Xu , Xiao-Lei Zhang

Tensorized Self-Attention: Efficiently Modeling Pairwise and Global Dependencies Together

Neural networks equipped with self-attention have parallelizable computation, light-weight structure, and the ability to capture both long-range and local dependencies. Further, their expressive power and performance can be boosted by using…

Computation and Language · Computer Science 2019-03-27 Tao Shen , Tianyi Zhou , Guodong Long , Jing Jiang , Chengqi Zhang

Position-Aware Sequential Attention for Accurate Next Item Recommendations

Sequential self-attention models usually rely on additive positional embeddings, which inject positional information into item representations at the input. In the absence of positional signals, the attention block is…

Information Retrieval · Computer Science 2026-02-25 Timur Nabiev , Evgeny Frolov

Recursive Attentive Methods with Reused Item Representations for Sequential Recommendation

Sequential recommendation aims to recommend the next item of users' interest based on their historical interactions. Recently, the self-attention mechanism has been adapted for sequential recommendation, and demonstrated state-of-the-art…

Information Retrieval · Computer Science 2022-09-19 Bo Peng , Srinivasan Parthasarathy , Xia Ning

Collaboration and Transition: Distilling Item Transitions into Multi-Query Self-Attention for Sequential Recommendation

Modern recommender systems employ various sequential modules such as self-attention to learn dynamic user interests. However, these methods are less effective in capturing collaborative and transitional signals within user interaction…

Information Retrieval · Computer Science 2023-12-27 Tianyu Zhu , Yansong Shi , Yuan Zhang , Yihong Wu , Fengran Mo , Jian-Yun Nie

Designing a Sequential Recommendation System for Heterogeneous Interactions Using Transformers

While many production-ready and robust algorithms are available for the task of recommendation systems, many of these systems do not take the order of user's consumption into account. The order of consumption can be very useful and matters…

Information Retrieval · Computer Science 2022-05-03 Mehdi Soleiman Nejad , Meysam Varasteh , Hadi Moradi , Mohammad Amin Sadeghi

LinRec: Linear Attention Mechanism for Long-term Sequential Recommender Systems

Transformer models have achieved remarkable success in sequential recommender systems (SRSs). However, computing the attention matrix in traditional dot-product attention mechanisms results in a quadratic complexity with sequence lengths,…

Information Retrieval · Computer Science 2024-11-05 Langming Liu , Xiangyu Zhao , Chi Zhang , Jingtong Gao , Wanyu Wang , Wenqi Fan , Yiqi Wang , Ming He , Zitao Liu , Qing Li

Selective Attention: Enhancing Transformer through Principled Context Control

The attention mechanism within the transformer architecture enables the model to weigh and combine tokens based on their relevance to the query. While self-attention has enjoyed major success, it notably treats all queries $q$ in the same…

Machine Learning · Computer Science 2024-11-21 Xuechen Zhang , Xiangyu Chang , Mingchen Li , Amit Roy-Chowdhury , Jiasi Chen , Samet Oymak

Conditional Self-Attention for Query-based Summarization

Self-attention mechanisms have achieved great success on a variety of NLP tasks due to its flexibility of capturing dependency between arbitrary positions in a sequence. For problems such as query-based summarization (Qsumm) and knowledge…

Computation and Language · Computer Science 2020-02-19 Yujia Xie , Tianyi Zhou , Yi Mao , Weizhu Chen

Linear Self-Attention Approximation via Trainable Feedforward Kernel

In pursuit of faster computation, Efficient Transformers demonstrate an impressive variety of approaches -- models attaining sub-quadratic attention complexity can utilize a notion of sparsity or a low-rank approximation of inputs to reduce…

Machine Learning · Computer Science 2022-11-09 Uladzislau Yorsh , Alexander Kovalenko

Self-Attentive Sequential Recommendation with Cheap Causal Convolutions

Sequential Recommendation is a prominent topic in current research, which uses user behavior sequence as an input to predict future behavior. By assessing the correlation strength of historical behavior through the dot product, the model…

Information Retrieval · Computer Science 2023-02-23 Jiayi Chen , Wen Wu , Liye Shi , Yu Ji , Wenxin Hu , Xi Chen , Wei Zheng , Liang He

An Attentive Inductive Bias for Sequential Recommendation beyond the Self-Attention

Sequential recommendation (SR) models based on Transformers have achieved remarkable successes. The self-attention mechanism of Transformers for computer vision and natural language processing suffers from the oversmoothing problem, i.e.,…

Machine Learning · Computer Science 2024-02-20 Yehjin Shin , Jeongwhan Choi , Hyowon Wi , Noseong Park

Recency Biased Causal Attention for Time-series Forecasting

Recency bias is a useful inductive prior for sequential modeling: it emphasizes nearby observations and can still allow longer-range dependencies. Standard Transformer attention lacks this property, relying on all-to-all interactions that…

Machine Learning · Computer Science 2026-04-23 Kareem Hegazy , Michael W. Mahoney , N. Benjamin Erichson