Related papers: A Novel Mamba-based Sequential Recommendation Meth…

HMamba: Hyperbolic Mamba for Sequential Recommendation

Sequential recommendation systems have become a cornerstone of personalized services, adept at modeling the temporal evolution of user preferences by capturing dynamic interaction sequences. Existing approaches predominantly rely on…

Information Retrieval · Computer Science 2025-05-15 Qianru Zhang , Honggang Wen , Wei Yuan , Crystal Chen , Menglin Yang , Siu-Ming Yiu , Hongzhi Yin

M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation

Sequential recommendation systems aim to predict users' next preferences based on their interaction histories, but existing approaches face critical limitations in efficiency and multi-scale pattern recognition. While Transformer-based…

Information Retrieval · Computer Science 2025-05-08 Qianru Zhang , Liang Qu , Honggang Wen , Dong Huang , Siu-Ming Yiu , Nguyen Quoc Viet Hung , Hongzhi Yin

MLSA4Rec: Mamba Combined with Low-Rank Decomposed Self-Attention for Sequential Recommendation

In applications such as e-commerce, online education, and streaming services, sequential recommendation systems play a critical role. Despite the excellent performance of self-attention-based sequential recommendation models in capturing…

Information Retrieval · Computer Science 2026-02-06 Jinzhao Su , Zhenhua Huang

MaTrRec: Uniting Mamba and Transformer for Sequential Recommendation

Sequential recommendation systems aim to provide personalized recommendations by analyzing dynamic preferences and dependencies within user behavior sequences. Recently, Transformer models can effectively capture user preferences. However,…

Information Retrieval · Computer Science 2024-07-30 Shun Zhang , Runsen Zhang , Zhirong Yang

SIGMA: Selective Gated Mamba for Sequential Recommendation

In various domains, Sequential Recommender Systems (SRS) have become essential due to their superior capability to discern intricate user preferences. Typically, SRS utilize transformer-based architectures to forecast the subsequent item…

Artificial Intelligence · Computer Science 2024-12-25 Ziwei Liu , Qidong Liu , Yejing Wang , Wanyu Wang , Pengyue Jia , Maolin Wang , Zitao Liu , Yi Chang , Xiangyu Zhao

Mamba4Rec: Towards Efficient Sequential Recommendation with Selective State Space Models

Sequential recommendation aims to estimate the dynamic user preferences and sequential dependencies among historical user behaviors. Although Transformer-based models have proven to be effective for sequential recommendation, they suffer…

Information Retrieval · Computer Science 2024-07-02 Chengkai Liu , Jianghao Lin , Jianling Wang , Hanzhou Liu , James Caverlee

Decision Mamba: Reinforcement Learning via Hybrid Selective Sequence Modeling

Recent works have shown the remarkable superiority of transformer models in reinforcement learning (RL), where the decision-making problem is formulated as sequential generation. Transformer-based agents could emerge with self-improvement…

Machine Learning · Computer Science 2024-06-04 Sili Huang , Jifeng Hu , Zhejian Yang , Liwei Yang , Tao Luo , Hechang Chen , Lichao Sun , Bo Yang

Uncovering Selective State Space Model's Capabilities in Lifelong Sequential Recommendation

Sequential Recommenders have been widely applied in various online services, aiming to model users' dynamic interests from their sequential interactions. With users increasingly engaging with online platforms, vast amounts of lifelong user…

Information Retrieval · Computer Science 2024-03-26 Jiyuan Yang , Yuanzi Li , Jingyu Zhao , Hanbing Wang , Muyang Ma , Jun Ma , Zhaochun Ren , Mengqi Zhang , Xin Xin , Zhumin Chen , Pengjie Ren

Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data

In recent years, Transformers have become the de-facto architecture for sequence modeling on text and a variety of multi-dimensional data, such as images and video. However, the use of self-attention layers in a Transformer incurs…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Shufan Li , Harkanwar Singh , Aditya Grover

TransMamba: A Sequence-Level Hybrid Transformer-Mamba Language Model

Transformers are the cornerstone of modern large language models, but their quadratic computational complexity limits efficiency in long-sequence processing. Recent advancements in Mamba, a state space model (SSM) with linear complexity,…

Machine Learning · Computer Science 2026-01-08 Yixing Li , Ruobing Xie , Zhen Yang , Xingwu Sun , Shuaipeng Li , Weidong Han , Zhanhui Kang , Yu Cheng , Chengzhong Xu , Di Wang , Jie Jiang

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Foundation models, now powering most of the exciting applications in deep learning, are almost universally based on the Transformer architecture and its core attention module. Many subquadratic-time architectures such as linear attention,…

Machine Learning · Computer Science 2024-06-03 Albert Gu , Tri Dao

Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces

Decision Transformer, a promising approach that applies Transformer architectures to reinforcement learning, relies on causal self-attention to model sequences of states, actions, and rewards. While this method has shown competitive…

Machine Learning · Computer Science 2024-04-01 Toshihiro Ota

SSD4Rec: A Structured State Space Duality Model for Efficient Sequential Recommendation

Sequential recommendation methods are crucial in modern recommender systems for their remarkable capability to understand a user's changing interests based on past interactions. However, a significant challenge faced by current methods…

Information Retrieval · Computer Science 2025-01-17 Haohao Qu , Yifeng Zhang , Liangbo Ning , Wenqi Fan , Qing Li

Decision Mamba Architectures

Recent advancements in imitation learning have been largely fueled by the integration of sequence models, which provide a structured flow of information to effectively mimic task behaviours. Currently, Decision Transformer (DT) and…

Machine Learning · Computer Science 2024-10-18 André Correia , Luís A. Alexandre

MMA: A Momentum Mamba Architecture for Human Activity Recognition with Inertial Sensors

Human activity recognition (HAR) from inertial sensors is essential for ubiquitous computing, mobile health, and ambient intelligence. Conventional deep models such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs),…

Human-Computer Interaction · Computer Science 2025-11-27 Thai-Khanh Nguyen , Uyen Vo , Tan M. Nguyen , Thieu N. Vo , Trung-Hieu Le , Cuong Pham

Scalable Sequential Recommendation under Latency and Memory Constraints

Sequential recommender systems must model long-range user behavior while operating under strict memory and latency constraints. Transformer-based approaches achieve strong accuracy but suffer from quadratic attention complexity, forcing…

Information Retrieval · Computer Science 2026-01-30 Adithya Parthasarathy , Aswathnarayan Muthukrishnan Kirubakaran , Vinoth Punniyamoorthy , Nachiappan Chockalingam , Lokesh Butra , Kabilan Kannan , Abhirup Mazumder , Sumit Saha

MLMA: Towards Multilingual ASR With Mamba-based Architectures

Multilingual automatic speech recognition (ASR) remains a challenging task, especially when balancing performance across high- and low-resource languages. Recent advances in sequence modeling suggest that architectures beyond Transformers…

Computation and Language · Computer Science 2025-10-24 Mohamed Nabih Ali , Daniele Falavigna , Alessio Brutti

Mamba for Scalable and Efficient Personalized Recommendations

In this effort, we propose using the Mamba for handling tabular data in personalized recommendation systems. We present the \textit{FT-Mamba} (Feature Tokenizer\,$+$\,Mamba), a novel hybrid model that replaces Transformer layers with Mamba…

Information Retrieval · Computer Science 2024-09-27 Andrew Starnes , Clayton Webster

Decision MetaMamba: Enhancing Selective SSM in Offline RL with Heterogeneous Sequence Mixing

Mamba-based models have drawn much attention in offline RL. However, their selective mechanism often detrimental when key steps in RL sequences are omitted. To address these issues, we propose a simple yet effective structure, called…

Machine Learning · Computer Science 2026-02-27 Wall Kim , Chaeyoung Song , Hanul Kim

A Survey of Mamba

As one of the most representative DL techniques, Transformer architecture has empowered numerous advanced models, especially the large language models (LLMs) that comprise billions of parameters, becoming a cornerstone in deep learning.…

Machine Learning · Computer Science 2026-04-07 Haohao Qu , Liangbo Ning , Rui An , Wenqi Fan , Tyler Derr , Hui Liu , Xin Xu , Qing Li