BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation

Kaiyuan Li; Rui Xiang; Yong Bai; Yongxiang Tang; Yanhua Cheng; Xialong Liu; Peng Jiang; Kun Gai

BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation

Information Retrieval 2025-04-10 v1

Authors: Kaiyuan Li , Rui Xiang , Yong Bai , Yongxiang Tang , Yanhua Cheng , Xialong Liu , Peng Jiang , Kun Gai

Abstract

Multi-modal sequential recommendation systems leverage auxiliary signals (e.g., text, images) to alleviate data sparsity in user-item interactions. While recent methods exploit large language models to encode modalities into discrete semantic IDs for autoregressive prediction, we identify two critical limitations: (1) Existing approaches adopt fragmented quantization, where modalities are independently mapped to semantic spaces misaligned with behavioral objectives, and (2) Over-reliance on semantic IDs disrupts inter-modal semantic coherence, thereby weakening the expressive power of multi-modal representations for modeling diverse user preferences. To address these challenges, we propose a Behavior-Bind multi-modal Quantization for Sequential Recommendation (BBQRec for short) featuring dual-aligned quantization and semantics-aware sequence modeling. First, our behavior-semantic alignment module disentangles modality-agnostic behavioral patterns from noisy modality-specific features through contrastive codebook learning, ensuring semantic IDs are inherently tied to recommendation tasks. Second, we design a discretized similarity reweighting mechanism that dynamically adjusts self-attention scores using quantized semantic relationships, preserving multi-modal synergies while avoiding invasive modifications to the sequence modeling architecture. Extensive evaluations across four real-world benchmarks demonstrate BBQRec's superiority over the state-of-the-art baselines.

Keywords

recommendation system multimodal learning

Cite

@article{arxiv.2504.06636,
  title  = {BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation},
  author = {Kaiyuan Li and Rui Xiang and Yong Bai and Yongxiang Tang and Yanhua Cheng and Xialong Liu and Peng Jiang and Kun Gai},
  journal= {arXiv preprint arXiv:2504.06636},
  year   = {2025}
}

BBQRec: Behavior-Bind Quantization for Multi-Modal Sequential Recommendation

Abstract

Keywords

Cite

Related papers