Aligning Large Language Models for Controllable Recommendations

Wensheng Lu; Jianxun Lian; Wei Zhang; Guanghua Li; Mingyang Zhou; Hao Liao; Xing Xie

Information Retrieval, Artificial Intelligence | 2024-08-06 (v2)

Abstract

Inspired by the exceptional general intelligence of Large Language Models (LLMs), researchers have begun to explore their use in building the next generation of recommender systems: systems that are conversational, explainable, and controllable. However, existing literature primarily concentrates on integrating domain-specific knowledge into LLMs to enhance accuracy, often neglecting the ability to follow instructions. To address this gap, we first introduce a collection of supervised learning tasks, augmented with labels derived from a conventional recommender model, aimed at explicitly improving LLMs' proficiency in adhering to recommendation-specific instructions. We then develop a reinforcement learning-based alignment procedure to further strengthen LLMs' aptitude in responding to users' intentions and mitigating formatting errors. Through extensive experiments on two real-world datasets, our method markedly advances the capability of LLMs to comply with instructions within recommender systems while sustaining a high level of accuracy.

Keywords

large language model, instruction tuning, language modeling

Cite

@article{arxiv.2403.05063,
  title  = {Aligning Large Language Models for Controllable Recommendations},
  author = {Wensheng Lu and Jianxun Lian and Wei Zhang and Guanghua Li and Mingyang Zhou and Hao Liao and Xing Xie},
  journal= {arXiv preprint arXiv:2403.05063},
  year   = {2024}
}

Comments

14 pages; Accepted by ACL 2024 main conference
