English

USimAgent: Large Language Models for Simulating Search Users

Information Retrieval 2024-10-30 v2 Artificial Intelligence

Abstract

Due to the advantages in the cost-efficiency and reproducibility, user simulation has become a promising solution to the user-centric evaluation of information retrieval systems. Nonetheless, accurately simulating user search behaviors has long been a challenge, because users' actions in search are highly complex and driven by intricate cognitive processes such as learning, reasoning, and planning. Recently, Large Language Models (LLMs) have demonstrated remarked potential in simulating human-level intelligence and have been used in building autonomous agents for various tasks. However, the potential of using LLMs in simulating search behaviors has not yet been fully explored. In this paper, we introduce a LLM-based user search behavior simulator, USimAgent. The proposed simulator can simulate users' querying, clicking, and stopping behaviors during search, and thus, is capable of generating complete search sessions for specific search tasks. Empirical investigation on a real user behavior dataset shows that the proposed simulator outperforms existing methods in query generation and is comparable to traditional methods in predicting user clicks and stopping behaviors. These results not only validate the effectiveness of using LLMs for user simulation but also shed light on the development of a more robust and generic user simulators. The code and data are accessible at https://github.com/Meow-E/USimAgent.

Keywords

Cite

@article{arxiv.2403.09142,
  title  = {USimAgent: Large Language Models for Simulating Search Users},
  author = {Erhan Zhang and Xingzhu Wang and Peiyuan Gong and Yankai Lin and Jiaxin Mao},
  journal= {arXiv preprint arXiv:2403.09142},
  year   = {2024}
}